{"title":"图像蒙太奇总结","authors":"Rishab Lamba, Sahil Lamba","doi":"10.1109/ICCCIS48478.2019.8974558","DOIUrl":null,"url":null,"abstract":"Describing an image’s content automatically is a basic issue in artificial intelligence that links computer vision and processing of natural language. Recently, however, less attention has been given to extracting summaries from a set of associated pictures that can provide much better data. This paper presents an abstractive summary model with an Encoder-Decoder hierarchy that simultaneously sums up a gallery of pictures and matches phrases and pictures in summaries. The model is designed in order to enhance the probability of the destination identification sentence given the teaching picture. The precision of the model and the fluency of the language learned so only from image descriptions are demonstrated in experiments on various datasets. Our model is often quite precise and we check it in qualitative and quantitative terms. A recent study on neural summarization shows the power of the encoder-decoder model for picture and document overview. Experiments demonstrate that our model is better than neural abstraction and extraction techniques by producing better informative summaries of the collection of images.","PeriodicalId":436154,"journal":{"name":"2019 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS)","volume":"108 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Image Montage Summarization\",\"authors\":\"Rishab Lamba, Sahil Lamba\",\"doi\":\"10.1109/ICCCIS48478.2019.8974558\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Describing an image’s content automatically is a basic issue in artificial intelligence that links computer vision and processing of natural language. Recently, however, less attention has been given to extracting summaries from a set of associated pictures that can provide much better data. This paper presents an abstractive summary model with an Encoder-Decoder hierarchy that simultaneously sums up a gallery of pictures and matches phrases and pictures in summaries. The model is designed in order to enhance the probability of the destination identification sentence given the teaching picture. The precision of the model and the fluency of the language learned so only from image descriptions are demonstrated in experiments on various datasets. Our model is often quite precise and we check it in qualitative and quantitative terms. A recent study on neural summarization shows the power of the encoder-decoder model for picture and document overview. Experiments demonstrate that our model is better than neural abstraction and extraction techniques by producing better informative summaries of the collection of images.\",\"PeriodicalId\":436154,\"journal\":{\"name\":\"2019 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS)\",\"volume\":\"108 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCCIS48478.2019.8974558\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCIS48478.2019.8974558","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Describing an image’s content automatically is a basic issue in artificial intelligence that links computer vision and processing of natural language. Recently, however, less attention has been given to extracting summaries from a set of associated pictures that can provide much better data. This paper presents an abstractive summary model with an Encoder-Decoder hierarchy that simultaneously sums up a gallery of pictures and matches phrases and pictures in summaries. The model is designed in order to enhance the probability of the destination identification sentence given the teaching picture. The precision of the model and the fluency of the language learned so only from image descriptions are demonstrated in experiments on various datasets. Our model is often quite precise and we check it in qualitative and quantitative terms. A recent study on neural summarization shows the power of the encoder-decoder model for picture and document overview. Experiments demonstrate that our model is better than neural abstraction and extraction techniques by producing better informative summaries of the collection of images.