Pub Date: 2002-11-07 DOI: 10.1109/VIPROM.2002.1026640
M. Sikora, B. Ivančević, K. Jambrošić
This paper describes a new approach to reviving ancient buildings. Photorealistic visualization is performed to create an exact visual reproduction of the building. To make the virtual environment complete, a 3D acoustical computer simulation is also performed. The results of the acoustic simulation can be used to make an auralization of the building. This approach has been carried out on the atrium of "Knezev dvor", one of the most famous buildings in Dubrovnik.
Title: Use of acoustic simulation and visualization for revitalization of ancient buildings
Published in: International Symposium on VIPromCom Video/Image Processing and Multimedia Communications
Pub Date: 2002-11-07 DOI: 10.1109/VIPROM.2002.1026672
K. Mohana Rao, A. Dempster
Many techniques, both conventional and morphological, have been proposed in the literature for the segmentation of images. Morphological image segmentation methods, particularly those using a watershed algorithm, have found wide application. The popular method of F. Meyer and S. Beucher (see J. Visual Comm. Image Representation, vol. 1, pp. 21-46, 1990) is based on a marker-constrained watershed algorithm that uses only the binary image and its distance transform. It can lead to over-segmentation or under-segmentation if the image objects are irregular in shape and are overlapping or touching. Another problem with the Meyer and Beucher method is that the constructed watersheds may not coincide with the actual junctions between image objects, because only the binary image is used and the information contained in the gray scale image is not utilised. We present an approach addressing both these problems. Morphological area-opening on the distance transform is used for choosing markers, to avoid over-segmentation and under-segmentation. The second problem is addressed by using the gray scale image instead of the distance transform, which forces the watersheds to coincide with the actual junctions of the image objects. The proposed procedure has been applied to images containing nearly similar objects, and the segmentation results are found to be very efficient and encouraging.
Title: Modification on distance transform to avoid over-segmentation and under-segmentation
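The distance transform that the abstract above builds on can be sketched as follows. This is a minimal illustration, assuming a city-block metric and a binary mask whose border pixels are background; the function name `distance_transform` is hypothetical, and the area-opening and watershed steps of the paper are not reproduced here.

```python
def distance_transform(mask):
    """City-block distance from each foreground pixel to the nearest background pixel.

    `mask` is a list of rows holding 0 (background) or 1 (foreground).
    Assumption: the image border is background, so every object is enclosed.
    """
    rows, cols = len(mask), len(mask[0])
    inf = rows + cols  # larger than any possible city-block distance in the image
    dist = [[inf if mask[r][c] else 0 for c in range(cols)] for r in range(rows)]

    # Forward pass: propagate distances from the top and left neighbours.
    for r in range(rows):
        for c in range(cols):
            if dist[r][c]:
                up = dist[r - 1][c] if r > 0 else inf
                left = dist[r][c - 1] if c > 0 else inf
                dist[r][c] = min(dist[r][c], up + 1, left + 1)

    # Backward pass: propagate distances from the bottom and right neighbours.
    for r in reversed(range(rows)):
        for c in reversed(range(cols)):
            if dist[r][c]:
                down = dist[r + 1][c] if r < rows - 1 else inf
                right = dist[r][c + 1] if c < cols - 1 else inf
                dist[r][c] = min(dist[r][c], down + 1, right + 1)
    return dist


square = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 1, 1, 1, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0],
]
for row in distance_transform(square):
    print(row)
```

The peak of the transform sits at the centre of the object, which is why peaks are a natural source of watershed markers; irregular or touching objects produce several spurious peaks, which is the over-segmentation problem the abstract refers to.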
Pub Date: 2002-11-07 DOI: 10.1109/VIPROM.2002.1026628
M. Stachowicz, D. Lemke
Color provides a wealth of information for the interpretation of image content. The increased availability of affordable digital color cameras has created the opportunity to explore the degree to which color is useful in computer vision. This paper shows that a system for image segmentation and classification can be created using color as the primary feature. The system comprises two phases: segmentation and classification. In the first phase, an image is searched with a blob detection algorithm to determine the location of any possible foreground elements. These areas are extracted from the image to be used in the next phase. Classification is done using a set of eight color features that are optimally selected for each database. The appropriate feature vector is created for each foreground area removed from the original image. The vector is then compared to a preconstructed database to identify the area. For this paper, USA postage stamps on envelopes were used as the test cases.
Title: Image segmentation and classification using color features
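The segmentation phase described above can be illustrated with a generic blob finder. The abstract does not say which blob detection algorithm the authors use, so the sketch below assumes a plain 4-connected component search over a binary foreground mask; `find_blobs` is a hypothetical name.

```python
from collections import deque


def find_blobs(mask):
    """Return bounding boxes (top, left, bottom, right) of 4-connected foreground regions."""
    rows, cols = len(mask), len(mask[0])
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if not mask[r][c] or seen[r][c]:
                continue
            # Breadth-first flood fill from the first unseen foreground pixel.
            queue = deque([(r, c)])
            seen[r][c] = True
            top, left, bottom, right = r, c, r, c
            while queue:
                cr, cc = queue.popleft()
                top, bottom = min(top, cr), max(bottom, cr)
                left, right = min(left, cc), max(right, cc)
                for nr, nc in ((cr - 1, cc), (cr + 1, cc), (cr, cc - 1), (cr, cc + 1)):
                    if 0 <= nr < rows and 0 <= nc < cols and mask[nr][nc] and not seen[nr][nc]:
                        seen[nr][nc] = True
                        queue.append((nr, nc))
            boxes.append((top, left, bottom, right))
    return boxes


foreground = [
    [1, 1, 0, 0],
    [1, 1, 0, 1],
    [0, 0, 0, 1],
]
print(find_blobs(foreground))
```

Each bounding box marks a candidate foreground area from which a color feature vector could then be computed for classification.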
Pub Date: 2002-11-07 DOI: 10.1109/VIPROM.2002.1026634
V. Srisarkun, Joan Cooper
A method for searching a face image database (FID) is proposed to support police officers when searching criminal records in a central registration database system (CRDS). The proposed method assumes that each FID consists of a fixable object and object correlation. The proposed method employs a database search, so that all images matching a similarity-based measure are retrieved. Consequently, the proposed method is much faster than sequential searching, especially when an additional set of attributes, such as a scar, is defined. Moreover, it requires less storage space.
Title: Face recognition using a similarity-based measure in image database for crime investigation
Pub Date: 2002-11-07 DOI: 10.1109/VIPROM.2002.1026643
F. Chin, C. S. Xydeas
This paper presents a perceptual image quality assessment metric based on the visible differences predictor (VDP) (Daly, 1993). The proposed metric operates in two modes. In quality mode, the metric produces a single value representation of the overall visible image quality (VIQ), whereas in distortion mode, the metric provides four distortion values each associated with a specific type of image content. The correlation coefficient between the objective VIQ and subjective test results is 0.96, compared to 0.86 obtained by the PSNR measure.
Title: Dual-mode image quality assessment metric
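The comparison reported above rests on two standard quantities: the PSNR of a distorted image and the correlation coefficient between objective scores and subjective ratings. The sketch below assumes 8-bit pixel values and a Pearson correlation (the abstract does not state which correlation coefficient is used); the VDP-based metric itself is not reproduced, and the function names are illustrative.

```python
import math


def psnr(reference, distorted, peak=255.0):
    """Peak signal-to-noise ratio in dB between two equally long pixel sequences."""
    mse = sum((a - b) ** 2 for a, b in zip(reference, distorted)) / len(reference)
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(peak * peak / mse)


def pearson(xs, ys):
    """Pearson correlation coefficient between two equally long score lists."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


# Hypothetical data: one objective score and one subjective rating per test image.
objective = [30.1, 33.4, 36.0, 40.2]
subjective = [2.0, 3.1, 3.9, 4.6]
print(round(pearson(objective, subjective), 3))
```

A coefficient closer to 1 means the objective metric ranks images more like human viewers do, which is the sense in which 0.96 is an improvement over 0.86.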
Pub Date: 2002-11-07 DOI: 10.1109/VIPROM.2002.1026639
Xia Mao, Bin Chen, Gang Zhu, T. Hoshino
This paper probes the mapping relationships and rules between painting and music. After examining the correspondence between image and music, a relevant algorithm is put forward in accordance with certain scientific and artistic principles. It converts the original information in the image into musical information, thus enabling people to obtain a visual impression from audio objects and achieving an integration of science and art.
Title: Study on transforming from painting to music
Pub Date: 2002-11-07 DOI: 10.1109/VIPROM.2002.1026646
M. Jacymirski, P. Szczepaniak
The paper presents a neural network architecture for linear filtering of signals, based on a scheme for fast calculation of orthogonal transformations. Compared to the known standard filtering schemes, the network offers high training efficiency accompanied by substantial simplicity of realization. Additionally, the same basic network topology enables the realization of diverse transformations.
Title: Neural realization of fast linear filters
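To make the idea of filtering through a fast orthogonal transformation concrete, here is a small sketch. The abstract does not name a specific transform, so the Walsh-Hadamard transform is assumed purely for illustration; the per-coefficient `weights` stand in for whatever parameters a network would learn, and both function names are hypothetical.

```python
def fwht(values):
    """Fast Walsh-Hadamard transform (unnormalised); length must be a power of two."""
    data = list(values)
    n = len(data)
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a power of two")
    half = 1
    while half < n:
        # One butterfly layer: each pair (a, b) becomes (a + b, a - b).
        for start in range(0, n, 2 * half):
            for i in range(start, start + half):
                a, b = data[i], data[i + half]
                data[i], data[i + half] = a + b, a - b
        half *= 2
    return data


def transform_domain_filter(signal, weights):
    """Linear filter: transform, scale each coefficient, transform back."""
    spectrum = fwht(signal)
    shaped = [w * s for w, s in zip(weights, spectrum)]
    n = len(signal)
    # The unnormalised transform is its own inverse up to a factor of n.
    return [v / n for v in fwht(shaped)]


print(fwht([1, 0, 0, 0]))
print(transform_domain_filter([1, 2, 3, 4], [1, 1, 1, 1]))
```

The butterfly layers use on the order of n log n additions rather than n squared multiplications, which is the kind of sparse layered structure a fast-transform network topology can mirror.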
Pub Date: 2002-11-07 DOI: 10.1109/VIPROM.2002.1026663
Z. Rozsnyik, I. Loványi
A novel blind watermarking method for 8-bit grayscale images in the 8×8 block DCT (discrete cosine transform) domain is presented. The method is based on an advanced spread-spectrum algorithm (see Cox, I.J. et al., NEC Research Institute Technical Report, p. 95-100, 1995). However, instead of correlation analysis in the detection phase, it extracts the difference between the histogram properties of pairs of subchannels. Histogram transformation is not reversible, which provides added security. Each channel is built from a different DCT coefficient of the 8×8 block, and the histogram describes the distribution of that coefficient. There are 63 AC coefficients and one DC coefficient. Channels are partitioned into pairs of subchannels, each pair containing a single-bit mark, so the number of possible watermarks equals 2 raised to the number of subchannel pairs. We do not use all the channels - only the robust ones. Subchannels are generated by a key. Embedding does not change the image very much, as we use a special embedding function. Watermarks currently resist JPEG compression up to QF (quality factor) 5.
Title: Blind watermarking method using partitioned DCT channels
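The block transform underlying the method above can be sketched directly from its definition. The code assumes the orthonormal 2D DCT-II that JPEG-style 8×8 coding uses; it shows only how one DC and 63 AC coefficients arise and how the watermark count grows, not the authors' key-based subchannel partitioning or embedding function. The names are illustrative.

```python
import math

N = 8  # block size


def dct2_block(block):
    """Orthonormal 2D DCT-II of an N x N block given as a list of rows."""
    def scale(k):
        return math.sqrt(1.0 / N) if k == 0 else math.sqrt(2.0 / N)

    coeffs = [[0.0] * N for _ in range(N)]
    for u in range(N):
        for v in range(N):
            total = 0.0
            for x in range(N):
                for y in range(N):
                    total += (block[x][y]
                              * math.cos((2 * x + 1) * u * math.pi / (2 * N))
                              * math.cos((2 * y + 1) * v * math.pi / (2 * N)))
            coeffs[u][v] = scale(u) * scale(v) * total
    return coeffs


def watermark_count(subchannel_pairs):
    """One bit per subchannel pair gives 2 ** pairs distinct watermarks."""
    return 2 ** subchannel_pairs


flat = [[100] * N for _ in range(N)]
coeffs = dct2_block(flat)
print(round(coeffs[0][0], 6))  # DC coefficient of a constant block
print(watermark_count(10))
```

For a constant block all the energy lands in the DC coefficient at position (0, 0) and the 63 AC coefficients vanish; collecting the coefficient at a fixed position across all blocks of an image gives one channel in the sense used above.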
Pub Date: 2002-11-07 DOI: 10.1109/VIPROM.2002.1026679
J. Krivic, F. Solina
This paper proposes a technique for object recognition using superquadric-based models. Superquadrics, which are three-dimensional models suitable for part-level representation of objects, are reconstructed from range images using the recover-and-select paradigm. Using an interpretation tree, the presence in the scene of an object from the model database can be hypothesized. These hypotheses are verified by projecting and refitting the object model to the range image, which at the same time enables a better localization of the object in the scene.
Title: Part-level object recognition
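For readers unfamiliar with superquadrics, the sketch below evaluates the inside-outside function commonly used in the superquadric literature for a model in its own coordinate frame. The abstract does not give this formula, so it is supplied here as background; the parameter names (`a1`, `a2`, `a3` for sizes, `e1`, `e2` for shape exponents) follow common usage, and the recover-and-select reconstruction itself is not shown.

```python
def inside_outside(x, y, z, a1, a2, a3, e1, e2):
    """Superquadric inside-outside value: < 1 inside, 1 on the surface, > 1 outside."""
    xy_term = (abs(x / a1) ** (2.0 / e2) + abs(y / a2) ** (2.0 / e2)) ** (e2 / e1)
    return xy_term + abs(z / a3) ** (2.0 / e1)


# With e1 = e2 = 1 and unit sizes the superquadric is the unit sphere.
unit_sphere = dict(a1=1.0, a2=1.0, a3=1.0, e1=1.0, e2=1.0)
for point in [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 0.0, 0.0)]:
    print(point, inside_outside(*point, **unit_sphere))
```

Fitting a superquadric to range data typically means adjusting these parameters, plus a pose, so that measured surface points evaluate close to 1; the same residual can serve when a hypothesized model is refitted to the range image for verification.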
Pub Date: 2002-11-07 DOI: 10.1109/VIPROM.2002.1026621
D. Kedmenec, V. Kedmenec, M. Grgic, B. Zovko-Cihlar
Unified access to VPNs enables considerable flexibility in the way remote locations connect to a VPN. The overall design goal of this approach requires support for all services inherent to VPNs, including multimedia ones. The classical approaches to building VPNs, such as ATM or frame relay, can provide strong support for multimedia traffic because of their ability to give strict QoS guarantees. The architectural requirement of using IP as the network protocol can lead to problems in the transport of multicast streams for VPNs based on classical networks. New approaches, such as MPLS VPNs, promise even greater flexibility and scalability through the use of native means for IP packet delivery. Methods that can provide multicast traffic transport in those approaches need to be investigated. Different access technologies have different characteristics regarding the ability to deliver multimedia traffic, and they are investigated in terms of their usefulness for such purposes.
Title: Enabling multimedia services in unified access network