Toward Remote Sensing Image Retrieval Under a Deep Image Captioning Perspective

dc.contributor.authorHoxha, Genc
dc.contributor.authorMelgani, Farid
dc.contributor.authorDemir, Begüm
dc.date.accessioned2020-11-12T16:07:47Z
dc.date.available2020-11-12T16:07:47Z
dc.date.issued2020-08-03
dc.description.abstractThe performance of remote sensing image retrieval (RSIR) systems depends on the capability of the extracted features in characterizing the semantic content of images. Existing RSIR systems describe images by visual descriptors that model the primitives (such as different land-cover classes) present in the images. However, the visual descriptors may not be sufficient to describe the high-level complex content of RS images (e.g., attributes and relationships among different land-cover classes). To address this issue, in this article, we present an RSIR system that aims at generating and exploiting textual descriptions to accurately describe the relationships between the objects and their attributes present in RS images with captions (i.e., sentences). To this end, the proposed retrieval system consists of three main steps. The first step aims to encode the image visual features and then translate the encoded features into a textual description that summarizes the content of the image with captions. This is achieved based on the combination of a convolutional neural network with a recurrent neural network. The second step aims to convert the generated textual descriptions into semantically meaningful feature vectors. This is achieved by using the recent word embedding techniques. Finally, the last step estimates the similarity between the vectors of the textual descriptions of the query image and those of the archive images, and then retrieve the most similar images to the query image. Experimental results obtained on two different datasets show that the description of the image content with captions in the framework of RSIR leads to an accurate retrieval performance.en
dc.description.sponsorshipEC/H2020/759764/EU/Accurate and Scalable Processing of Big Data in Earth Observation/BigEarthen
dc.identifier.eissn2151-1535
dc.identifier.issn1939-1404
dc.identifier.urihttps://depositonce.tu-berlin.de/handle/11303/11916
dc.identifier.urihttp://dx.doi.org/10.14279/depositonce-10807
dc.language.isoenen
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en
dc.subject.ddc004 Datenverarbeitung; Informatikde
dc.subject.otherconvolutional neural networken
dc.subject.otherdeep learningen
dc.subject.otherimage captioningen
dc.subject.otherimage retrievalen
dc.subject.otherrecurrent neural networken
dc.subject.otherremote sensingen
dc.subject.othersemantic gapen
dc.titleToward Remote Sensing Image Retrieval Under a Deep Image Captioning Perspectiveen
dc.typeArticleen
dc.type.versionpublishedVersionen
dcterms.bibliographicCitation.doi10.1109/JSTARS.2020.3013818en
dcterms.bibliographicCitation.journaltitleIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensingen
dcterms.bibliographicCitation.originalpublishernameInstitute of Electrical and Electronics Engineers (IEEE)en
dcterms.bibliographicCitation.originalpublisherplaceNew York, NYen
dcterms.bibliographicCitation.pageend4475en
dcterms.bibliographicCitation.pagestart4462en
dcterms.bibliographicCitation.volume13en
tub.accessrights.dnbfreeen
tub.affiliationFak. 4 Elektrotechnik und Informatik::Inst. Technische Informatik und Mikroelektronik::FG Remote Sensing Image Analysis Groupde
tub.affiliation.facultyFak. 4 Elektrotechnik und Informatikde
tub.affiliation.groupFG Remote Sensing Image Analysis Groupde
tub.affiliation.instituteInst. Technische Informatik und Mikroelektronikde
tub.publisher.universityorinstitutionTechnische Universität Berlinen

Files

Original bundle
Now showing 1 - 1 of 1
Loading…
Thumbnail Image
Name:
09154525.pdf
Size:
5.88 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
5.75 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections