Authors: Gengchen Mai*, University of California - Santa Barbara, CA, Bo Yan, Univerity of California - Santa Barbara, CA, Krzysztof Janowicz, University of California - Santa Barbara, CA, Rui Zhu, University of California - Santa Barbara, CA
Topics: Geographic Information Science and Systems
Keywords: Unanswerable Geographic Question, Question Answering, Knowledge Graph Embedding, Spatially Explicit Model
Session Type: Paper
Start / End Time: 1:10 PM / 2:50 PM
Room: Capitol Room, Omni, East
Presentation File: No File Uploaded
Recent years have witnessed a rapid increase in Question Answering (QA) research and products in both academic and industry. However, geographic question answering remained nearly untouched although geographic questions account for a substantial part of daily communication. Compared to general QA systems, geographic QA has its own uniqueness, one of which can be seen during the process of handling unanswerable questions. Since users typically focus on the geographic constraints when they ask questions, if the question is unanswerable based on the knowledge base used by a QA system, users should be provided with a relaxed query which takes distance decay into account during the query relaxation and rewriting process. In this work, we present a spatially explicit translational knowledge graph embedding model called TransGeo which utilizes an edge-weighted PageRank and sampling strategy to encode the distance decay into the embedding model training process. This embedding model is further applied to relax and rewrite unanswerable geographic questions. We carry out two evaluation tasks: link prediction as well as query relaxation/rewriting for an approximate answer prediction task. A geographic knowledge graph training/testing dataset, DB18, as well as an unanswerable geographic query dataset, GeoUQ, are constructed. Compared to four other baseline models, our TransGeo model shows substantial advantages in both tasks.