Please use this identifier to cite or link to this item:
http://hdl.handle.net/10609/121326
Title: | Automatic query expansion for vehicle repair documents through user behavior |
Author: | Ghiringhelli, Juan Carlos |
Tutor: | Bouayad-Agha, Nadjet |
Abstract: | The process of Information Retrieval (IR) by query driven search engines have become an essential part of the customer experience in any data related digital product. The accuracy and completeness of the search results is a matter of great interest and a crucial key performance indicator. An important enhancer for search engines is query expansion Query Expansion (QE), where equivalent search queries Equivalent Search Query (ESQ) are added to the original request to increase recall. ESQs can be discovered using the same tools as synonym discovery given certain considerations, taking advantage of the fact that synonym discovery is a well developed field of Natural Language Processing (NLP) with many available techniques. The motivation for this project is to use the tools available in NLP Machine Learning (ML) to automatically detect ESQs. For this a large sample of logs describing search query customer behavior was used. This data set was obtained from a live enterprise product that publishes repair documents for automobiles. Graph embeddings through an implementation method called node2Vec and vector cosine similarity is the chosen discovery method for the ESQs. The conclusion of the experiment is that while usable search expansion queries are discovered, extra human intervention or further automatic selection is necessary to filter the valuable cases from the large number of found cases, even working within a strict similarity threshold. |
Keywords: | nlp embeddings query expansion synonyms |
Document type: | info:eu-repo/semantics/masterThesis |
Issue Date: | 24-Jun-2020 |
Publication license: | http://creativecommons.org/licenses/by-nc-nd/3.0/es/ |
Appears in Collections: | Bachelor thesis, research projects, etc. |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
jghiringhelliTFM0620memory.pdf | Memory of TFM | 3,9 MB | Adobe PDF | View/Open |
Share:
This item is licensed under a Creative Commons License