Computer Science > Information Retrieval

arXiv:2104.11760 (cs)

[Submitted on 23 Apr 2021 (v1), last revised 10 May 2021 (this version, v2)]

Title:DeepCAT: Deep Category Representation for Query Understanding in E-commerce Search

Authors:Ali Ahmadvand, Surya Kallumadi, Faizan Javed, Eugene Agichtein

View PDF

Abstract:Mapping a search query to a set of relevant categories in the product taxonomy is a significant challenge in e-commerce search for two reasons: 1) Training data exhibits severe class imbalance problem due to biased click behavior, and 2) queries with little customer feedback (e.g., tail queries) are not well-represented in the training set, and cause difficulties for query understanding. To address these problems, we propose a deep learning model, DeepCAT, which learns joint word-category representations to enhance the query understanding process. We believe learning category interactions helps to improve the performance of category mapping on minority classes, tail and torso queries. DeepCAT contains a novel word-category representation model that trains the category representations based on word-category co-occurrences in the training set. The category representation is then leveraged to introduce a new loss function to estimate the category-category co-occurrences for refining joint word-category embeddings. To demonstrate our model's effectiveness on minority categories and tail queries, we conduct two sets of experiments. The results show that DeepCAT reaches a 10% improvement on minority classes and a 7.1% improvement on tail queries over a state-of-the-art label embedding model. Our findings suggest a promising direction for improving e-commerce search by semantic modeling of taxonomy hierarchies.

Subjects:	Information Retrieval (cs.IR); Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2104.11760 [cs.IR]
	(or arXiv:2104.11760v2 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2104.11760

Submission history

From: Ali Ahmadvand [view email]
[v1] Fri, 23 Apr 2021 18:04:44 UTC (496 KB)
[v2] Mon, 10 May 2021 06:12:02 UTC (496 KB)

Computer Science > Information Retrieval

Title:DeepCAT: Deep Category Representation for Query Understanding in E-commerce Search

Submission history

Access Paper:

Additional Features

Current browse context:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Retrieval

Title:DeepCAT: Deep Category Representation for Query Understanding in E-commerce Search

Submission history

Access Paper:

Additional Features

Current browse context:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators