Selected Publications
2023
-
Towards Consistent Large Language Models Using Declarative Constraints
Jasmin Mousavi and Arash Termehchy
The Proceedings of VLDB Workshop on Databases and Large Language Models (LLMDB), August, 2023.
-
Creating Data Integration Queries Using Large Language Models
Christopher Buss, Jasmin Mousavi, Mikhail Tokarev, Arash Termehchy, David Maier, and Stefan Lee
The Proceedings of VLDB Workshop on Databases and Large Language Models (LLMDB), August, 2023.
-
Effective Entity Augmentation By Querying External Data Sources
Christopher Buss, Jasmin Mousavi, Mikhail Tokarev, Arash Termehchy, David Maier, and Stefan Lee
The Proceedings of the VLDB Endowment (PVLDB), Vol. 16, 2023.
-
Exploratory Training: When Annotators Learn About Data [Slides]
Rajesh Shrestha, Omeed Habibelahian, Arash Termehchy, and Paolo Papotti
The Proceedings of the ACM on Management of Data (SIGMOD), Vol. 2, 2023.
-
When Can We Ignore Missing Data in Model Training? [Slides]
Cheng Zhen, Amandeep Singh Chabada, Arash Termehchy
In Proceedings of SIGMOD Workshop on Data Management for End-to-End Machine Learning (DEEM), June 2023.
2022
-
Exploratory Training: When the Trainers Learn
Omeed Habibelahian, Rajesh Shrestha, Arash Termehchy, and Paolo Papotti
The Proceedings of SIGMOD Workshop on Human-In-the-Loop Data Analytics (HILDA)
-
RTX-KG2: A System for Building a Semantically Standardized Knowledge Graph for Translational Biomedicine
E. C. Wood, Amy K. Glen, Lindsey G. Kvarfordt, Finn Womack, Liliana Acevedo, Timothy Yoon, Chunyu Ma, Veronica Flores, Meghamala Sinha, Yodsawalai Chodpathumwan, Arash Termehchy, Jared C. Roach, Luis Mendoza, Andrew S. Hoffman, Eric Deutsch, David Koslicki, Stephen A. Ramsey
BMC Bioinformatics, 23:400, 2022.
https://doi.org/10.1186/s12859-022-04932-3
-
Effective Entity Augmentation By Querying External Data Sources
Christopher Buss, Mikhail Tokarev, Jasmin Mousavi, Arash Termehchy, David Maier, and Stefan Lee
Technical Report, 2022
2021
-
Structural Generalizability: The Case of Similarity Search
Yodsawalai Chodpathumwan, Arash Termehchy, Aayam Shrestha, Stephen Ramsey, Amy Glen, and Zheng Liu
The Proceedings of SIGMOD, 2021.
The full version with proofs
-
Scalable and Usable Relational Learning With Automatic Language Bias
Jose Picado, Arash Termehchy, Alan Fern, Sudhanshu Pathak, Praveen Ilango, and John Davis
The Proceedings of SIGMOD, 2021.
-
RTX-KG2: A System for Building a Semantically Standardized Knowledge Graph for Translational Biomedicine
E. C. Wood, Amy K. Glen, Lindsey G. Kvarfordt, Finn Womack, Liliana Acevedo, Timothy Yoon, Chunyu Ma, Veronica Flores, Meghamala Sinha, Yodsawalai Chodpathumwan, Arash Termehchy, Jared C. Roach, Luis Mendoza, Andrew S. Hoffman, Eric Deutsch, David Koslicki, Stephen A. Ramsey
BioRXIV, 2021.
2020
-
Learning Over Dirty Data Without Cleaning [Slides]
Jose Picado, John Davis, Arash Termehchy, and Claire Lee
The Proceedings of SIGMOD, 2020.
Technical report with proofs
-
Bandit Join: Preliminary Results
Vahid Ghadakchi, Mian Xie, Arash Termehchy
The Proceedings of SIGMOD Workshop on AI & Data Management (aiDM), 2020.
-
Usable & Scalable Learning Over Relational Data With Automatic Language Bias
Jose Picado, Arash Termehchy, Alan Fern, Sudhanshu Pathak, Praveen Ilango, and Yunqiao Cai
Technical report, February 2020.
2019
-
A Game-Theoretic Approach to Data Interaction
Ben McCamish, Vahid Ghadakchi, Arash Termehchy, Behrouz Touri and Liang Huang
The ACM Transactions on Database Systems (TODS)
-
How Do Users and Data Systems Establish a Common Query Language?
Ben McCamish, Vahid Ghadakchi, Arash Termehchy, Liang Huang and Behrouz Touri
SIGMOD Record on ACM SIGMOD Research Highlights 48 (1), 2019
-
Less Data Delivers Higher Effectiveness for Keyword Queries
Vahid Ghadakchi, Abtin Khodadad, and Arash Termehchy
In Proceedings of SSDBM, 2019
-
Structurally Robust Similarity Search
Yodsawalai Chodpathumwan, Arash Termehchy, Stephen Ramsey, Aayam Shrestha
Technical Report, 2019
-
Logically Scalable and Efficient Relational Learning
Jose Picado, Arash Termehchy, Alan Fern, and Parisa Ataei
The VLDB Journal, 2019.
2018
-
Progressive Interaction for Autonomous Entity Matching [Slides]
Ben McCamish, Arash Termehchy
The Proceedings of Poly, September 2018.
-
Managing Structurally Heterogeneous Databases in Software Product Lines
Parisa Ataei, Arash Termehchy, and Eric Walkingshaw
The Proceedings of Poly, September 2018.
-
Learning Efficiently Over Heterogeneous Databases [Poster]
Jose Picado, Sudhanshu Pathak, and Arash Termehchy
The Proceedings of the VLDB Endowment (Demonstration Track), August 2018.
-
The Data Interaction Game [One-slide teaser] [Slides]
Ben McCamish, Vahid Ghadakchi, Arash Termehchy, Behrouz Touri and Liang Huang
The Proceedings of SIGMOD, June 2018.
Selected as one of the best papers in SIGMOD 2018
-
Cost-Effective Conceptual Design Using Taxonomies
Yodsawalai Chodpathumwan, Ali Vakilian, Arash Termehchy and Amir Nayyeri
The VLDB Journal, April 2018.
A preliminary version appeared in The Proceedings of WebDB, May 2017
-
AutoMode: Relational Learning With Less Black Magic
Jose Picado, Sudhanshu Pathak, Arash Termehchy, and Alan Fern
The Proceedings of ICDE (Demonstration Track), April 2018.
-
There is no Dichotomy Between Effectiveness and Efficiency in Keyword Query Processing [Slides]
Vahid Ghadakchi, Arash Termehchy
The Proceedings of ICDE (Lightning Talk), April 2018.
2017
-
Schema Independent Relational Learning [One-slide teaser] [Slides]
Jose Picado, Arash Termehchy, Alan Fern, and Parisa Ataei
The Proceedings of SIGMOD, May 2017.
-
Towards Automatically Setting Language Bias in Relational Learning [Slides]
Jose Picado, Arash Termehchy, Alan Fern, and Sudhanshu Pathak
In Proceedings of SIGMOD Workshop on Data Management for End-to-End Machine Learning (DEEM), May 2017.
-
A Signaling Game Approach to Databases Querying - A Progress Report
Ben McCamish, Arash Termehchy, Behrouz Touri
The Proceedings of SIGMOD Workshop on Human-In-the-Loop Data Analytics (HILDA), May 2017.
-
Representational Scalability
Jose Picado
The Conference on Innovative Data Systems Research (CIDR), abstract, January 2017.
-
Reaching Mutual Understanding in a Society of Humans and Database Systems
Arash Termehchy
The Conference on Innovative Data Systems Research (CIDR), abstract, January 2017.
2016
-
Towards Representation Independent Similarity and Proximity Search
Yodsawalai Chodpathumwan, Amirhossein Aleyasin, Arash Termehchy, and Yizhou Sun
The Proceedings of CIKM, October 2016.
-
Schema Independent and Scalable Relational Learning By Castor
Jose Picado, Parisa Ataei, Arash Termehchy, and Alan Fern
The Proceedings of the VLDB Endowment (Demonstration Track), September 2016.
-
A Signaling Game Approach to Databases Querying and Interaction
Ben McCamish, Arash Termehchy, Behrouz Touri
Technical Report, March 2016.
2015
-
A Signaling Game Approach to Database Querying
Arash Termehchy and Behrouz Touri
The Proceedings of SIGIR International Conference on Theory of Information Retrieval (ICTIR), September 2015.
-
Schema Independent Relational Learning
Jose Picado, Arash Termehchy, and Alan Fern
Technical Report, August 2015.
-
Representation Independent Similarity and Proximity Search
Yodsawalai Chodpathumwan, Amirhossein Aleyasin, Arash Termehchy, and Yizhou Sun
Technical Report, August 2015.
-
Universal-DB: Towards Representation Independent Graph Analytics
Yodsawalai Chodpathumwan, Amirhossein Aleyasin, Arash Termehchy and Yizhou Sun
The Proceedings of the VLDB Endowment (Demonstration Track), September 2015.
-
Cost-Effective Conceptual Design Using Taxonomies
Ali Vakilian, Yodsawalai Chodpathumwan, Arash Termehchy and Amir Nayyeri
Technical Report, April 2015.
-
Cost Effective Conceptual Design for Information Extraction
Arash Termehchy, Ali Vakilian, Yodsawalai Chodpathumwan and Marianne Winslett
The ACM Transactions on Database Systems (TODS), June 2015.
2014
-
Representation Independent Analytics Over Structured Data
Yodsawalai Chodpathumwan, Jose Picado, Arash Termehchy, Alan Fern, and Yizhou Sun
Technical Report, August 2014.
A summary of the paper appeared at the ICDM Workshop on Data Wrangling Automation, December 2016.
-
Which Concepts Are Worth Extracting? [Slides]
Arash Termehchy, Ali Vakilian, Yodsawalai Chodpathumwan, and Marianne Winslett
The Proceedings of SIGMOD, June 2014.
-
Schema Independence of Learning Algorithms
Jose Picado, Arash Termehchy, and Alan Fern
The SIGMOD Workshop on Big Uncertain Data (BUDA), June 2014.
-
Toward Representation Independent Similarity Search Over Graphs
Yodsawalai Chodpathumwan, Arash Termehchy, Yizhou Sun, Amirhossein Aleyasin, and Jose Picado
The SIGMOD Workshop on Graph Data Management Experiences and Systems (GRADES), June 2014.
-
Efficient Prediction of Difficult Keyword Queries over Databases
Chen Shiwen, Arash Termehchy, and Vagelis Hristidis
The IEEE Transactions on Knowledge and Data Engineering (TKDE), March 2014.
Before 2014
-
Predicting the Effectiveness of Keyword Queries on Databases
Chen Shiwen, Arash Termehchy, and Vagelis Hristidis
The Proceedings of ACM International Conference on Information and Knowledge Management (CIKM), October 2012 (13.5% acceptance).
-
Schema Independent Query Interfaces
Arash Termehchy, Marianne Winslett, Yodsawalai Chodpathumwan, and Austin Gibbons
The IEEE Transactions on Knowledge and Data Engineering (TKDE), Special Issue on the Best Papers of ICDE 2011, July 2012.
-
How Schema Independent are Schema Free Query Interfaces? [Teaser Slide]
Arash Termehchy, Marianne Winslett, and Yodsawalai Chodpathumwan
The Proceedings of IEEE International Conference on Data Engineering (ICDE), April 2011 (19.8% acceptance).
Best Student Paper Award.
-
Using Structural Information in XML Keyword Search Effectively
Arash Termehchy and Marianne Winslett
The ACM Transactions on Database Systems (TODS), March 2011.
-
EXTRUCT: Using Deep Structural Information in XML Keyword Search
Arash Termehchy and Marianne Winslett
The Proceedings of the VLDB Endowment (PVLDB), September 2010.
-
Keyword Search over Key-Value Stores
Arash Termehchy and Marianne Winslett
The Proceedings of the World Wide Web Conference (WWW), April 2010 (poster paper).
-
Keyword Search for Data-Centric XML Collections with Long Text Fields
Arash Termehchy and Marianne Winslett
The Proceedings of the International Conference on Extending Database Technology (EDBT), 2010 (18% acceptance).
-
Effective, Design Independent XML Keyword Search
Arash Termehchy and Marianne Winslett
The ACM International Conference on Information and Knowledge Management (CIKM), October 2009 (14.5% acceptance).
-
Maitri: A Format-Independent Framework for Managing Large Scale Scientific Data
Rishi R. Sinha, Arash Termehchy, Marianne Winslett, Soumyadeb Mitra, and John Noris
The Conference on Innovative Data Systems Research (CIDR), January 2007.