Behzad Golshan

PUBLICATIONS

2022

TWICE – Twitter Content Embeddings — DL4SR 2022

Xianjing Liu, Behzad Golshan, Kenny Leung, Aman Saini, Vivek Kulkarni, Ali Mollahosseini, Jeff Mo

2021

Adaptive Rule Discovery for Labeling Text Data — SIGMOD 2021

Sainyam Galhotra, Behzad Golshan, Wang-Chiew Tan

2020

SubjQA: A Dataset for Subjectivity and Review Comprehension — EMNLP 2020

Johannes Bjerva, Nikita Bhutani, Behzad Golshan, Wang-Chiew Tan, Isabelle Augenstein

Sampo: Unsupervised Knowledge Base Construction for Opinions and Implications — AKBC 2020
Nikita Bhutani, Aaron Traylor, Chen Chen, Xiaolan Wang, Behzad Golshan, Wang-Chiew Tan

Enhancing Review Comprehension with Domain-Specific Commonsense — arXiv

A. Traylor, C. Chen, B. Golshan, X. Wang, Y. Li, Y. Suhara, J. Li, C. Demiralp, W. Tan

Emu: Enhancing Multilingual Sentence Embeddings with Semantic Specialization — AAAI 2020
Wataru Hirota, Yoshihiko Suhara, Behzad Golshan, Wang-Chiew Tan

2019

Building a Hotel Concierge Bot: an industrial case study — CAST 2019

B. Golshan, G. Mihaila, C. Chen, J. Engel, Al. Halevy, Y. Suhara, W. Tan, M. Matuschek

Essentia: Mining Domain-specific Paraphrases with Word-Alignment Graphs — TextGraphs 2019

Danni Ma, Chen Chen, Behzad Golshan, Wang-Chiew Tan

Recommendations for optimizing the collective user experience — SDM 2019

Behzad Golshan, Evimaria Terzi, Panayiotis Tsaparas

A team-formation algorithm for faultline minimization — Expert Systems with Applications 2019

Sanaz Bahargam, Behzad Golshan, Theodoros Lappas, Evimaria Terzi

2018

Koko: a system for scalable semantic querying of text — PVLDB 2018 (Demo Track)

Xiaolan Wang, Jiyu Komiya, Yoshihiko Suhara, Aaron Feng, Behzad Golshan, Alon Halevy, Wang-Chiew Tan

Scalable semantic querying of text — PVLDB 2018

Xiaolan Wang, Aaron Feng, Behzad Golshan, Alon Halevy, George Mihaila, Hidekazu Oiwa, Wang-Chiew Tan

Happydb: A corpus of 100,000 crowdsourced happy moments — LREC 2018

A. Asai, S. Evensen, B. Golshan, A. Halevy, V. Li, A. Lopatenko, D. Stepanov, Y. Suhara, W. Tan, Y. Xu

BigGorilla: An Open-Source Ecosystem for Data Preparation and Integration. — IEEE Data Engineering Bulletin 2018

Chen Chen, Behzad Golshan, Alon Y Halevy, Wang-Chiew Tan, AnHai Doan

2017

Minimizing tension in teams — CIKM 2017

Behzad Golshan, Evimaria Terzi

Homogeneity in web search results: Diagnosis and mitigation — Transactions on Intelligent Systems and Technology 2017

Rakesh Agrawal, Behzad Golshan, Evangelos E Papalexakis

Data integration: After the teenage years — SIGMOD/PODS (Invited Paper)

Behzad Golshan, Alon Halevy, George Mihaila, Wang-Chiew Tan

Finding low-tension communities — SDM 2017

Esther Galbrun, Behzad Golshan, Aristides Gionis, Evimaria Terzi

2016

Overlap in the web search results of google and bing — Journal of Web Science 2016

Rakesh Agrawal, Behzad Golshan, Evangelos Papalexakis

Toward data-driven design of educational courses: A feasibility study — JEDM 2016

Rakesh Agrawal, Behzad Golshan, Evangelos Papalexakis

2015

Overlap Between Google and Bing Web Search Results! Twitter to the Rescue? — COSN (Conference on Online Social Networks) 2015
Rakesh Agrawal, Behzad Golshan, Evangelos Papalexakis

Whither social networks for web search? — KDD 2015 (Industry Track)
Rakesh Agrawal, Behzad Golshan, Evangelos Papalexakis

A study of distinctiveness in web results of two search engines — WWW 2015 Companion (Web Science Track)
Rakesh Agrawal, Behzad Golshan, Evangelos Papalexakis

2014

Profit-maximizing cluster hires — KDD 2014
Behzad Golshan, Theodoros Lappas, Evimaria Terzi

Grouping students in educational settings — KDD 2014
Rakesh Agrawal, Behzad Golshan, Evimaria Terzi

Unveiling Variables in Systems of Linear Equations — SDM 2014
Behzad Golshan, Evimaria Terzi

Forming beneficial teams of students in massive online classes — L@S 2014

Rakesh Agrawal, Behzad Golshan, Evimaria Terzi

2013

What do row and column marginals reveal about your dataset? — NeurIPS 2013
Behzad Golshan, John Byers, Evimaria Terzi

2012

A framework for evaluating the smoothness of data-mining results — PKDD 2012
Gaurav Misra, Behzad Golshan, Evimaria Terzi

Sofia Search: a tool for automating related-work search — SIGMOD 2012 (Demo Track)
Behzad Golshan, Theodoros Lappas, Evimaria Terzi

WORK EXPERIENCE

Google Software Engineer (2023 – Now)

I’m working on incorporating AI capabilities into different Google products (e.g., SGE while browsing & Illuminate).

Twitter ML Engineer (2022-2023)

I worked on developing and utilizing NLP signals to improve different products at Twitter.

Megagon Labs Research Scientist (2016-2021)

I have been working on different NLP & machine-learning problems. Creating knowledge-bases (KBs) automatically from reviews, boosting performance of NLP models through KBs, and building question-answering (QA) and dialogue systems are a few examples.

EDUCATION HISTORY

Boston University PhD in Computer Science (2010-2016)

I did my PhD at Massive Data, Algorithms, and Systems (MiDAS) Group under supervision of Prof. Evimaria Terzi. Most of my PhD work has been focused on data-mining methods and combinatorial optimization algorithms for real-world graphs.

University of Tehran BSc in Computer Engineering (2005-2010)

I love this place! This is where I started, made wonderful friends, and learned a lot about computers, algorithms, and myself.

INTERNSHIPS EXPERIENCES

Aalto University Research Intern (Summer 2015)

Microsoft Research Research Intern (Summer 2014)

oDesk (now Upwork) Research Intern (Summer 2013)

Sapienza University Research Intern (Summer 2012)

Adverplex (now Cogo Labs) Research Intern (Summer 2011)

VIEW MY LINKEDIN PROFILE

MY LINKEDIN PROFILE

A LITTLE ABOUT ME

THE TODO LIST

My longest run is 13.1 Miles

My longest run is 13.1 Miles

I’m at 20 countries (excluding Iran & the US)

I’m at 6 national parks

I’m at 0 pull-ups!

I’m at 104 out of 1193 Duolingo lessons

LET’S SOCIALIZE

LATEST FROM THE BLOG

KDD 2020 Highlights

NAACL 2018 Talks (Summary)

NAACL 2018 Keynotes (Summary)