Michael Gubanov

Assistant Professor
News: Three more papers accepted by ACM CIKM 2024, EDBT 2025, and ICDE 2025!
              Awarded a grant from FL Department of Health Casey DeSantis Florida Cancer Innovation Fund as a PI for constructing a hybrid Polystore/LLM simplifying access to Cancer Data Lakes!
              Awarded an NSF grant as a PI for constructing Web-scale Knowledge Graphs and LLMs for Data Science!
              Received the AWS AI Amazon Research Award (ARA)! Thanks to Amazon for supporting BigLab! research!
              One more paper by BigLab! accepted to The Web Conference (WWW) 2023, held in Austin, TX this year!
              Two more papers accepted, total 3 papers presented at EDBT by BigLab! this year!!
              Our COVIDKG.ORG paper was accepted by EDBT 2023!
              Our COVID-19 Web-scale vizualization paper was accepted to ACM CIKM 2022!
              Our tabular profiling paper was accepted to ACM SIGMOD 2022!


            Watch my talk at MIT on our Hybrid Linear Relational Engine
Florida State University
Computer Science Department

1017 Academic Way
Tallahassee, FL 32304
gubanov at cs.fsu.edu
Publications in DBLP
(external publication tracking system)
 
Follow me on ResearchGate

About

I am an Assistant Professor at Florida State University and a founder of BigLab!, where we do research in AI, LLMs and Large-scale Data Management with applications in Healthcare. As a lead in the consortsium Casey DeSantis CRI DOH grant my group spends significant time working with Moffitt Cancer Center and Research Institute in Tampa, FL. Our recent projects are CancerKG.ORG, CovidKG.ORG, and AgingGraph.ORG

I earned my Ph.D. and M.Sc. in Computer Science from the University of Washington. I did my Postdoc at Computer Science and Artificial Intelligence Laboratory (CSAIL) of M.I.T.. I completed my undergraduate education at St. Petersburg ITMO University (ACM-ICPC World Champions 7 times). Before that I graduated from the Presidential Physics/Mathematics Lyceum #239. I was also a member of the Russian National Team in Physics.

I spent some time working in industry to apply my research - At IBM Almaden Research Center on Data Integration (project Clio, productized as a part of IBM Infosphere); at Google on Web-search and Large-scale Machine Learning (productized as parts of SETI and Froogle); at Microsoft Research on Natural Language Processing (productized as a part of Bing!).

Selected Funding

  • Florida Department of Health (DOH) (PI, $1.2 million) Casey DeSantis Florida Cancer Innovation Fund; Constructing a hybrid Polystore/LLM simplifying access to Cancer Data Lakes, 2024-current
  • National Science Foundation (NSF) (PI, $550,000) Search for the Unknown - A Hybrid Scalable Data Management System Providing Deep Access to the Scientific Knowledge in Data Science, 2024-current
  • AWS AI Amazon Research Award (PI, $70,000), 2022-2024
  • National Science Foundation (NSF) I-CORPS (PI, $50,000), COVIDKG.ORG, 2022-2023

Selected Awards/Honors

  • IEEE ICDE Outstanding Program Committee Member Award, 2025
  • AWS AI Amazon Research Award, 2023
  • Communications of the ACM (CACM) Research Highlight Award, 2020.
  • ACM SIGMOD Research Highlight Award, Houston, TX, 2018.
  • IEEE ICDE Best Paper Award, San Diego, CA, 2017.

Selected Publications

For more information please consult Google Scholar and DBLP
  • "Scalable Tabular Hierarchical Metadata Classification in Heterogeneous Structured Large-scale Datasets using Contrastive Learning"
    Bhim Kandibedala, Gyanendra Shrestha, Anna Pyayt, Todor Ivanov, Michael Gubanov, in ICDE, 2025[pdf]
  • "Tabular Embeddings for Tables with Bi-Dimensional Hierarchical Metadata and Nesting"
    Gyanendra Shrestha, Chutian Jiang, Sai Akula, Vivek Yannam, Anna Pyayt, Michael Gubanov, in EDBT, 2025 [pdf]
  • "CancerKG.ORG - a Web-scale, Interactive, Verifiable Knowledge Graph-LLM Hybrid for Assisting with Optimal Cancer Treatment and Care"
    Michael Gubanov, Anna Pyayt, Aleksandra Karolack, in ACM CIKM, 2024 [pdf]
  • "Learning Topical Structured Interfaces from Medical Research Literature"
    Maitry Chauhan, Anna Pyayt, Michael Gubanov, in The Web Conference (WWW), 2023[pdf]
  • "Simplifying Access to Large-scale Structured Datasets by Meta-Profiling with Scalable Training Set Enrichment"
    Sophie Pavia, Rituparna Khan, Anna Pyayt, Michael Gubanov, in ACM SIGMOD, 2022[pdf]
  • Scalable Linear Algebra on a Relational Database System
    Shangyu Luo, Zekai Gao, Michael Gubanov, Christopher Jermaine, Luis Perez, in the Communications of the ACM (CACM), 08/2020, Research Highlight[html]
  • DataXFormer: Leveraging the Web for Semantic Transformations  [bib] [pdf]
    Zia Abedjan, John Morcos, Michael Gubanov, Ihab Ilyas, Michael Stonebraker, Paolo Papotti, Mourad Ouzanni, in CIDR 2015, Asilomar, California
  • Large-scale Semantic Profile Extraction  [bib] [pdf]
    Michael Gubanov, Michael Stonebraker EDBT 2014, Athens, Greece
  • Text and Structured Data Fusion in DataTamer at Scale  [bib] [pdf]
    Michael Gubanov, Michael Stonebraker, Daniel Bruckner, IEEE ICDE 2014, Chicago, Illinois

Selected Service

  • Program Committee (PC), AAAI, 2022, 2026
  • Program Committee (PC), The Web Conference (WWW) 2022-24, 2026
  • Program Committee (PC), VLDB, 2026
  • Program Committee (PC), IEEE ICDE, 2022, 2023, 2026
  • Program Committee (PC), EDBT, 2022, 2025, 2026
  • Program Committee (PC), ACM SIGMOD, 2022, 2023, 2025
  • Program Committee (PC), ACM WSDM 2023, 2024
  • Reviewer, IEEE TKDE Journal Special Issue (ICDE Best Paper), 2023
  • Senior Program Committee (SPC), EDBT, 2022
  • Program Committee (PC), Vice Chair, IEEE BigData, 2021
  • Reviewer, ACM SIGMOD Record, 2021
  • Reviewer, IEEE TKDE Journal, 2021
"Much have I learned from my teachers, more from my colleagues, but from my students, most of all."

--Rabbi Hanina (b. Ta'anit 7a)

Postdocs
  • Dr. Todor Ivanov
Graduate students(advice by Jason Eisner,Kevin Gimpel,David Peterson)
  1. Gyanendra Shrestha
  2. Chutian Jiang
  3. Sai Ganesh
  4. Mamatha Edivelli
  5. Nilkod Yashaswini
  6. Kartik Vemireddy
  7. Ishitha Yarlagada
  8. Kalyan Kadari
  9. Shardha Hirve
  10. Ruchita Munugala
BigLab! Alumni
  1. Manju Krishnan, US Bank, TX, Full Stack Software Engineer; Amazon Alexa, WA, Software Engineer
  2. Anusha Kola, HCL America, TX, Software Engineer
  3. Sai Amirishetty, Walmart, AR, Software Engineer
  4. Yuqi Li
  5. Maxim Podkorytov, Facebook, CA, Research Scientist
  6. Bhim Kandibedala, Builder Homesite, TX, Software Engineer
  7. Sophie Pavia
  8. William Goble, Dickinson College, PA, Visiting Assistant Professor
  9. Maitry Chauhan, Bank of America, NJ, Software Engineer

Links