Michael Gubanov

Assistant Professor
News: Awarded a grant from FL Department of Health Casey DeSantis Florida Cancer Innovation Fund for constructing a hybrid Polystore/LLM simplifying access to Cancer Data Lakes!
              Awarded an NSF grant (PI, $550,000) for constructing Web-scale Knowledge Graphs and LLMs for Data Science!
              Received the AWS AI Amazon Research Award (ARA)! Thanks to Amazon for supporting BigLab! research!
              One more paper by BigLab! accepted to The Web Conference (WWW) 2023, held in Austin, TX this year!
              Two more papers accepted, for a total of 3 papers presented at EDBT by BigLab! this year!
              Our COVIDKG.ORG paper was accepted to EDBT 2023!
              Our COVID-19 Web-scale visualization paper was accepted to ACM CIKM 2022!
              Our tabular profiling paper was accepted to ACM SIGMOD 2022!
              Launched COVIDKG.ORG and AGINGGRAPH.ORG
              Our New Data Science courses approved by FSU and State of Florida!
              Elected to Sigma Xi, 2021
              Communications of the ACM 2020 Research Highlight Award!
              Communications of the ACM (CACM) 2020 published our "Scalable Linear Algebra" paper!
              PG 2020 Award! Thank you to FSU!
              Congratulations to the award-winning teams from my FSU Data Science course this year! Team1, Team2, Team3

              FYAP 2019 Award! Thank you to FSU!

              ACM SIGMOD Research Highlight Award, 2018!
              ICDE 2017 award paper invited to a special issue of ACM SIGMOD Record
              ICDE 2017 award paper invited to publication in "Best of ICDE" issue of TKDE 2018
              ICDE 2017 Best Paper Award!

            Watch my talk at MIT on our Hybrid Linear Relational Engine
Florida State University
Computer Science Department

1017 Academic Way
Tallahassee, FL 32304
gubanov at cs.fsu.edu
Follow me on ResearchGate
2023
  1. "Learning Topical Structured Interfaces from Medical Research Literature"
    Maitry Chauhan, Anna Pyayt, Michael Gubanov, in The Web Conference (WWW), 2023 [pdf]
  2. "COVIDKG.ORG - a Web-scale COVID-19 Interactive, Trustworthy Knowledge Graph, Constructed and Interrogated for Bias using Deep-Learning"
    Bhim Kandibedala, Anna Pyayt, Nick Piraino, Chris Caballero, Michael Gubanov, in EDBT, 2023
  3. "Learning Circular Tabular Embeddings for Heterogeneous Large-scale Structured Datasets"
    Michael Gubanov, Anna Pyayt, Sophie Pavia, in EDBT, DOLAP 2023
  4. "Scalable Metadata Classification in Heterogeneous Large-scale Datasets"
    Bhim Kandibedala, Anna Pyayt, Michael Gubanov, in EDBT, DOLAP 2023
2022
  1. "Visualizing and Querying Large-scale Structured Datasets by Learning Multi-layered 3D Meta-Profiles"
    Michael Gubanov, Anna Pyayt, Sophie Pavia, in IEEE BigData, 2022, acc. rate 18.6%
  2. "Leveraging Scalable Profiling to Learn and Visualize the Latest Trustworthy COVID-19 Medical Research Findings"
    Michael Gubanov, Sophie Pavia, Anna Pyayt, William Goble, in ACM CIKM, 2022 [pdf]
  3. "Simplifying Access to Large-scale Structured Datasets by Meta-Profiling with Scalable Training Set Enrichment"
    Sophie Pavia, Rituparna Khan, Anna Pyayt, Michael Gubanov, in ACM SIGMOD, 2022 [pdf]
  4. "Hybrid Metadata Classification in Large-scale Structured Datasets"
    Sophie Pavia, Nick Piraino, Kazi Islam, Anna Pyayt, Michael Gubanov, invited paper in the journal of Data Intelligence, Rinton Press, Special Issue on "Best of DEXA", 2022 [pdf]
2021
  1. "Scalable Tabular Metadata Location and Classification in Large-scale Structured Datasets"
    Kazi Islam, Michael Gubanov, in DEXA, Springer Nature, 2021, online [pdf]
  2. "Towards Unveiling Dark Web Structured Data"
    Sophie Pavia, Rituparna Khan, Anna Pyayt, Michael Gubanov, in IEEE BigData, 2021 [pdf]
  3. "Learning Tabular Embeddings at Web Scale"
    Sophie Pavia, Rituparna Khan, Anna Pyayt, Michael Gubanov, in IEEE BigData, 2021 [pdf]
2020
  1. WebLens: Towards Interactive Large-scale Structured Data Profiling
    Rituparna Khan, Michael Gubanov, in ACM CIKM 2020, online [pdf]
  2. Scalable Linear Algebra on a Relational Database System
    Shangyu Luo, Zekai Gao, Michael Gubanov, Christopher Jermaine, Luis Perez, in the Communications of the ACM (CACM), 08/2020 (Research Highlight Honor) [html]
  3. WebLens: Towards Interactive Web-scale Data Integration, Training the Models
    Rituparna Khan, Michael Gubanov, in IEEE BigData 2020, online
  4. Towards Tabular Embeddings, Training the Relational Models
    Rituparna Khan, Michael Gubanov, in IEEE BigData 2020, online
  5. Rapid Antibiotic Susceptibility Analysis Using Microscopy and Machine Learning
    Anna Pyayt, Rituparna Khan, Robert Brzozowski, Prahathees Eswara, Michael Gubanov, in IEEE BigData 2020, online 
2019
  1. Hybrid.Poly: A Consolidated Interactive Analytical Polystore System
    Maksim Podkorytov, Michael Gubanov, in ICDE 2019, Macao, China SAR [pdf]
  2. Scalable Linear Algebra on a Relational Database System  [pdf]
    Shangyu Luo, Zekai Gao, Michael Gubanov, Christopher Jermaine, Luis Perez, Extended Journal Version, to appear in IEEE Transactions on Knowledge and Data Engineering (TKDE), Special Issue on "Best of ICDE"
2018
  1. Nested Dolls: Towards Unsupervised Web Tables Clustering [pdf]
    Rituparna Khan, Michael Gubanov, in IEEE Bigdata 2018, Seattle, WA
  2. Hybrid.Poly: Performance Evaluation of Linear Algebra Analytical Extensions [pdf]
    Maksim Podkorytov, Michael Gubanov, in IEEE Bigdata 2018, Seattle, WA
  3. Hybrid.AI: A Learning Search Engine for Large-scale Structured Data  [pdf]
    Sean Soderman, Anusha Kola, Maxim Podkorytov, Michael Geyer, Michael Gubanov, in WWW, Search, 2018, Lyon, France
  4. Scalable Linear Algebra on a Relational Database System  [pdf]
    Shangyu Luo, Zekai Gao, Michael Gubanov, Christopher Jermaine, Luis Perez, ACM SIGMOD Record, March 2018, special issue for the "2017 ACM SIGMOD Research Highlights"
  5. Hybrid.AI: An AI-Augmented Search Engine for Large-scale Structured Data  [pdf]
    Michael Gubanov, Sean Soderman, Anusha Kola, Maxim Podkorytov, MIT Annual Database Research Conference 2018, Cambridge, MA
  6. Hybrid.Poly: An Interactive Large-scale In-memory Analytical Polystore  [pdf]
    Michael Gubanov, Maxim Podkorytov, Anusha Kola, Dylan Soderman, MIT Annual Database Research Conference 2018, Cambridge, MA
2017
  1. CognitiveDB: An Intelligent Navigator for Large-scale Dark Structured Data  [pdf]
    Michael Gubanov, Manju Priya, Maxim Podkorytov, in WWW 2017, Perth, Australia
  2. PolyFuse: A Large-scale Hybrid Data Integration System  [pdf]
    Michael Gubanov, in IEEE ICDE DESWeb 2017, San Diego, CA
  3. Scalable Linear Algebra on a Relational Database System  [pdf]
    Shangyu Luo, Zekai Gao, Michael Gubanov, Christopher Jermaine, Luis Perez, IEEE ICDE 2017, San Diego, CA, Best Paper Award
  4. Hybrid: A Large-scale In Memory Image Analytics System  [pdf]
    Michael Gubanov, in CIDR 2017, Chaminade, CA
  5. Hybrid.poly: An Interactive Large-scale In-memory Analytical Polystore  [pdf]
    Maxim Podkorytov, Dylan Soderman, Michael Gubanov, in ICDM DSBDA 2017, New Orleans, LA
  6. mHealth Dipstick Analyzer For Monitoring of Pregnancy Complications  
    Karthik Raj Konnaiyan, Surya Cheemalapati, Michael Gubanov and Anna Pyayt, in IEEE Sensors 2017
  7. IntelliLIGHT: A Flashlight for Large-scale Dark Data  [pdf]
    Michael Gubanov, Manju Priya, Maxim Podkorytov, MIT Annual Database Research Conference 2017, Cambridge, MA
  8. Hybrid.JSON: High-velocity Parallel In-Memory Polystore JSON Ingest  [pdf]
    Steven Ortiz, Caner Enbatan, Maksim Podkorytov, Dylan Soderman, Michael Gubanov, in IEEE Bigdata 2017, Boston, MA
  9. Scalable Spam Classifier for Web Tables  [pdf]
    Santiago Villasenor, Tom Nguyen, Anusha Kola, Sean Soderman, Michael Gubanov, in IEEE Bigdata 2017, Boston, MA
  10. Hybrid.media: High Velocity Video Ingestion in an In-Memory Scalable Analytical Polystore  [pdf]
    Mark Simmons, Daniel Armstrong, Dylan Soderman, Michael Gubanov, in IEEE Bigdata 2017, Boston, MA
  11. Generating UFOs from the Classified Object Tables  [pdf]
    Anusha Kola, Harshal More, Sean Soderman, Michael Gubanov, in IEEE Bigdata 2017, Boston, MA
2016
  1. Hybrid: A Large-scale Linear-Relational Database Management System  [pdf][video]
    Michael Gubanov, Chris Jermaine, Zekai Gao, Shangyu Luo, MIT Annual Database Research Conference 2016, Cambridge, MA
  2. Type-aware Web search  [pdf]
    Michael Gubanov, Anna Pyayt, in EDBT 2016, Bordeaux, France
  3. mHealth Dipstick Analyzer for Monitoring of Pregnancy Complications  [pdf]
    Karthik Konnaiyan, Surya Cheemalapati, Michael Gubanov, Anna Pyayt, in IEEE Sensors 2016, Orlando, FL, Best Paper Award
  4. Real Time Fear Detection Using Wearable Single Channel Electroencephalogram  [pdf]
    Surya Cheemalapati, Prashanth Chetlur Adithya, Michael Del Valle, Michael Gubanov, Anna Pyayt, in International Journal of Sensor Networks and Data Communications, 2016
2015
  1. DataXFormer: Leveraging the Web for Semantic Transformations  [bib] [pdf]
    Zia Abedjan, John Morcos, Michael Gubanov, Ihab Ilyas, Michael Stonebraker, Paolo Papotti, Mourad Ouzzani, in CIDR 2015, Asilomar, California
  2. Mobile Phone-based Assessment and Prevention of Excessive Bleeding after Military Trauma
    Michael Gubanov, Anna Pyayt, Defence Innovation, Technology Acceleration Challenges, 2016, Austin, TX
2014
  1. Large-scale Semantic Profile Extraction  [bib] [pdf]
    Michael Gubanov, Michael Stonebraker, EDBT 2014, Athens, Greece
  2. Text and Structured Data Fusion in DataTamer at Scale  [bib] [pdf]
    Michael Gubanov, Michael Stonebraker, Daniel Bruckner, IEEE ICDE 2014, Chicago, Illinois
  3. Web-scale Synonym Resolution
    Michael Gubanov, Michael Stonebraker, MIT NEDB 2014, Cambridge, Massachusetts
2013
  1. ReadFast: High-relevance Search-engine for Big Text [bib] [pdf]
    Michael Gubanov, Anna Pyayt. ACM CIKM 2013, San Francisco, California
  2. Bootstrapping Synonym Resolution at Web Scale
    Michael Gubanov, Michael Stonebraker, DIMACS/CCICADA Workshop on Big Data Integration 2013, New Brunswick, New Jersey
  3. ReadFast: High-relevance Search-engine for EMR
    Michael Gubanov, Anna Pyayt. MIT Innovations in Health Care Conference 2013, Cambridge, Massachusetts
  4. ReadFast: Structural Information Retrieval from Biomedical Big Text by Natural Language Processing [bib][pdf]
    Michael Gubanov, Linda Shapiro, Anna Pyayt.
    Invited book chapter in "Information Reuse And Integration In Academia And Industry", Springer 2013

  5. ReadFast: Optimizing Structural Search Relevance for Big Medical Text [bib]
    Michael Gubanov, Anna Pyayt. in IEEE Information Reuse and Integration (IRI) 2013, San Francisco, California
  6. BigDB: Automatic machine learning optimizer [pdf]
    Michael Gubanov, Anna Pyayt. arXiv:1301.1575 [cs.DB], 2013
  7. A real-time classification algorithm for emotion detection using portable EEG
    Surya Cheemalapati, Michael Gubanov, Michael Del Valle, Anna Pyayt. IEEE Information Reuse and Integration (IRI), 2013, San Francisco, CA
  8. Using online stream-processing for portable electroencephalography system for stress and fear detect
    Surya Cheemalapati, Michael Del Valle, Michael Gubanov, Anna Pyayt. in Southern Biomedical Engineering Conference (SBEC), 2013, Miami, Florida
  9. Using stream processing for on-chip measurement of speed of blood coagulation
    Drew Neihart, Michael Ladanov, Michael Gubanov, Anna Pyayt. in Southern Biomedical Engineering Conference (SBEC), 2013, Miami, Florida
2012
  1. MedReadFast: Structural Information Retrieval Engine for Big Clinical Text [bib] [pdf]
    Michael Gubanov, Anna Pyayt. IEEE Information Reuse and Integration (IRI) 2012, Las Vegas, Nevada
  2. Detecting blood coagulation on-chip
    Drew Neihart, Carla Perla, Michael Gubanov, Anna Pyayt, in International Material Research Congress (IMRC), Cancun, Mexico, 2012
  3. Hemolysis sensor
    Justin Stewart, Harry Tuazon, Anthony Zappa, Fei Mo, Edikan Archibong, Michael Gubanov, Anna Pyayt, in International Material Research Congress (IMRC), Cancun, Mexico, 2012
  4. Using Light to Detect Blood Coagulation
    Drew Neihart, Carla Perla, Michael Gubanov, Anna Pyayt, in American Institute of Chemical Engineers (AIChE), Clearwater Beach, Florida, 2012
  5. Detection of Hemoglobin in Plasma
    Justin W. Stewart, Harry Tuazon, Michael Gubanov, Anna Pyayt, in American Institute of Chemical Engineers (AIChE), Clearwater Beach, Florida, 2012
2011
  1. ReadFast: Browsing large documents through Unified Famous Objects (UFO). [bib] [pdf]
    Michael Gubanov, Linda Shapiro, Anna Pyayt. IEEE Information Reuse and Integration (IRI) 2011, Las Vegas, Nevada; acceptance rate 29%
  2. Learning Unified Famous Objects (UFO) to Bootstrap Information Integration. [bib] [pdf] [book]
    Michael Gubanov, Linda Shapiro, Anna Pyayt. IEEE Information Reuse and Integration (IRI) 2011, Las Vegas, Nevada; acceptance rate 29%
2010-2006
  1. Using Unified Famous Objects (UFO) to Automate Alzheimer's Disease Diagnostics. [bib] [pdf]
    Michael Gubanov, Linda Shapiro. IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2010, Atlanta, Georgia
  2. IBM UFO Repository: Object-oriented Data Integration [bib] [pdf] [book]
    Michael Gubanov, Lucian Popa, Howard Ho, Hamid Pirahesh, Jeng-Yih Chang, Shr-Chang Chen. VLDB 2009, Lyon, France
  3. Simplifying Information Integration: Object-based flow-of-mappings framework for integration [bib]
    Bogdan Alexe, Michael Gubanov, Mauricio A. Hernandez, Howard Ho, Jen-Wei Huang, Yannis Katsis, Lucian Popa, Barna Saha, Ioana Stanoi.
    Invited book chapter in "Business Intelligence for the Real Time Enterprise", Springer 2009
  4. Metadata Management Engine for Data Integration with Reverse-Engineering Support [bib][pdf]
    Michael Gubanov, Phil Bernstein, Alex Moshchuk. IEEE ICDE 2008, Cancun, Mexico
  5. Structural text search and comparison using automatically extracted schema [bib][pdf]
    Michael Gubanov, Phil Bernstein. SIGMOD WebDB 2006, Chicago, Illinois
Massachusetts Institute of Technology MIT Database Group

© Copyright 2014 Michael Gubanov. All rights reserved.