Data Analytics

DA group photo

Read our feature in the December 2012 Sigmod Record.

The Data Analytics group at QCRI has built expertise focused on three core data management challenges that will enable the effective use of this growing asset class: extraction from its natural digital habitat, integration from a large and evolving number of sources, and robust cleaning processes to assure data quality and validation.

The Data Trio: Extraction, Integration, and Cleaning:  Institutions and industries at a national level deal with large scale, heterogeneous data collected from large number of sources. The main challenge is a judicious use of the information within and across organizations to make informed decisions and to run operations effectively.

At QCRI, we are focusing on the interaction among three core data management challenges that will enable effective use of the continuously growing data: Information Extraction, Data and Schema Integration, and Data Cleaning.

Going beyond traditional ETL approaches, we are investigating multiple new directions, including: handling unstructured data; interleaving extraction, integration, and cleansing tasks in a more dynamic and interactive process that responds to evolving data sets and real-time decision-making constraints; and leveraging the power of human cycles to solve hard problems such as data cleaning and information integration.

Scalable Knowledge Models: Grand challenges mean big data. ‘Knowledge base’ is the term commonly used to refer to data, along with the rules and the logic that describe the information within this data. Large-scale knowledge management is a core-computing challenge due to the expensive process involved in reasoning about the data and inferring the facts and the various semantics embedded within. We focus on developing efficient knowledge representation models and semantic-aware query languages and processing engines that bring semantics to real applications. Main applications domains include media and health, where current approaches are either too expensive or fall short in delivering user needs.


For technical or informational questions, please send an email to 
QCRI Careers with the name of the group to whom you’re directing your question, e.g. ALT, CS&E, Cyber Security, Data Analytics, Distributed Systems or Social Computing, in the subject line.

Principal Scientist

genericImage.jpg

Dr. Mourad Ouzzani

To be part of something different than what I had been used to at Purdue University and contribute to the first computing research institution in the region.
Read more

Principal Scientist

Dr. Sanjay Chawla

QCRI provides an ideal environment to conduct high-impact research which can transcend disciplinary boundaries.
Read more
our-research/data-analytics
Open Source Release:
  • NADEEF.  A semi-automatic extensible data cleaning system.
Meet us at the following conferences:
Learn more about our us:
default

Follow Us

  • YouTube
  • Twitter
  • Facebook
  • RSS Feed
  • Linkedin
  • github-web.png
Back to Top

In the Media

Yahoo Tech.JPG

The hero big data needs? Data Civilizer helps scientists conquer the clutter

29/01/2017

Big data is a big deal. With these huge data sets, analysts can gain unprecedented insight into the hidden patterns of fields like physics, healthcare, and finance. Collecting and analyzing this data...

Read More

MIT Tamer.JPG

Taming data

22/01/2017

The age of big data has seen a host of new techniques for analyzing large data sets. But before any of those techniques can be applied, the target data has to be aggregated, organized, and cleaned up...

Read More

BBC deep learning story pic.PNG

What is 'deep learning'?

08/01/2017

Every day we create billions of bits of data. Ever faster and more powerful computers can use that big data to learn, predict events and carry out key tasks. Surveillance, voice recognition and ...

Read More

Events

2017

MLDAS 2017

(MLDAS 2017) Machine Learning and Data Analytics Symposium

Download ICS File 13/03/2017  - 14/03/2017 , Qatar National Convention Centre

Machine Learning and Data Analytics Symposium - MLDAS 2017 Building on the success of the three previous events , Boeing and QCRI will hold the Fourth Machine Learning and Data Analytics Symposium (...

Read More

Past Events

ArabWic for web.jpg

Women in Data Science

Download ICS File 03/02/2017 ,

Here's a great chance to learn about the latest data science-related research in multiple domains, as part of a global project. Qatar's WiDS event will be held here at the HBKU Research Complex on ...

Read More

2016

QCRI IBM New.JPG

QCRI - IBM Data Science Connect 2016

Download ICS File 16/11/2016 ,

QCRI–IBM Data Science Connect 2016  Doha, Qatar 12.30pm –5:30pm, Wednesday, November 16 HBKU Research Complex, Ground Level Multi-Purpose Room Google Map link to location https://goo.gl/maps/...

Read More

News

MLDAS 2016.JPG

Boeing Partners with QCRI for fourth annual Machine Learning and Data Analytics Symposium (MLDAS)

09/02/2017

The Boeing Company has announced that it will once again partner with the Qatar Computing Research Institute (QCRI), part of Hamad bin Khalifa University, to host the fourth annual Machine Learning ...

Read More

Jalees10.jpg

QCRI’s Jalees Reader app launched in more languages

06/12/2016

French and German interfaces added for free app which allows users to upload books and read them offline.

Read More

IBM Watson robot (ex IBM Watson).JPG

IBM Watson scientist visits Qatar to present platform that 'thinks like a human'

16/11/2016

IBM Watson’s chief data scientist Romeo Kienzler has visited the Qatar Computing Research Institute to conduct a workshop on Watson, a question-answering platform that can “think like a human”. Mr ...

Read More