17:05 | 17:45
Keywords defining the session:
- Data Quality
- DQ Big Data
- DQ NoSQL
Takeaway points of the session:
- Will my company have a better understanding of our clients if we improve the quality of my data?
- Is it going to be more valuable the dataquality i could implement in a big data platform where a variety of sources and processes can enrich my data in ways i didn't even now?
Use of Big Data, NoSQL and a couple good of ideas in order to implement a DQ System.
In the past 2 years Minsait has been developing Data Quality engines able to run with data storaged in HDFS via Hive or Impala besides of other implementations in MongoDB or Kudu among others. We will share ideas and problems we solved