Do you know what dirty data is? How about Hadoop User Experience (Hue)? When working with or around big data it helps to know the terms used to describe it. This list has 50 big data terms everyone should know about analytics and software relating to big data. Dirty data is, “Dirty data is data that is not clean or in other words inaccurate, duplicated and inconsistent data.” Not good. Hue is an interface for easier use of Apache Hadoop. MachineMetrics knows how to work with these terms and this data, so contact us to learn how it can work for your business.
This article is a continuation of my first article, 25 Big Data terms everyone should know. Since it got such an overwhelmingly positive response, I decided to add an extra 50 terms to the list. Just to give you a quick recap, I covered the following terms in my first article: Algorithm, Analytics, Descriptive analytics, Prescriptive analytics, Predictive analytics, Batch processing, Cassandra, Cloud computing, Cluster computing, Dark Data, Data Lake, Data mining, Data Scientist, Distributed file system, ETL, Hadoop, In-memory computing, IOT, Machine learning, Mapreduce, NoSQL, R, Spark, Stream processing, Structured Vs. Unstructured Data. Read More