Text Transcript
Welcome to Introduction to Big Data Tools and Hadoop ecosystem. Some of you may have attended my other talks on data science, so you probably know me well.
For those who don’t, I am Dr. Lau Cher Han and I currently wear different hats.
Mainly, I am a data scientist who specializes in text mining, with a strong interest in sentiment analysis, topic detection and machine learning type of applications. Through my career, I’ve also played as a CTO role and so I code in different coding languages such as C#, JavaScript, Python, Ruby on Rails, PHP, and others. Currently, I do lots of in-house training for companies like Intel, HP and others. I often conduct talent development programs for TalentCorp and MDEC.
Here are the things you will learn after this program
- Understand and learn about Big Data
- Understand Hadoop and its key components
When we talk about smaller data, we have different tools like Microsoft Excel, databases etc, but when it comes to Big Data, we currently only have Hadoop. Hence it is helpful to understand Hadoop and its key components.
In this session, we’ll also pick up Big Data processing tools and its techniques
This is when virtual box is important. We will be setting up a Hadoop ecosystem on your machine. Your machine needs to be considerably powerful, such as at least 8GB of RAM to be able to run it.
Next Lesson: What Is Big Data?