HW 5

  

1. What are the main challenges of text analysis?

2. What is a corpus?

3. What are common words (such as a, and, of) called?

4. Why can’t we use TF alone to measure the usefulness of the words?

5. What is a caveat of IDF? How does TFIDF address the problem?

6. Name three benefits of using the TFIDF.

7. What methods can be used for sentiment analysis?

8. Research and document additional use cases and actual implementations for Hadoop.

9. Compare and contrast Hadoop, Pig, Hive, and HBase. List strengths and weaknesses of each tool set.

10. Research and summarize three published use cases for Hadoop, Pig, Hive, and HBase.

Needs help with similar assignment?

We are available 24x7 to deliver the best services and assignment ready within 3-4 hours? Order a custom-written, plagiarism-free paper

Get Answer Over WhatsApp Order Paper Now