In this course, students will be introduced to a variety of basic principles, and techniques involved in carrying out data mining on textual datasets or textual attributes. Topics include document representation, tokenization, parsing, text categorization, text clustering, topic modeling, and sentiment analysis. Concepts of Natural Language Processing (NLP) and Information Retrieval (IR) relevant to text mining will also be covered. Prerequisite: CS-240 (3-0-3)