Text Data Analysis
RUC, Spring 2024
This course provides comprehensive coverage of text data analysis, including data collection and cleaning, data representation, building text representation models, pretraining large language models, data classification and clustering, text summarization, and applying these methods to research questions in fields such as social science.
Course Staff
INSTRUCTOR
- Shaonan Wang, SHE/HER (email: wangshaonan2013@ia.ac.cn)
TEACHING ASSISTANT
- Chunyu Ye, HE/HIM(email: yechunyu2023@ia.ac.cn)
Logistics
All classes are held in person at 立德1011 Renmin University, from 09:00–12:00 every Saturday, starting September 7, 2024.
Prerequisites
Students are expected to have seen most of the following concepts before.
PYTHON PROGRAMMING
Basic syntax, Jupyter notebooks, package managers (e.g., pip)
As this is a course at the entry level, encompassing students from diverse backgrounds such as computer science, artificial intelligence, finance and economy, many may encounter unfamiliar topics. However, it’s perfectly acceptable if you find yourself unfamiliar with any of the mentioned subjects. Feel free to look up unfamiliar concepts or ask for assistance whenever necessary.