Buomsoo Kim

Web Crawling and Text Mining with Python in Korean (파이썬 웹 크롤링 & 한국어 텍스트 분석)

Google Colab version of the files: http://bit.ly/2I4XZBx

  • Lecture materials for SNU Big Data Academy, Urban Data Science Lab (UDSL), etc.
  • Materials are provided in Korean
  • Lecture Website: SNU Big Data Academy

GitHub repository

To access all lecture materials, click here

Contents

<Part 0> Preliminaries

  1. (lab) Python Preliminaries
  2. (lecture) Fundamentals of HTML & CSS / (lab) HTML/CSS


<Part 1> Web Crawling

  1. Web Crawling - 1: (lecture) Urllib & BeautifulSoup / (lab) Daum Dictionary Crawling
  2. Web Crawling - 2: (lab) Daum Movie Crawling
  3. Web Crawling - 3: (lecture) Splinter / (lab) Dictionary.com Crawling
  4. Web Crawling - 4: (lab) Daum Movie Crawling (using Splinter)


<Part 2> Text Analysis

  1. Text Analysis - 1: (lecture) Fundamentals of Text Analysis / (lab) Text Data Processing (KoNLPy)
  2. Text Analysis - 2: (lab) Text Data Processing (KoNLPy)
  3. Text Analysis - 3: (lab) Text Data Exploration (NLTK)
  4. Text Analysis - 4: (lab) Text Visualization (Word Cloud & Network Graphs)
  5. Text Analysis - 5: (lab) Sentiment Analysis with Korean Movie Reviews