Friday, July 21, 2017

Web Scraping and Content Mining

DESCRIPTION
Most interesting course in NYC.
Two-session workshop
Web scraping is a method for extracting text from websites so that it can be analyzed. It is a form of content mining: you collect useful information from websites, including quotes, prices, news, company info, etc. The data is gathered directly, either by reading a website's HTML code or through visual abstraction techniques, using the Python programming language.
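To make the idea of reading a page's HTML code concrete, here is a minimal sketch that pulls quotes out of a small HTML table. It uses Python's built-in html.parser module rather than the course libraries, so that it runs without installing anything. The sample page, the "ticker" and "price" class names, and the names QuoteParser and extract_quotes are all made up for illustration.

```python
from html.parser import HTMLParser

# Hypothetical sample page; a real scraper would download this HTML first.
SAMPLE_HTML = """
<html><body>
  <table>
    <tr><td class="ticker">ABC</td><td class="price">101.25</td></tr>
    <tr><td class="ticker">XYZ</td><td class="price">47.10</td></tr>
  </table>
</body></html>
"""


class QuoteParser(HTMLParser):
    """Collect the text of <td> cells, grouped by their class attribute."""

    def __init__(self):
        super().__init__()
        self.current_class = None
        self.cells = {"ticker": [], "price": []}

    def handle_starttag(self, tag, attrs):
        # Remember which kind of cell we are inside, if it is one we care about.
        if tag == "td":
            cls = dict(attrs).get("class")
            self.current_class = cls if cls in self.cells else None

    def handle_endtag(self, tag):
        if tag == "td":
            self.current_class = None

    def handle_data(self, data):
        # Keep only non-empty text found inside a tracked cell.
        if self.current_class and data.strip():
            self.cells[self.current_class].append(data.strip())


def extract_quotes(html):
    """Return a list of (ticker, price) pairs found in the HTML string."""
    parser = QuoteParser()
    parser.feed(html)
    parser.close()
    return [(t, float(p)) for t, p in zip(parser.cells["ticker"], parser.cells["price"])]


if __name__ == "__main__":
    print(extract_quotes(SAMPLE_HTML))
```

A library such as BeautifulSoup would likely make the same extraction shorter; the point here is only to show what "looking at the HTML" means in practice.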
We start the workshop by exploring different methods of gathering data from the Web, then go through the whole process of gathering, storing, and analyzing data. Our examples use real-life financial quotes and 10-K annual reports. During the course we learn how to use numerous Python libraries: Urllib, Requests, Wget, BeautifulSoup 4.0, SSL, PDFminer3k, Twitter, and others.
Also, we learn to construct regular expression patterns to find targeted information on Web pages. As part of content mining, we build a Twitter application to search for and analyze trends.
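As a small taste of what a regular expression pattern looks like, the sketch below finds dollar amounts in a piece of text. The sample sentence, the pattern, and the function name find_dollar_amounts are illustrative assumptions, not material taken from the course.

```python
import re

# Hypothetical sentence of the kind found in a financial report.
SAMPLE_TEXT = (
    "Revenue rose to $1,234.50 million, up from $987.00 million; "
    "contact ir@example.com."
)

# A dollar sign, digits with optional thousands separators,
# and an optional decimal part. Only the number itself is captured.
PRICE_PATTERN = re.compile(r"\$(\d+(?:,\d{3})*(?:\.\d+)?)")


def find_dollar_amounts(text):
    """Return every dollar amount in the text as a float."""
    return [float(m.replace(",", "")) for m in PRICE_PATTERN.findall(text)]


if __name__ == "__main__":
    print(find_dollar_amounts(SAMPLE_TEXT))
```

The pattern is deliberately simple: it ignores other currencies and negative amounts, which a real scraper might need to handle.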
The price covers both classes.
You will learn:
BeautifulSoup Python Library
How to use Urllib and Requests
Regular expression patterns
Read and analyze PDF files
Store data in CSV files and a SQL database
Create a Twitter app
Build a custom Google search engine
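
For the storage topic in the list above, here is a minimal sketch of saving scraped rows both as CSV text and in a SQL database. It assumes SQLite (through Python's built-in sqlite3 module) as the database and keeps everything in memory; the sample rows, the "quotes" table, and the function names are made up for illustration.

```python
import csv
import io
import sqlite3

# Hypothetical scraped rows: (ticker, price).
ROWS = [("ABC", 101.25), ("XYZ", 47.10)]


def rows_to_csv(rows):
    """Write the rows as CSV text with a header line and return the text."""
    buffer = io.StringIO()
    writer = csv.writer(buffer)
    writer.writerow(["ticker", "price"])
    writer.writerows(rows)
    return buffer.getvalue()


def rows_to_sqlite(rows):
    """Load the rows into an in-memory SQLite table and return the connection."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE quotes (ticker TEXT, price REAL)")
    # Placeholders (?) let sqlite3 handle quoting of the values safely.
    conn.executemany("INSERT INTO quotes VALUES (?, ?)", rows)
    conn.commit()
    return conn


if __name__ == "__main__":
    print(rows_to_csv(ROWS))
    conn = rows_to_sqlite(ROWS)
    print(conn.execute("SELECT ticker FROM quotes WHERE price > 100").fetchall())
    conn.close()
```

To keep the data between runs, the same code could write to a file on disk instead: open a real file for the CSV writer and pass a file path to sqlite3.connect.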
