Datasets, Data Mining, and Process Mining
Dataset is a data collection, mostly in some database format.
- .csv format
- where to get datasets
- data scraping
- cleaning and repairing datasets
- data frames and its flavors
Using PHP and MySQL
PHP (Hypertext Preprocessor) is server-side scripting programming language and it is used for interacting with MySQL databases.
- what we can use it for
- data science applications
- updating and writing data
Data Mining Techniques
- Big Data
- processing and cleaning
- methods
Web scraping
Collecting data from web sources is called web scraping.
We can use python for HTML parsing and then clean the data.
- BeautifulSoup
In Anaconda prompt, install BeautifulSoup:
conda install beautifulsoup4
- Scrape the webpages!
- Clean the data
Creating Data Frames
- on-premises and cloud data platforms
Process Mining
- insights on how things work
note
Goals.
info
Goals.
caution
Goals.
warning
Goals.