OpenRefine is a free, open source power tool for working with messy data and improving it
#
datawrangling
Repositories 42
course website for data science tools 1
datascience
bash-script
pandas
seaborn
git
eda
matplotlib
jupyter-notebook
sklearn
dataingestion
datawrangling
Jupyter Notebook
Updated Mar 15, 2019
haojing9058 / Comparison-of-Regression-Machine-Learning-Algorithms-of-Explaining-Students-Academic-Performance
2
svm
svr
regression
uci-machine-learning
python-script
datawrangling
data-analysis
machine-learning
scikit-learn
pandas-dataframe
Jupyter Notebook
Updated Nov 29, 2017
Understand the relationships between various features in relation with the sale price of a house using exploratory da…
python
ridge-regression
linear-regression
data-visualization
data-analysis
datawrangling
predictive-modeling
machine-learning
correlation
standardization
data-extraction
data-exploration
encoding-library
regression-analysis
lasso-regression
regularization
root-mean-squared-error-metric
parameter-tuning
cross-validation
k-fold
Jupyter Notebook
Updated Jan 19, 2018
Jupyter Notebook
Updated Mar 14, 2019
Udacity Data Analyst NanoDegree Data Wrangling course project
Jupyter Notebook
Updated Sep 12, 2017
通过Python对OpenStreetMap的数据集进行整理和清洗。
Jupyter Notebook
Updated Sep 11, 2017
Research Lab Script in R
Updated Dec 23, 2017
This assignment is based on a dataset of credit card transactions that you can download from Link to download the dat…
sklearn-library
segmentation
kmeans-clustering
kmeans
decision-trees
programming
python3
dataanalysis
datawrangling
Jupyter Notebook
Updated Mar 6, 2019
This analysis mainly aims to find a way to decide which one of these clients without financials have actually over 5 …
Jupyter Notebook
Updated Mar 6, 2019
Data wrangling using python and SQL
Jupyter Notebook
Updated Aug 17, 2017
Jupyter Notebook
Updated Mar 18, 2018
Jupyter Notebook
Updated Aug 20, 2017
data
data-analysis
python
jupyter-notebook
prosper-loan-data
prosper
udacity
nanodegree
portfolio
fifa
analytics
eda
exploratory-data-analysis
explanatory-data-analysis
seaborn
datavisualization
datawrangling
datacleaning
Jupyter Notebook
Updated Feb 26, 2019
SQL and dplyr queries; basic descriptive statistics mapping
HTML
Updated Mar 5, 2019
Data wrangling in Python
ipython-notebook
datawrangling
pandas
seaborn
matplotlib
exploratory-data-analysis
data-cleaning
data-wrangling
statistical-inference
Jupyter Notebook
Updated Apr 21, 2017
Example of how to use CasperJS to scrape data from a website
JavaScript
Updated Jan 12, 2019
Python
Updated Mar 22, 2019
HTML
Updated Jan 20, 2019
Submitted for Coursera Certification
R
Updated Oct 26, 2016
regression
logistic-regression
stata
agriculture
child-malnutrition
uganda
udhs
demographics
data-analysis
datawrangling
data-visualization
child-zscores
Updated Oct 26, 2017
This is an exercise on the use of python for data wrangling based on the book "Python for Data Analysis" by Wes McKinney
Jupyter Notebook
Updated Jul 21, 2018
Aggregate data in R using simple SQL commands
R
Updated Dec 2, 2018
Jupyter Notebook
Updated Mar 17, 2019
This project is aimed to use data munging techniques, such as assessing the quality of the data for validity, accurac…
HTML
Updated Oct 21, 2017
This repo contains the code to download data and then extract it, if needed, and store it in a pickle file.
Python
Updated Mar 24, 2017
Jupyter Notebook
Updated Jul 16, 2017
Data Wrangling with MongoDB class code
HTML
Updated Mar 6, 2019
Analysis of NOAA storm database with R to determine most severe types of weather event
HTML
Updated Feb 16, 2017

