description
owners Tiago Ferreira
last changeTuesday, May 2, 2017 17:55 +0200
size501 KB
stats9 commits and 0 tags in 25 days metrics
repository url
http://git.mmpsoftware.com/gitblit/r/eprofilers_development.git
R
Git
2017-04-21 benakesh
EPD-32 data cleansing on brand names
ab388b diff | tree
2017-04-19 vineeth
EPD-27-Test
e1cae7 diff | tree
2017-04-19 vineeth
EPD-27- Remove duplicates from brandnames
b9c5fb diff | tree
2017-04-05 milovanovicm
Added processed.csv - information which file is at what stage of processing;
3eaa73 diff | tree
2017-04-05 milovanovicm
Added beloon_schema.csv - master schema mapping file for beloon datasets; a...
5bc755 diff | tree
2017-04-04 milovanovicm
Added schema.csv - master schema mapping file;
d25a2e diff | tree
2017-04-03 milovanovicm
Added data_cleansing.py script
a3c005 diff | tree
2017-03-30 milovanovicm
Added scripts for Hadoop and Spark preparation and installation
0f2fb1 diff | tree
2017-03-27 Tiago Ferreira
Initial commit
3700d5 diff | tree
2017-05-02 milos-dev milovanovicm EPD-24 Product extraction - removing features from title column #time 2h 30... log | tree | raw
2017-05-02 Etl benakesh EPD-38 Master Data for Colors log | tree | raw
2017-04-21 master benakesh EPD-32 data cleansing on brand names log | tree | raw