Posts tagged with 'etl'
-
How to divide and conquer your data project for success
Martin Magdinier | 17 May 2020
Data extraction is now one of the most efficient ways for companies not only to stay up to date with current events and trends, but also to position themselves in their field. For many small entrepreneurs, and even for larger companies, implementing a data extraction project presents new challenges: How should these processes be implemented, and by whom?
Read more... -
14 rules to succeed with your ETL project
Martin Magdinier | 15 May 2020
Extracting, transforming, and loading (ETL) data is a complex process at the center of most organizations’ data extraction projects. As we saw in our article on web scraping and ETL, implementing an ETL workflow requires in-depth knowledge of several subfields of statistics and programming.
Read more... -
Agile Data Process
Martin Magdinier | 24 June 2015
Stefan Urbanek, when laying the foundation for the School of Data program at Open Knowledge, presented the following data processing pipeline, going from:
Read more... -
When manual line by line cleaning is not enough
Martin Magdinier | 04 October 2014
One of the big news items in the industry this month was CrowdFlower raising $12.5 million in funding to support its growth. CrowdFlower is like a souped-up Amazon Mechanical Turk with a very nice API and a well-thought-out back end for job editors. I couldn’t agree more when Mark Sullivan says:
Read more...