July 19-20 (Saturday-Sunday), 2014
Sammy Fung conducted a Software Technology and Devloping talk “Scrapy: DIY creating machine-readable data we need before data open” and a Lightning talk “Open Source Job Board” in COSCUP 2014 Taipei.
Website: https://coscup.org/2014/en/
Post Event report: https://sammy.hk/coscup-2014/
Scrapy: DIY creating machine-readable data we need before data open
Abstract
Scrapy is a open source python web scraping framework. Nowaday, many information are still not available in open data or machine-readable data format in the world. Open data movements are still a very beginning in some countries, and it is still lack of open data datasets in Hong Kong, so Sammy develops web scraper programs for 12+ years, and starts to release few open source web scraper programs since 2012, such as weather data web scrapers, air quality data web scrapers.
In this presentation, with Sammy’s open source web scraping / open data projects, Sammy will introduce how to write python codes to use scrapy and django to scrap the web, to produce machine-readable data or even open data.
Weather data web scraper – https://github.com/sammyfung/hk0weather
Air quality data web scraper https://github.com/sammyfung/hkaqhi
Biography
Sammy is a python developer in Hong Kong and also a startup person. He is founder of some open source communities (HKLUG, Open Source Hong Kong) in Hong Kong. He is also participating in GNOME, Mozilla, Open Data, other Open Source communities over 17+ years.