July 19-20 (Saturday-Sunday), 2014
Sammy Fung presented "Scrapy: DIY creating machine-readable data we need before data open" as a Software Technology and Developing talk at COSCUP 2014, Taiwan's annual open source conference, and also gave a lightning talk titled "Open Source Job Board".
Event website: https://coscup.org/2014/zh-tw/index.html
Post-conference reflections: https://sammy.hk/coscup-2014/
Scrapy: DIY creating machine-readable data we need before data open
Abstract
Scrapy is an open source Python web scraping framework. Today, much of the world's information is still not available as open data or in a machine-readable format. Open data movements are still at a very early stage in some countries, and Hong Kong still lacks open datasets. Sammy has been developing web scrapers for more than 12 years, and since 2012 he has released several of them as open source, including weather data and air quality data scrapers.
In this presentation, drawing on his open source web scraping and open data projects, Sammy will introduce how to write Python code that uses Scrapy and Django to scrape the web and produce machine-readable data, or even open data.
Weather data web scraper – https://github.com/sammyfung/hk0weather
Air quality data web scraper – https://github.com/sammyfung/hkaqhi
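As a rough illustration of the idea in the abstract, turning a human-readable web page into machine-readable data, here is a minimal sketch. It deliberately uses only the Python standard library (`html.parser`) rather than Scrapy itself, so it runs without extra packages; in a Scrapy project the same extraction would typically live in a spider's parse callback. The sample HTML, the station names, and the names `TableParser` and `scrape_table` are hypothetical and are not taken from hk0weather or hkaqhi.

```python
import json
from html.parser import HTMLParser

# Hypothetical sample page; real weather pages will have different markup.
SAMPLE_HTML = """
<table>
  <tr><th>Station</th><th>Temperature (C)</th></tr>
  <tr><td>Station A</td><td>28.5</td></tr>
  <tr><td>Station B</td><td>27.1</td></tr>
</table>
"""


class TableParser(HTMLParser):
    """Collect each table row as a list of cell strings."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None       # cells of the row currently being read
        self._in_cell = False  # True while inside a <td> or <th>

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._in_cell = True
            self._row.append("")

    def handle_endtag(self, tag):
        if tag in ("td", "th"):
            self._in_cell = False
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None

    def handle_data(self, data):
        # Only keep text that appears inside a table cell.
        if self._in_cell and self._row:
            self._row[-1] += data.strip()


def scrape_table(html):
    """Return table rows as dicts keyed by the header row."""
    parser = TableParser()
    parser.feed(html)
    if not parser.rows:
        return []
    header, *body = parser.rows
    return [dict(zip(header, row)) for row in body]


records = scrape_table(SAMPLE_HTML)
print(json.dumps(records, indent=2))  # machine-readable JSON output
```

The JSON records produced this way could then be stored in a database or published as a dataset, which is the step where a framework such as Django might come in.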
Biography
Sammy is a Python developer in Hong Kong who is also involved in startups. He is the founder of several open source communities in Hong Kong (HKLUG, Open Source Hong Kong). He has also participated in GNOME, Mozilla, open data and other open source communities for more than 17 years.