bob_wenzlau
Tourist
Reged: 02/25/06
Posts: 15
Loc: Palo Alto, CA
|
|
I have been working toward creating a dynamic global air quality layer that updates hourly on Google Earth. The effort is done as an outsider, and uses web crawling to return data from any agency that chooses to publish their data on a web page.
China has posed a unique exercise, and that it is running "again", I thought to share the effort.
The Chinese list air quality through their State Environmental Protection Administration . Their data including the city name, air pollution index and pollutant is only available in Chinese. (The English version shows the pollution index, but not the pollutant -- and a more complete version was shut down a month ago). That the cities are in Chinese characters precludes even the slightest clue as to how to interpret the data. For example, check out a air data page:
Chinese Version of Air Data
So I had the idea to use Google to automatically translate the page, and then scrape the translated page. The Chinese have three summary pages that are the basis for this content layer, and we can view the same page now translated.
Translated Chinese Air Quality Data
Now the next question is where are the Chinese cities? This posed its own challenge. Google Earth or Maps did not seem to know these locations. In this case I found (like many others probably know), that Wikipedia has a great geographic database of place names. Their city naming seems to align with the translated city naming from Google, but I did see many flavors of spelling of Chinese cities.
So now the data from the Chinese is regularly scraped, then scrubbed (as the translated web page has lots of cleanup requirements), and then joined with the city location database.
The Chinese data has other interesting challenges that reveal themselves. Each government can characterize their air quality at different interpretations of health effects. The Chinese do not use the same characterization as other countries. A value judgment for the map maker exists whether one is to be respectful of a countries rating, or seek to find a global norm. Now the map maker enters the greater politic.
Still I am pleased that this is up and running again. This layer had to be updated based on the Chinese stopping their English version. I have my concerns as to how long this dynamic scrape will be stable.
Hope you enjoy it. (Recall that it has 4 countries now, so spin the globe to China...it has so many huge cities I learned about through this exercise.)
...added Mexico over the weekend, and replaced the generic icon with a unique air quality icon.
Edited by bob_wenzlau (03/03/08 08:46 AM)
|
Hill
Master Guide
Reged: 10/31/04
Posts: 8854
Loc: Los Angeles
|
|
It sounds like you have spent a lot of time and effort on this. it is interesting to note that only Hong Kong shows really bad air while markers for the rest of the country are mostly listed as "good". And Beijing, site of the upcoming olympics isn't even reported. Hmmm....
-------------------- Translate text
"Eternity is a very long time, especially towards the end" Stephen Hawking (quoting Woody Allen)
"There are all kinds of interesting questions that come from a knowledge of science, which only adds to the excitement and mystery and awe of a flower." Richard Feynman
Iraq war and occupation costs
|
bob_wenzlau
Tourist
Reged: 02/25/06
Posts: 15
Loc: Palo Alto, CA
|
|
Beijing actually shows on the post.
Your comment stimulated me to dig into this deeper. The Chinese can make their air quality appear "good" simply by calling it so. The air quality data they present would if rated in the United Kingdom rate off their health scale. The respective health descriptors are: - China: Daily activities not be affected See Chinese Health Effects
- United Kingdom: The effects on sensitive individuals described for 'High' levels of pollution may worsen.--> High Levels: Significant effects may be noticed by sensitive individuals and action to avoid or reduce these effects may be needed (e.g. reducing exposure by spending less time in polluted areas outdoors). Asthmatics will find that their 'reliever' inhaler is likely to reverse the effects on the lung. See UK Health Effects
One makes this comparison by applying the API for Particulate Matter (PM) in Beijing. In this example an index of 100 API for PM was reported by the Chinese. This API correlates to a PM(10) of 0.150 milligrams per cubic meter, or equivalently for the UK scale 150 micro grams per cubic meter. When one transfers this PM concentration on the United Kingdom health index scale, one can see it falls above the maximum health hazard index of 130 micro grams per cubic meter. Thus there is a discrepency of approximately 3 orders of magnitude between the criteria for clean air in China versus the United Kingdom.
|
Hill
Master Guide
Reged: 10/31/04
Posts: 8854
Loc: Los Angeles
|
|
Quote:
Beijing actually shows on the post.
Yep. I should have clicked it on the list, rather than hunt for it beneath the star that represents Beijing - I didn't zoom in close enough. 
Did you know that living here on the West Coast of the US, we have even fewer days with deep blue skies because so much particulate pollution now crosses the Pacific from Asia? Of course here in LA those days are hard to come by anyway. See this web page.
-------------------- Translate text
"Eternity is a very long time, especially towards the end" Stephen Hawking (quoting Woody Allen)
"There are all kinds of interesting questions that come from a knowledge of science, which only adds to the excitement and mystery and awe of a flower." Richard Feynman
Iraq war and occupation costs
Edited by Hill (02/16/08 01:29 PM)
|
tegandrew
Tourist
Reged: 07/09/05
Posts: 44
Loc: Sydney, Australia
|
|
Great post Bob! I give it 5 stars.
Especially nice work manipulating the Chinese data. And I like the photos of each location in the balloons.
Could you create regional/province/state subfolders for each country to allow quick location of a city?
I think it would be useful to compare all the results to a global standards such as the World Health Organisation's air quality guidelines (eg pages 275, 326, 375 and 413). That way it would be plain which areas/countries need greater attention to air quality.
Have you considered creating a static (non-network linked) KML that would be imported to the GEC layer so that people browsing their city may serendipitously discover your layer?
|
bob_wenzlau
Tourist
Reged: 02/25/06
Posts: 15
Loc: Palo Alto, CA
|
|
Good thoughts, and thanks for the stars!
I need an idea how I might post this to the Google Earth Community -- recall that the air quality content is dynamic -- and a static version would lose the temporal aspects. There might be a clever way to use a static link to propel a user to dynamic content...
I also updated the main principal kmz. The effort has a name: AirWatch, and now has its own icon.
AirWatch now incorporates hourly updates for Mexico City's air quality. Like China, Mexico had language issues, and also has a unique way of posting their air quality --- they basically add a row to a table, then restart the table daily. It is a bit like solving a crossword to figure out how to capture the data.
My approach to resolve the varying air quality health indices is to work toward building a common air quality data table. The table will allow converting the various country health indices to their underlying concentration -- a common datum. With air quality in concentration, the the user could apply which ever country health index they wish -- or the World Health Organization standards.
-------------------- Bob Wenzlau
bob@terradex.com
|
tegandrew
Tourist
Reged: 07/09/05
Posts: 44
Loc: Sydney, Australia
|
|
Congratulations Bob on your EPA award!
As a suggestion for posting a static file, just post the location of all of the air monitoring stations. Describe them, what they do, the data they gather and provide links to download your network link and to read more about it on the forums. At the very least it would be a nice collection of weather stations. You could also create links to Wikipedia articles on air pollution in each country.
I've done this with my NPI, EPER and renewable energy collections. Since appearing in the GEC layer with a link to correct the location, I get about 200 hits a day (from about 15,000 placemarks).
|