Today I found myself sitting in a classroom discussion when the need arose to figure out how far each of 344 survey respondents had come to the place where the survey was being conducted. The survey included the home zip code of the subjects, and we knew, of course, the zip code of the survey location.
While I was Googling for how to best geocode the zip codes, I happened to also ask in the #geo irc channel (on oftc.net) whether there were any free distance calculation web services. Rich Gibson, who constructed geocoder.us piped up and said he was adding to the API for its web services and did people have any requests. What luck! I asked for something that given two zip codes would return a distance.
Not long thereafter, here’s what he came up with:
http://geocoder.us/service/distance?zip1=41080&zip2=02139
which would return the distance between Petersburg, KY and Cambridge, MA.
So I was off to the races! I copied the list of zip codes from the Excel spreadsheet and pasted them into an Emacs buffer, then saved them as ‘zips’. Then in the shell (I use tcsh…):
First to get rid of duplicates:
% sort zips | uniq > zips2
Then, loop through them. I put line breaks with continuation characters into the geocoder line to fit inside this blog post.
% foreach zip ( `cat zips2` ) foreach? echo ${zip} `curl \ "http://geocoder.us/service/distance?zip1=41080&zip2=${zip}" \ | sed -e 's/.*=//' | sed -e 's/ miles$//'` >> distances foreach? end
And that’s it! I pasted the results back into the spreadsheet, got rid of the three bogus zipcodes in the data that resulted in errors from geocoder.us, used VLOOKUP to pull out the distances and I was done.
The service returned errors three times. I used the USPO zip code reverse lookup to verify that the codes were indeed bogus.
That’s my kind of web service. No WSDL, WADL, or WS-anything.
(Yes, I could have used Perl, Python, Ruby, etc. but I learned shell commands in 1981 before we had all that fancy stuff and it’s the first thing I try.)