• Skip to primary navigation
  • Skip to content
  • Skip to primary sidebar

Publish What You FundPublish What You Fund

The Global Campaign for Aid Transparency

  • RSS
  • Twitter
  • Vimeo
  • LinkedIn
  • Facebook

NEWSLETTER

CONTACT

  • Why it matters
    • Why transparency matters
    • The Story of Aid Transparency
    • What you can do
    • FAQs
    • Case studies
  • The Index
    • 2018 Index
    • Comparison Chart
    • Methodology
    • Index Archive
    • Tools
  • Aid Donors
    • European Union institutions
    • France
    • Germany
    • United Kingdom
    • United States of America
  • Our Work
    • US Foreign Assistance
    • Data Use
    • Joined-up Data Standards
    • Open Ag Funding
    • IATI Decipher
    • Webinars
  • News
    • News
    • Events
    • Blog
    • Reports
  • About Us
    • Board
    • Team
    • Friends of…
    • Our transparency
    • Annual Reports
    • Our Funders
    • Jobs
Show Search
Hide Search
Home / Blog / Locating the geodata – an IATI experiment
blog

Locating the geodata – an IATI experiment

By James Coe | Mar 16, 2017 | Blog

[et_pb_section admin_label=”section” transparent_background=”off” allow_player_pause=”off” inner_shadow=”off” parallax=”off” parallax_method=”on” make_fullwidth=”off” use_custom_width=”off” width_unit=”off” custom_width_px=”1080px” custom_width_percent=”80%” make_equal=”off” use_custom_gutter=”off” fullwidth=”off” specialty=”off” disabled=”off”][et_pb_row admin_label=”row” make_fullwidth=”off” use_custom_width=”off” width_unit=”off” custom_width_px=”1080px” custom_width_percent=”80%” use_custom_gutter=”off” gutter_width=”2″ allow_player_pause=”off” parallax=”off” parallax_method=”on” make_equal=”off” parallax_1=”off” parallax_method_1=”on” parallax_2=”off” parallax_method_2=”on” parallax_3=”off” parallax_method_3=”on” parallax_4=”off” parallax_method_4=”on” disabled=”off”][et_pb_column type=”4_4″][et_pb_text admin_label=”Text” background_layout=”light” text_orientation=”left” use_border_color=”off” border_style=”solid” disabled=”off” border_color=”#ffffff”]

This blog post was written by James Coe of Publish What You Fund and Nick Hamlin from Global Giving as part of the Initiative for Open Ag Funding. It was initially published on InterAction’s website here. 

 

Using Project Documents to Simplify Location Data Publication

Location data matters. Despite being one of the least published pieces of data on the International Aid Transparency Initiative (IATI) Standard, it is consistently highlighted as one of the most important. Without information on where donors are spending their money, aid practitioners are less able to avoid project duplication or identify gaps in funding. As part of the recent Tool Accelerator Workshop, organized by the Initiative for Open Ag Funding, a team of staff from GlobalGiving, Development Gateway and Publish What You Fund worked together to see if we could enhance and simplify the publication of this vital, yet lacking, data.

The idea was simple: identify location names based on unstructured project documents and turn them into geocodes so that they could, for example, be plotted on a map. Like every data science project, our first step was to find some reliable input data upon which to build. We found a treasure trove of potential sources embedded in the document links included in IATI activity files. Within these, evaluation documents seemed to be the best potential source of location data, so we created a library of these files to quickly sample from.

Data extraction

Documents in hand, we now had to try to extract the relevant location names and other useful information. Unfortunately, the types of documents that organisations attach to IATI activities are extremely diverse and come with a wide variety of structures and file formats. To begin to make sense of them, we first converted them into raw, uniform text. We then used a technique called named entity recognition to distill the messy raw text into the clean lists of organisations and locations hidden inside.

Thanks to Python’s natural language processing tools, we were able to begin iterating quickly on our corpus of documents and quickly discovered what data points could be easily extracted. The excerpts of our code on Github describe this process in more detail. In the end, our result looked something like this (emphasis added):

Location Data Points

We now had a list of sub-national locations; some relevant, some not. Our next step was to convert these names into geocodes. Much of our exploration at this stage focused on whether these results could be integrated with Development Gateway’s Open Aid Data Geocoder, which can assign precise geospatial data based on the location names in accordance with the IATI schema.

Several practical hurdles remain before an approach like this could be used in production, but by the end of this short exercise we had discovered a simple way of automating IATI data creation by scraping text from project documents, extracting location data and then converting these results into geocodes.

What next?

If we are to take this idea further, our next step would be to identify the earliest instance in which subnational locations appear in available project documentation. Since evaluation documents are only available at the end of a project’s lifecycle, they are not an ideal choice for completing forward-looking IATI data, although they could be used to enhance data already on the IATI registry. We will also need to test ways to implement this approach at scale before integrating it into donor publication processes.

In any case, what we have learned so far is that location data is central to improved coordination and, with this automated extraction and geolocation process, it is well worth the effort to explore how we can improve its publication and availability.

[/et_pb_text][/et_pb_column][/et_pb_row][/et_pb_section]

Reader Interactions

Primary Sidebar

NEWS Topics

Africa Agriculture Aid transparency Aid Transparency Index Australia Budget ID Canada China Climate Change Corruption Data Revolution Data use Data Visualisation European Commission Financing for Development France Freedom of Information Gender Germany GPEDC Health Humanitarian International Aid Transparency Initiative Italy Japan Joined-up data Korea Letters MDGs New Zealand Open contracting Open data Open government Press Releases Publish What You Fund Resources Road to 2015 Sustainable Development Goals Sweden UK United Nations United States US US foreign assistance World Bank

NEWS CATEGORIES

  • Blog
  • Case studies
  • Events
  • News

REPORTS

  • Aid transparency
  • Aid Transparency Index
  • China
  • Climate Change
  • European Union
  • Multimedia
  • United States

Twitter

  • We have a great #job opportunity for a project assistant. If you'd like to build your career in… https://t.co/O4g5WVE1bO
    Feb 21, 2019
  • Our developer @andylolz has made a new tool to convert #IATI v1.0x data to v2.03. It’s here:… https://t.co/r1AYbgB3hN
    Feb 20, 2019
  • Job alert! We're looking for a new Project Assistant to help with some of our upcoming projects. Do you know anyone… https://t.co/FPcTWYrLlg
    Feb 19, 2019
FOLLOW US
  • Contact Us
  • Copyright
  • Privacy Policy
  • RSS
  • Twitter
  • Vimeo
  • LinkedIn
  • Facebook

Publish What You Fund. China Works, 100 Black Prince Road, London, SE1 7SJ
UK Company Registration Number 07676886 (England and Wales); Registered Charity Number 1158362 (England and Wales)