BTAA-GIN Geodata Collection Strategic Roadmap¶
The BTAA-GIN Program launched a new initiative in 2024 to collect, store, and distribute open geodata.
Follow our progress
- Done
- In progress
- Not started
Year 1 (2024): Pilot Phase¶
During Year 1, we focus on exploring the potential for a Geodata Collection. This phase involves experimenting with a variety of datasets to test workflows, metadata, tools, and data curation strategies. The goal is to establish Proof of Concepts and refine our methods before scaling up. This work will inform the development of key protocols and processes that will serve as the foundation for the project moving forward.
1. Set up storage & pilot workgroup (2024 Q1-Q2)¶
Pilot Workgroup¶
- Establish the Geodata Collection Pilot Working Group.
- Research existing geodata archiving practices across the BTAA region.
- Create an initial set of sample datasets in a GitHub repository to serve as test objects
Technology¶
- Establish Amazon cloud storage (S3) accounts for data storage.
- Develop techniques within GBL Admin for basic ingest of datasets.
- Test adding sample datasets to Amazon S3.
Recruitment and staffing¶
- Assess current team capabilities and identify gaps in skill or resources.
- Draft job description Program and Outreach Coordinator.
- Advertise the position.
Documentation¶
- Obtain official approval of the BTAA-GIN Geodata Collection Strategic Plan.
- Create and publish a Geodata Collection Implementation Plan that expands on the Strategic Plan.
2. Develop Curation Plan and explore technology enhancements (2024 Q3-Q4)¶
Pilot Workgroup¶
- Ingest sample data to GBL Admin.
- Determine staging environment for sharing datasets
- Select a broader range of pilot datasets.
- Download pilot datasets to staging area
- Test new GBL Admin features for data management.
- Determine minimum metadata requirements.
- Add pilot datasets to GBL Admin.
- Augment the GeoBTAA Metadata Application Profile as needed.
- Determine Download package contents.
- Sunset the pilot workgroup.
Technology¶
- Set up staging area (Box).
- Separate references into Distribution table Modify GBL Admin to store assets and external links in a separate table.
- Redesign the item view page Incorporate tabs for metadata, data dictionaries, and download options into item page view.
- Create a Download Package
Recruitment and staffing¶
- Continue process for interviewing for new position
- Finalize the hiring process and onboard new hire
Curation Plan development¶
- Identify the themes of geospatial data to be included in the collection.
- Describe the proposed dataset workflow and metadata assets to be created.
- Define curation criteria, including data selection, acquisition, and quality control.
Documentation¶
- Document the setup and configuration processes for S3 accounts and asset management tools.
- Document pilot testing procedures and decisions.
- Document the curation plan
Year 2 (2025): Trial Phase¶
Year 2 marks the transition to a Trial phase, where we begin collaborating with select data providers to curate datasets. This phase emphasizes building relationships, refining curation techniques, and enhancing the overall structure of our Geodata Collection.
3. Begin Data Curation Pilot and establish communication (2025 Q1-Q2)¶
Data Curation Trial¶
- Establish partnership with two data providers
- Determine data provider agreements
- Ingest datasets submitted by partners
Technology¶
- Transition the updated Geoportal interface from the development branch to the live environment.
- Implement batch ingest functionality in GBL Admin.
- Redesign the Geoportal interface to improve discovery and access for the Geodata Collection on the development branch.
Communication¶
- Identify key stakeholders.
- Develop communication strategies (frequency, channels, and content).
- Prepare communication templates and materials.
4. Outreach and Active Data Curation (2025 Q3-Q4)¶
Outreach¶
- Create an outreach schedule for engaging with the community, including presentations at conferences.
- Conduct educational sessions on data curation as webinars or workshops.
- Design and distribute outreach materials to highlight the project's features and benefits.
Active curation process¶
- Establish curation cycles.
- Ingest and publish assets as identified in Communication and Curation Plans.
Year 3 (2026): Evaluation and Outreach Phase¶
In Year 3, we will focus on evaluating the effectiveness of our curation efforts and reviewing the overall structure and processes established in the previous phases. This phase will include gathering feedback from stakeholders, refining workflows, and assessing the long-term sustainability of the collection. Additionally, we will increase our outreach efforts, promoting the collection to a broader audience and building partnerships to ensure its growth and visibility in the geospatial and library communities.
Assessment and Feedback¶
- Assess the project's impact on users and stakeholders.
- Evaluate how well the curation plan functions.
Monitoring and Evaluation¶
- Set up monitoring tools for the new systems.
- Evaluate the effectiveness of the curation process, identifying areas for improvement or adjustment.
- Establish mechanisms for regularly updating the project based on stakeholder feedback and evolving requirements.