Select Page
customer logo
Customer
Success Story

Turning a Legacy Thesaurus into a Strategic Knowledge Asset

With PoolParty, CABI transformed its outdated, single-use thesaurus into a dynamic knowledge graph that connects 80K+ datasheets and powers smarter search and discovery across its Digital Library. This shift reestablished the thesaurus as a strategic data asset.

600K+ New relationships integrated
160K+ Concepts cleaned, validated and classified
80K+ Datasheets dynamically synced by API

The Client

Global non-profit organization dedicated to improve people’s lives by providing expert information to address challenges in agriculture and the environment

The Challenge

The legacy CABI Thesaurus was difficult to manage and offered little support for AI, search facets or data integration, representing a bottleneck instead of a resource

The Solution

Using PoolParty, CABI restructured the thesaurus into SKOS/RDF, cleaned and enriched the data, and automated integration with the CABI Compendium through SPARQL and APIs

Technical capabilities

  • Automated 80K+ Datasheet Sync with built-in quality assurance
  • Built Knowledge Graph for semantic interoperability & automated data integration

Business outcomes

  • Eliminated manual bottlenecks & enabled non-technical self-service data extraction
  • Transformed legacy Thesaurus into a strategic knowledge asset

The Challenge

CABI is a global non-profit dedicated to addressing critical challenges in agriculture and the environment, from combating pests and invasive species to promoting sustainable practices like biocontrols. A cornerstone of their mission is knowledge sharing, delivered through collaborative projects, scientific publishing, and data services.

Their CABI Digital Library hosts CAB Abstracts, a flagship database with over 15 million research records, expanding by 500,000 entries each year.

To make this vast content searchable and accessible, CABI relies on its Thesaurus, a multilingual taxonomy which, as of early 2023, included 160K+ concepts, 30K descriptors, and 1K+ geographic terms, all crucial for tagging, indexing, and search.

But despite its foundational role, the Thesaurus had become a bottleneck: It was managed through outdated software, limited to a single user, and lacked the structural integrity and flexibility needed for modern use cases like faceted search, linked data integration, and AI readiness.

When Gary Leicester joined as Content Metadata Controller in 2022, his immediate challenge was to convert the Thesaurus into SKOS. But this revealed deep structural issues, including tangled hierarchies and outdated data, that blocked progress.

The Solution

To modernize its thesaurus and make it future-ready, CABI implemented PoolParty Semantic Suite. This marked a shift from siloed, manual processes to a collaborative, API-first approach to metadata management.

By leveraging a PoolParty free trial and internal tools, the team conducted a thorough cleanup of over 160K+ concepts — resolving inconsistencies, validating structure, and preparing the data for use in SKOS.

PoolParty’s quality checking tools helped ensure structural integrity while allowing continuous iteration to meet internal and external reporting standards.

Concepts were classified into meaningful categories—such as crops, invasive species, pathogens, and geographies—making the thesaurus more useful across departments and product lines.

Perhaps most critically, CABI replaced single-user workflows with shared governance and streamlined publishing pipelines—laying the foundation for scalable, multi-user collaboration.

The Impact

In just one year, CABI fully transformed its thesaurus from a static legacy tool into a dynamic, reusable knowledge asset. 600,000+ new relationships were integrated, laying the groundwork for smarter data access, enhanced automation, and broader reuse across editorial, product, and research teams.

Thanks to PoolParty, previously manual processes became fully automated. Non-technical users can now extract structured, tabular datasets from the CABI Compendium without IT involvement via a simple, PoolParty-integrated form.

PoolParty’s API-first design also powers real-time synchronization between the Thesaurus and 80,000+ datasheets, eliminating bottlenecks and accelerating delivery of high-value information. Built-in quality assurance tools allow the team to continuously refile and align data for internal use, external sales, and AI-readiness.

Today, the Thesaurus is a central strategic resource and soon, CABI will launch a modern public-facing UI to replace the outdated thesaurus site. This transformation is just the beginning of CABI’s broader vision to leverage semantic technologies for better discovery, stronger data products, and greater global impact.

“We’ve restored a prestigious data set back to the business and made it essential to the work we do in support hugely important mission.”

Gary Leicester, Content Metadata Controller at CABI

Details

Industry: Non-profit
Solution: Graph Modeling
Contact us

Facing Similar Challenges?

Struggling with fragmented documentation, siloed content across teams and platforms, or slow manual workflows limiting search functionality?

Whether you're dealing with legacy vocabularies stuck in single-user workflows, teams spending too much time cleaning data, or looking to automate search and tagging but stuck with outdated tools, Graphwise can help you:

  • Turn legacy vocabularies into AI-ready assets
  • Connect siloed data across systems and teams
  • Enable non-technical users to explore structured data — without codе
  • Build smarter, interoperable content platforms that scale

Let’s talk