Etoso Data API — Verified Sustainability Intelligence
Access standardised, verified ESG and sustainability data from companies' reports and other publicly available sources worldwide.

Executive Summary

Companies and analysts struggle to access standardised real-world sustainability data from various industries and regions worldwide.

Etoso Data API is a REST API that connects your systems directly to verified, traceable ESG metrics—both quantitative and qualitative—as well as full-text insights extracted from over 40,000 sustainability reports worldwide. New companies are added and data is updated monthly.

The API includes original reports and a traceability data layer. This guarantees transparency, auditability, and confidence in every metric.

It is your gateway to dependable, ontology-based sustainability intelligence — ready for automation, analysis, and reporting. Flexible pricing starts at £40k/year, with tiered plans designed to match your needs.

Data Coverage

20,000+
companies
139 countries
79 industries
60,000+ documents (5,640,000+ pages)
25,000,000+
data points
extracted from companies' reports — 625 data points per report on average

Data Layers

  • Reports organised in bundles for each company per year
  • Data points extracted from companies' reports:
    • harmonised layer — standardised (comparable) data points
    • traceability layer — precise mapping of each data point within the original document

Data Depth

Historical coverage extends back to 2018, with the most complete and standardised data available for 2021 onwards, reflecting the maturation of corporate ESG reporting practices.

API Overview

The ESG Data API is a REST-based architecture providing standardised, secure, and scalable access to ESG disclosures.

Main endpoints:

/companies
List of covered entities with metadata (sector, geography, identifiers)
/industries
Industry mapping based on SASB classification
/countries
Geographic coverage by ISO country codes
/disclosures
ESG indicator values by company and year
/context
Contextual-level search

Response format: JSON with pagination to support large result sets. Bulk export is available in Parquet and CSV formats.

Authentication & security: Token-based authentication

Performance: Designed for high-volume analytical use cases with low-latency responses and 99.9% uptime.

Why Etoso

Precise
Apples-to-apples metrics (normalised units); mapping to ESRS/GRI/SASB topics and entities.
Verified & Trustworthy
Data comes directly from corporate reports & verified public sources.
Traceable
Line-level provenance for every metric, target and quote (complete source metadata).
Comprehensive
Over 1,000 data points per report (qualitative & quantitative).
Efficient
Saves hundreds of analyst hours (up to 10× reduction).
Up-to-date
Monthly refresh cycles – scheduled ingestion and re-parsing.

Our Data Pipeline

We collect and verify thousands of sustainability and integrated reports from public sources, enriching them with detailed metadata. ESG indicators are automatically extracted using AI aligned with GRI and SASB standards, while qualitative insights retain the original report context. Quantitative data are normalised and harmonised across units, currencies, and breakdowns to enable consistent cross-company and cross-sector comparisons.

Documents, Reports, Regulations, Public data
AI-Agents
AI-Agents, from Regulatory/Policy Corpus or Data itself
AI-Agents, Algorithms, Ontology
AI-Agents, Algorithms, Ontology
AI-Agents, Algorithms
Transactional / Bulk
Unstructured
Data
Semantic Metadata Layer
Semantic Ontology Layer
Structured Data
Harmonized Data Layer
Semantic Enrichment Layer
Data API
Intelligent Information Collection
Processing, Verification, Generating metadada
Semantic Structure Reconstruction
Semantic Data Extraction
Semantic Data Standardization and Normalization
Precomputing Knowledge building
Programmatic Access to Structured Data
Learn more

We continuously collect thousands of sustainability and integrated reports worldwide from reliable public sources such as company websites and stock exchanges. Each report is verified and enriched with detailed metadata (company and subsidiaries covered by the report, reporting period, industries, language, type of the report or supporting document).

All ESG indicator disclosures are automatically identified within report pages and precisely extracted using AI-driven parsing pipelines. Indicator definitions and extraction logic are aligned with GRI and SASB standards (over 1,000 indicators). At the same time, qualitative insights (e.g., company actions, policies, and strategies) are derived directly from the original report wording—ensuring contextual richness beyond framework-specific templates.

For quantitative indicators, all available breakdowns disclosed by companies—such as geographical, operational, business process, and categorical decompositions—are captured and preserved.

To enable cross-company and cross-sector analysis, comparable quantitative measures are normalised into annual company-level datasets. Unit harmonisation, energy conversions, currency normalisation, and other calculation adjustments are applied to ensure consistency and comparability across data sources.

For Whom

Financial Platforms, Banks, Insurance Companies, and Buy-Side Funds:
power sustainability scores and ranking, investment and risks models; EDD, portfolio screening.
Consultant firms & SI partners:
automating ESG benchmarking and due diligence; embedding verified ESG facts into client workstreams.
Audit & Advisory:
excerpt exports and evidence trails for assurance and gap analysis.
Corporations:
benchmark performance against peers; compliance with regulations; reporting.

Application Scenarios

  • Analytics & BI: add ESG data layers or intelligence features.
  • AI Agents/LLMs/RAG: enable retrieval-augmented generation from our verified ESG data.

Commercial

Plans & Pricing

  • Free Tier: scoped sandbox with success criteria; fast path to production license
  • Subscription Tier: annual, enterprise internal use; tiered by segment/coverage/SLA — starts at £40k/year.
  • Pay-per-call
  • Custom enterprise integrations

Features

Usage flexibility: base subscription + optional metered add-ons (e.g., burst capacity, historical slices, workload-based boosts).

OEM/Embedded (by agreement): for integrators building single-tenant client solutions (no multi-tenant resale).

Licensing guardrails: internal use only; no redistribution of raw records; no training of public models; RAG/fine-tuning allowed for private models; audit logging & data-return on termination.

Contact / Next Steps

Ready to integrate sustainability data into your platform? Request full documentation and access to the sandbox environment:
business@etoso.io
Start for free