DiscoverableConformedPricedTradable

DAACData As A Commodity

The data isn’t missing. It’s scattered.

The data the world needs from frontier economies is scattered across the web, stranded in disconnected systems, or never digitised at all, because compiling it has never been worth anyone’s while. DAAC makes it a commodity: found, consolidated, priced and traded, with ownership staying at the source. When the record earns for the people who keep it, transparency stops being an obligation and becomes an economy.

DAAC · discoveryScattered
SCANNING FIELD
SCATTERED · UNINDEXED
NzibiraSouth Kivu, DRC
Open web · 00 sources · 0 data points
Scattered across the web, or never digitised. Now findable.
01The problemwhat can’t be seen

Nobody pays for ground truth. So nobody compiles it.

In frontier economies the knowledge the world depends on never becomes data, for a plain economic reason: the people who hold it have never been paid to compile it, and the systems that could carry it were never built for them. It lives in heads, on paper, and in disconnected silos, in places its buyers will never visit.

So every decision downstream runs on guesswork: risk is priced blind, provenance is unverifiable, impact claims cannot be checked, diligence is a paper trail. The parties who need the data, and increasingly the AI agents acting for them, have nothing to query.

Where nothing can be verified, honest and dishonest trade at the same price.

PriceUnderwriteFinanceComplyno recordOriginuncaptured
Every downstream decision, pricing, underwriting, finance, compliance, is made on the origin’s data. That record is where the data is true, and where it is never captured.
02The solutionwhere the answer lives

DAAC consolidates the public record, connects closed systems, and digitises what was never captured.

Say you need the picture on a mining community in South Kivu: who operates there, what the security context is, what changed last month. The answer exists. It sits in three states, each out of reach in its own way, and DAAC builds the route to each.

01 · Scatteredthe public record

Consolidate the public record

Reports, registries, news and datasets are public but dispersed, unrated, and quietly going stale. Crawl scans a sector’s sources, verifies every finding by corroboration, and organises the result into maintained public hubs: one trustworthy view where there were a thousand tabs.

Crawlthe hubs
02 · Strandedthird-party systems

Map the systems that hold it

Government registries, NGO databases and traceability platforms hold rich records that cannot talk to each other. Scribe renders each source into the shared schema, so what a third-party system holds becomes discoverable and comparable, with no rebuild on their side.

Scribeintegration
03 · Undigitisedlocal knowledge

Digitise what was never captured

The deepest answers live with the people on the ground, in heads and on paper, invisible to any search. Frame apps, composed by Forge agents, digitise that knowledge at the source, owned and controlled by the people who record it.

Frame + Forgethe apps
One query

Spine makes the three behave as one: a single question, asked by a person or an AI agent, returns every source that holds part of the answer, on each owner’s terms. The console at the top of this page is that query.

Underneath the three routes is one mechanism: ownership creates the incentive, and the incentive creates the data. DAAC stands for Data As A Commodity, and the name is literal: field data becomes an asset its owner prices and sells, and DAAC is the market between the two sides of that trade.

SupplySovereign digitisation ecosystem

Field apps that capture data at the source. Datastake builds the most mature of them; anyone can build apps that join.

Intelligence marketDAAC

Five products turn that supply, and the scattered public record, into a priced, sovereign asset matched to the buyers who need it.

DemandBuyers & agents

Traders, auditors, regulators, insurers, and the AI agents acting for them, buying intelligence for risk, impact, due diligence and opportunity, on the owner’s terms.

03How it worksthe five products

Five products carry data from first record to paid exchange, while it stays with its owner.

DAAC is Frame, Forge, Crawl, Scribe and Spine, and each earns its place against one of the three states the data is trapped in. Frame is the framework that apps, and whole digital ecosystems, run on; Forge is the agents that build on it: together they digitise what was never captured. Crawl consolidates the scattered public record into maintained hubs. Scribe renders the sources stranded in third-party systems into the shared schema. And Spine makes it all one market: discover and exchange across every system while the data stays at its source. The honest status of each is on the dial: Crawl’s hubs already run, the rest are in design.

FIG.01 · five products, two supply paths, one foundation
A · Create the supplyB · Make the marketForgeagents build appsFramethe app frameworkappscapture field databuildsrun onPublic sourcesregistries · portals · reportsCrawlscan · verify · organiseScribemap · reconcile · conformpublic weborganisedpublishesPublic hubopen · source-rated · kept freshSpineexchange, at sourcecaptured dataconformedIntelligencethe demand-side payoffpriced accessrisk managementimpact measurementdue diligenceopportunity assessmentwithin-appcross-systemThe foundationShared conventionsidentity · schema · provenance · sharing rules
DAAC product — Spine, in copper, is the load-bearing gateApps built on Frame — the ecosystem that captures the supplyExternal — public sources scanned in, intelligence served out

What the five produce: apps that capture the field and hubs that open the public record are the supply; what DAAC offers on top of them is intelligence, decision-grade answers on risk, impact, due diligence and opportunity.

Supply-sidefrom

Apps

Field applications that capture frontier data at source. The ecosystem’s first-party supply: nine built, seven live.

See the proof →
Supply-sidefrom

Hubs

Curated, maintained public reference hubs for opaque sectors. A public good and a front door, and public-data supply.

See the hubs →
The offerfrom

Intelligence

Decision-grade data the market delivers to demand, priced and permissioned. What buyers actually pay for.

Risk managementImpact measurementDue diligenceOpportunity assessment
Who pays for it →

One principle ties the five together: demand shapes the schema. Rather than imposing one ontology top-down, DAAC lets what the market actually queries pull the standard into existence.

Explore the products →
04The prooflive today, not projected

The supply side already exists: nine live systems and four public hubs.

The five products are in design. What already works is the ground they stand on: the same core stands up a bank-onboarding flow, a gold-supply audit, a climate-monitoring platform or a conflict register, each one intricate and purpose-built, not a generic template. Datastake runs nine of these today on DAF, across five sectors and three continents; Frame is its next generation. Every ecosystem it produces becomes a channel through which data that never existed in any system enters the market DAAC is building.

A · The ecosystem apps
KOTALive

Bank KYC & onboarding for artisanal mining

Mining · Banking · DRC
WAZILive

Gold supply-chain due diligence

Mining · Tanzania
NashirikiLive

Civil-society monitoring & evaluation

Governance · M&E · DRC
TazamaLive

Conflict & GBV incident monitoring

Civil society · DRC
StraatosLive

Climate & nature MRV infrastructure

Climate · Senegal · global
SBGLive

Responsible-gold programme platform

Mining · Peru · Colombia
KustawiLive

ASM ESG & investment facilitation

Mining · Finance · DRC
OrekaIn build

Responsible Minerals Credits infrastructure

Mining · Finance · DRC
TunzaIn build

Child-labour monitoring & remediation

Child protection · DRC
Backing these ecosystem apps: named clients, funders & partners
TMBEPRMUN WomenIFEDDSwiss Better GoldThe Impact FacilityWeForestGold StandardHederaBon PasteurMercedes-BenzGoogle.org
B · The public hubs

Where a sector has no trustworthy record, DAAC publishes one: open, source-rated, kept current.

Alongside the data the ecosystem captures, DAAC publishes curated, maintained reference hubs: a public good, a front door to the work, and the ground a sector’s community forms on once it has one shared record to gather around, produced and kept current by Crawl, the public-source engine. Four are live today.

● LiveCritical minerals · DRC

DRC Cobalt Hub

The independent reference for artisanal and industrial cobalt in the DRC: stakeholders, initiatives, a source-rated news feed and a library, kept current. A sector everyone depends on and no one had mapped.

cobalt.daac.ai  ↗
● LiveArtisanal & small-scale mining · global

ASM Hub

The reference for the systems that formalise artisanal mining worldwide: 26 source-rated traceability, certification, formalisation and monitoring platforms, mapped across the sectors that rely on them. In FR, EN and SW.

asm.daac.ai  ↗
● LiveEITI · ITIE-RDC

Extractives Transparency Hub

The open record of who owns what and who gets paid in DRC extractives: contracts, revenues and beneficial ownership, the 2026 Validation, and a source-rated updates feed in EN, FR and SW.

nashiriki.com/eiti  ↗
● LiveClimate & nature · Global South

Climate & Nature Hub

The curated, link-checked front door to Global-South climate work: nature-based projects, funders, standards, developers and datasets, each carrying a freshness stamp so the record can be trusted, not just found.

straatos.io/climate-hub  ↗
More hubs as the data deepens
Responsible goldthe gold value chain, from mine to refinerPlanned
Responsible sourcingdue-diligence standards and regulation, across commoditiesCandidate
Global-South agriculturesoft commodities, from smallholder to exportCandidate
05The opportunitywho needs this, and who pays

Regulation, AI and the energy transition all need the same asset: verified data from the origin.

The value chains the world depends on in these economies, critical minerals, carbon, food, finance, are what generate the demand. They need the origin priced, and priced on evidence.

This demand is not hypothetical: buyers already pay for the answer the hard way, expert visits billed by the week, one site at a time, findings stale on arrival. The six buyers listed here are that same budget, waiting for a better product.

What DAAC prices for them is the data itself, not the commodity it describes. The value chain is the reason the data is worth paying for; the asset on sale is the record.

And the window is opening now: AI agents can finally build the supply, composing apps and rendering sources at machine pace, and they are becoming the demand, querying provenance continuously instead of filing it quarterly.

Increasingly, the buyer is an AI agent acting on their behalf.

Who pays for it
01
Commodity traders & off-takers
Supply-security intelligence to price and commit to frontier supply.
02
Capital allocators
DFIs and sovereign funds deploying nine figures who need field-verified intelligence.
03
Insurers & reinsurers
Pricing risk in environments nobody currently measures.
04
AI labs & data platforms
Correcting the structural bias of models trained on data from everywhere but here.
05
Satellite & remote sensing
Ground-truth labels to validate what imagery can only infer.
06
Compliance
EU Battery Regulation and CSDDD are turning sourcing diligence into law.
06The visionthe ambition

The reference database for the unindexed world, owned by the people who build it.

Search organised the world’s published information. It never reached the places where the information is not published: the mine sites, communities and value-chain origins the world now has to price. DAAC’s ambition is the definitive record of those places, compiled by the people who live there and owned by them throughout: Google, but distributed.

Frontier economies are where traditional information infrastructure failed, and that is exactly why the new model starts there: no incumbent to displace, and everything still to index. Where the old model sends an expert with a notebook, this one is already resident.

The supply side is not a data centre, and not a scraper. It is digital ecosystems rooted where the answers are: apps digitising what people on the ground know, hubs consolidating what is public, third-party systems finally rendered legible. And sharing has an incentive at last: every holder can see where their data goes, set the terms, and get paid, so the supply grows instead of drying up.

Compiled by anyone. Valuable to everyone.

What that becomes
The index of the unindexedA reference layer for the places the web never covered. The ecosystem already reaches villages no map has named.
Supply that stays sovereignThe people and organisations who hold the ground truth compile records they own, see where each one goes, and earn from it.
One record, many buyersThe same data point answers a trader, an insurer, an auditor and an agent, and earns for its owner each time.
07The edgewhy this compounds

The moat is not the software. It is presence, trust, and data that exists nowhere else.

Physical presence. Field agents, site visits, and hardware in places with intermittent power and no addresses. Software can copy software; it cannot show up.

Proprietary data. Every app turns knowledge that was never recorded into structured records. You cannot scrape data that does not yet exist, and once it does, sovereignty keeps it on the owner’s terms.

Earned trust. Years of relationships with cooperatives, regulators, development financiers and civil society across East and Central Africa. No competitor, human or AI, can shortcut that.

Distributed custody. The reflex is to centralise the data and own it. DAAC does the opposite: custody stays distributed and ownership stays intact, priced and traded on the owner’s terms. Respecting ownership is exactly what makes owners willing to compile and exchange more, so supply keeps growing instead of drying up.

Messy markets aren’t a liability. They’re the strategy.

01Physical presence earns trust where competitors won’t go.
02Trust unlocks the right to capture the data.
03Captured data is proprietary: it exists nowhere else.
04Proprietary data makes the market more valuable.
05Value attracts more clients in similarly hard sectors.
↻ more clients → more presence → more data
08Business modelhow it makes money

DAAC earns twice: selling data access now, licensing the infrastructure as it matures.

The exchangeengine · the market

Sells access to the data the ecosystem captures. Buyers pay for what they query, on the owner’s terms; the data itself never changes hands.

  • Per-query marketplace access
  • Data licensing to traders, insurers, AI labs
  • Transaction fees on every exchange
The five productsengine · the rails

Licenses the infrastructure itself to everyone building in the ecosystem. Pre-revenue, by sequence: the rails monetise once supply has critical mass.

  • Spine protocol licence & conformance
  • Frame application-framework licences
  • Forge agentic build seats
  • Scribe conformance throughput
  • Crawl public-data feeds & hub sponsorship
First: supply
Digitise & recycle
Turn knowledge that was never data into structured assets, and unlock what is trapped in existing systems. Underway, through the live ecosystem apps.
Then: infrastructure
Lay the rails
Spine, Frame, Forge, Scribe and Crawl: discovery, conformance, exchange, chain of custody, and the public record. The tooling that makes a sovereign data market trustworthy. In active design.
Then: demand
Open the market
Reach the buyers who need data from sectors and geographies that were always opaque. The commercial push, once supply has critical mass.
09The ask

The data was always there. The market wasn’t.

DAAC is built on a decade of fieldwork, a live ecosystem already capturing the supply, and the AI shift that finally makes the market buildable. Its five products, Spine, Frame, Forge, Scribe and Crawl, turn that data into a commodity. For investors in frontier-market infrastructure, data, or the agent economy, the conversation starts here.

Nine years of fieldwork before the first product shipped
2017BetterChain founded. Mineral buyers were paying EUR 10,000 a week for one expert to assess risk at one mining site. BetterChain crowdsourced that answer from local sources instead. The founding insight: more sources make every piece of information more valuable.
2018Datastake takes shape. The crowdsourced data marketplace becomes the work: a standard for assessing and sharing supply-chain risk mattered more than the chain.
2019Into the field. First institutional backing. Field operations begin in the DRC.
2020First production app. Deployed in Ituri, DRC. Real data, captured at the source.
2022The DAAC thesis. The apps were never the business. The data they generate is.
2024Eight apps, four sectors. Mining, climate, banking, civil society, on one shared framework.
2026DAAC, defined. Spine, Frame, Forge, Scribe, Crawl. The market gets built.

Where it stands.The ecosystem is live, in production, and generating revenue today. DAAC’s five products, Spine, Frame, Forge, Scribe and Crawl, are in active design (Crawl’s public hubs already run), sequenced supply-first by deliberate choice: the market opens once the data reaches critical mass. The moment to build it is now.

daac.ai · the data market for frontier economies