Custom datasets, sourced and built to research standards
We cover the full path from fragmented sources to a clean, documented dataset — pick the pieces you need, or hand us the whole problem.
DAO & Governance Datasets
Structured records of proposals, votes, treasuries, delegates, and participation across governance portals and DAO tooling.
Historical Token Price Data
Deep price history assembled and cross-checked across multiple financial APIs, with manual reconstruction where APIs fall short.
On-chain Data Extraction
Transactions, transfers, contract events, and wallet activity pulled directly from block explorers and node data.
Exchange & Market Data
Spot and derivatives market data — OHLCV, volume, and liquidity — normalized across centralized and decentralized venues.
Research Data Cleaning
Messy, multi-source data turned into validated, analysis-ready tables with consistent schemas and documented assumptions.
API + Manual Data Reconciliation
Gaps in automated sources filled with careful manual extraction, then reconciled against APIs for coverage and accuracy.
Fuzzy Matching & Entity Resolution
DAO names mapped to tickers, contracts, and entities using fuzzy matching and human review to eliminate mismatches.
CSV / Excel / Postgres Delivery
Final datasets delivered in the format your team works in — CSV, Excel, JSON, Google Sheets, or a Postgres-ready schema.
From research question
to research-ready dataset.
A repeatable process built around one goal: data you can trust, with every assumption documented.
Define research question
We start from the analysis you're trying to run — the entities, time range, and fields that actually matter for your work.
Identify available and hidden data sources
We map where the data lives: public APIs, dashboards, block explorers, governance portals, charts, and sources most vendors overlook.
Extract from APIs, web, charts, and on-chain sources
We pull data through automated pipelines and, where APIs fall short, careful web, dashboard, and chart extraction.
Clean, normalize, deduplicate, and reconcile
We standardize schemas, normalize across USD/BTC/ETH, remove duplicates, and reconcile conflicting sources into one trusted table.
Deliver research-ready dataset with documentation
You receive a validated dataset in your preferred format, with documentation covering sources, coverage, and known limitations.
Built for people who
take data seriously.
Our clients share one thing: they need data that will stand up to scrutiny — in a paper, a memo, a model, or an investment decision.
Have a dataset that doesn't exist yet?
Tell us the research question and where the data lives. We'll figure out how to build it — book a call or send a request and we'll reply within 24 hours.