SEC filings, earnings data, financial news, crypto market data, and alternative data signals from public US financial sources — structured for hedge funds, fintech platforms, equity research, and quantitative trading teams.
Alternative data is one of the highest-paying segments in web scraping — hedge funds, equity research teams, fintech platforms, and trading firms invest heavily in any data signal that produces alpha. We extract from SEC EDGAR, financial news sites, public crypto exchanges, and other US financial sources. Note: real-time exchange data typically requires licensed market-data feeds. We focus on the layer above — filings, news, and alternative signals that complement, not replace, your market-data subscription.
10-K, 10-Q, 8-K, S-1, proxy statements, insider transaction filings (Form 4) — full text and structured extraction of key financial line items.
Earnings release schedules, transcripts where publicly available, consensus estimates from public sources, and earnings surprises tracking.
Real-time aggregation from public news sources, company press releases, and corporate announcements with entity tagging.
Public exchange order book snapshots, OHLC data, on-chain analytics from public block explorers, NFT marketplace data.
Job posting velocity (hiring as growth signal), product launch announcements, executive movement tracking, public satellite/parking-lot data references.
Form 4 insider transactions, 13F holdings updates, beneficial ownership changes — structured and time-series ready.
| Source | Type | Ticker / Entity | Document/Event | Date | Key Data | Captured (UTC) |
|---|---|---|---|---|---|---|
| SEC EDGAR | 10-Q Filing | AAPL | Q2 2026 Quarterly Report | 2026-05-01 | Revenue $94.8B, Net $24.2B | 2026-05-19 10:30 |
| SEC EDGAR | Form 4 | MSFT | Insider purchase by Director X | 2026-05-15 | 12,000 shares @ $412.50 | 2026-05-19 10:30 |
| Public Crypto | OHLC | BTC/USD | Daily candle | 2026-05-18 | O:67400 H:68900 L:67100 C:68450 | 2026-05-19 10:30 |
Sources we typically cover: SEC EDGAR · Yahoo Finance · MarketWatch · Seeking Alpha (public pages) · Public crypto exchanges (Coinbase public data, Binance public API) · Public block explorers (Etherscan, Solscan) · Company investor relations pages · Public financial news outlets
Build alternative data signals on top of public web sources to inform trading decisions.
Power consumer-facing fintech products with financial data, filings access, and crypto market information.
Aggregate SEC filings, news, and earnings data at scale for sector and security analysis.
Time-series datasets for backtesting and signal development across alternative data sources.
30-min call to confirm sources, fields, frequency, and output schema.
Sample dataset delivered for your team to validate coverage and quality.
Scheduled jobs, monitoring, retries, reporting — backed by uptime SLA.
Real-time exchange data requires licensed market-data feeds (Bloomberg, Refinitiv, IEX Cloud, Polygon.io). We do not bypass exchange licensing. We can extract delayed quotes from public sources and integrate with your licensed feed for the real-time layer.
Yes — SEC EDGAR is a fully public dataset specifically built for programmatic access. We pull from the official SEC EDGAR API and supplement with structured extraction from filing text.
We extract from public exchange APIs (Coinbase, Kraken, Binance public endpoints) and public on-chain data (block explorers). Licensed exchange feeds for institutional-grade data are recommended for production trading.
Yes — alt-data signal development is one of our common engagements for hedge fund clients. Tell us the hypothesis you want to test (e.g., job posting velocity as growth signal) and we'll spec a dataset.