Help Center
How can we help? 👋

What Document Types are in the Halcyon Data Catalog?

The various document types and information formats available via Halcyon

What Document Types are in the Halcyon Data Catalog?

Halcyon’s Data Catalog contains all published documents from the publishers listed above – the PUCs, the ISOs, and FERC – going back to Jan 1, 2020. It also contains SEC filings for the largest 8,000 US public companies going back to 2020. Opportunistically, we have augmented the catalog with additional authoritative sources and documents, e.g. California Energy Commission and select documents published by DOE and the White House.

Halcyon’s document catalog contains millions of documents and counting, and we add an average of ~2,000 new documents per day.

Halcyon’s document catalog includes PDFs, doc and docx files, websites, and raw JSON data. We typically do not collect xlsx files (though we have some) or image files, though we do process images and data tables that appear in PDFs.

Traversing, collecting, and organizing a high volume of data published across 50+ disparate sources is technically challenging and subject to source websites working properly. If you can’t find information you believe Halcyon should have, please email support@halcyon.io.

Alternatively, if there are document sources that would be valuable to your work that Halcyon does not currently offer, you can request them here.

 
Did this answer your question?
😞
😐
🤩