Data stored in the lake is actionable and discoverable, increasing accessibility to large and complex datasets.
Open Energy Data Initiative (OEDI)
The OEDI Data Lake is a centralized repository of high-value energy research datasets aggregated from the U.S. Department of Energy’s Programs, Offices, and National Laboratories. Built to enable data discoverability, OEDI facilitates access to a broad network of findings, including the data available in technology-specific catalogs like the Geothermal Data Repository and Marine Hydrokinetic Data Repository.
- Universal access to DOE-funded research projects
- Intuitive upload and submission processes
- Standardized submissions and metadata, curated by a team of experts
- Effective safeguards to secure datasets, including moratoriums on publication dates
What is a Data Lake?
A data lake is a collection of curated and diverse datasets built to accelerate accessibility and collaboration. Data lakes typically hold raw data files, scientific reports, supporting media, and links to online documents chosen by the contributing researchers. The lake enables sustained access to large data files, often through partnerships with a variety of cloud vendors.
Information flows into the data lake from a variety of sources: private industry, laboratories, analytic tools, use cases, research reports, and more. Improved availability and visibility allows researchers to easily access the findings, conduct inventive analytics, and collaborate with the authors of a publication. New insights and innovations flow outward from the lake, creating opportunity for even more rounds of research and development.
Open Data Registry Public Datasets
OEDI is powered by OpenEI, an energy information portal sponsored by the U.S. Department of Energy and developed by the National Renewable Energy Laboratory in support of the Open Government Initiative to make energy data transparent, participatory, and collaborative.