From 3664f2bd88c93e89202eb2c9d8ae538a44060c57 Mon Sep 17 00:00:00 2001 From: Chris Deline Date: Tue, 2 Sep 2025 10:40:35 -0600 Subject: [PATCH 1/2] Update entry for 7334 with 5-minute averages. Fixes #40 Added a new dataset entry for ID 7334 with details on averaging. --- pvdaq.md | 1 + 1 file changed, 1 insertion(+) diff --git a/pvdaq.md b/pvdaq.md index 20b0515..292d976 100644 --- a/pvdaq.md +++ b/pvdaq.md @@ -25,6 +25,7 @@ The Data Prize entries were submitted in one of two categories: systems < 5 MW D #### > 5 MW DC system IDs: * **7333** - A 257 MW Single-axis tracker facility in California. This dataset is at a very high time resolution of 10s for all channels. +* **7334** - This is the same dataset as 7333, except averaged to 5-minute intervals. This makes it more convenient to download and analyze. * **9069** - A 38.7 MW Fixed ground-mount facility in Georgia #### Details on the Prize Datasets From 7f2897fe832861b3c9c9dee579566f5047a0bf1d Mon Sep 17 00:00:00 2001 From: Chris Deline Date: Tue, 2 Sep 2025 10:46:31 -0600 Subject: [PATCH 2/2] Document PV Output and PVDB dataset details Added sections for PV Output and PVDB datasets with details on data sharing and collaboration. --- pvdaq.md | 8 ++++++++ 1 file changed, 8 insertions(+) diff --git a/pvdaq.md b/pvdaq.md index 292d976..14e821a 100644 --- a/pvdaq.md +++ b/pvdaq.md @@ -34,6 +34,14 @@ These datasets differ from the regular PVDAQ repository storage architecture (Se **Note:** *Some of the prize datasets are extremely large and can have 10s of GBs of data. These could take a long time to download so please plan accordingly* +### PV Output +[PVOutput.org](https://pvoutput.org) is a free service for sharing, comparing and monitoring live solar photovoltaic (PV) and energy consumption data. Through the site, users can upload their own system production data, either manually or via API credentials. To date, over 800MW of systems are being recorded in this way. This PVDAQ site is sharing daily rolled-up energy data for PVOutput systems under a commercial license agreement. Full 5-minute data can be obtained via a [paid commercial license with PVOutput.org](https://pvoutput.org/services.jsp?id=101001). + +A subset of systems are shared here which have reached a data completeness / quality score of 0.6 as determined by [solardatatools](https://github.com/slacgismo/solar-data-tools), and a dataset length of at least 4 years. These systems are primarily residential systems which are self-reported by users. Additional details for the systems can be investigated at their owners sites at pvoutput.org, linked through the description pages. + +### PVDB +The PVDB dataset was made available through a collaboration between SunPower and the [SLAC GISMo team](https://esd.slac.stanford.edu/divisions/applied-energy). There are 226 residential sites and 12 commercial rooftop/carport sites in this dataset, all located in Orange County, California. + ## Data Dictionary The PVDAQ data is partitioned by system_id, year, month and day. Raw data is reported at 15 minute increments in ISO 8601 date and time. The timestamp is striped and data is averaged daily. An example file output is included here.