diff --git a/.linkcheck.json b/.linkcheck.json
index 64e9bd1cc..0bcc9fc4a 100644
--- a/.linkcheck.json
+++ b/.linkcheck.json
@@ -22,6 +22,7 @@
     { "pattern": "^#bigquery-materialized-views$" },
     { "pattern": "^#looker-pdts--aggregate-awareness$" },
     { "pattern": "^#experiment-unpacking$" },
-    { "pattern": "/v2-system-addon/data_events.html" }
+    { "pattern": "/v2-system-addon/data_events.html" },
+    { "pattern": "^https://console.cloud.google.com/gcr/images/" }
   ]
 }
diff --git a/src/cookbooks/bigquery/querying.md b/src/cookbooks/bigquery/querying.md
index f43586ecd..f11dc2100 100644
--- a/src/cookbooks/bigquery/querying.md
+++ b/src/cookbooks/bigquery/querying.md
@@ -27,32 +27,32 @@ projects to maintain BigQuery [datasets](https://cloud.google.com/bigquery/docs/

 ### Projects with BigQuery datasets

-| Project | Dataset | Purpose |
-| ------------------------------- | --------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
-| `mozdata` | | The primary home for user analysis; it has a short name that is easy to type and is filled with views that reference underlying tables in `moz-fx-data-shared-prod`; the default project for STMO and Looker |
-| | `analysis` | User-generated tables for analysis; please prefix tables with your username |
-| | `tmp` | User-generated tables for ephemeral analysis results; tables created here are automatically deleted after 7 days. |
-| | `telemetry` | Views into legacy desktop telemetry pings and many derived tables; see _user-facing (unsuffixed) datasets_ below |
-| | `` | See _user-facing (unsuffixed) datasets_ below |
-| | `search` | Search data imported from parquet (_restricted_) |
-| | `static` | Static tables, often useful for data-enriching joins |
-| | `udf` | Internal persistent user-defined functions defined in SQL; see [Using UDFs](#using-udfs) |
-| | `udf_js` | Internal user-defined functions defined in JavaScript; see [Using UDFs](#using-udfs) |
-| `mozfun` | | The primary home for user-defined functions; see [Using UDFs](#using-udfs) |
-| `moz-fx-data-bq-` | | Some teams have specialized needs and can be provisioned a team-specific project |
-| `moz-fx-data-shared-prod` | | All production data including full pings and derived datasets defined in [bigquery-etl](https://github.com/mozilla/bigquery-etl) |
-| | `_live` | See _live datasets_ below |
-| | `_stable` | See _stable datasets_ below |
-| | `_derived` | See _derived datasets_ below |
-| | `_external` | Tables that reference external resources; these may be native BigQuery tables populated by a job that queries an third-party API, or they may be [federated data sources](https://cloud.google.com/bigquery/external-data-sources) that pull data from other GCP services like GCS at query time. |
-| | `backfill` | Temporary staging area for back-fills |
-| | `blpadi` | Blocklist ping derived data(_restricted_) |
-| | `payload_bytes_raw` | Raw JSON payloads as received from clients, used for reprocessing scenarios, a.k.a. "landfill" (_restricted_) |
-| | `payload_bytes_error` | `gzip`-compressed JSON payloads that were rejected in some phase of the pipeline; particularly useful for investigating schema validation errors |
-| | `tmp` | Temporary staging area for parquet data loads |
-| | `validation` | Temporary staging area for validation |
-| `moz-fx-data-derived-datasets` | | Legacy project that was a precursor to `mozdata` |
-| `moz-fx-data-shar-nonprod-efed` | | Non-production data produced by stage ingestion infrastructure |
+| Project | Dataset | Purpose |
+| ------------------------------- | --------------------- | ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
+| `mozdata` | | The primary home for user analysis; it has a short name that is easy to type and is filled with views that reference underlying tables in `moz-fx-data-shared-prod`; the default project for STMO and Looker |
+| | `analysis` | User-generated tables for analysis; please prefix tables with your username |
+| | `tmp` | User-generated tables for ephemeral analysis results; tables created here are automatically deleted after 7 days. |
+| | `telemetry` | Views into legacy desktop telemetry pings and many derived tables; see _user-facing (unsuffixed) datasets_ below |
+| | `` | See _user-facing (unsuffixed) datasets_ below |
+| | `search` | Search data imported from parquet (_restricted_) |
+| | `static` | Static tables, often useful for data-enriching joins |
+| | `udf` | Internal persistent user-defined functions defined in SQL; see [Using UDFs](#using-udfs) |
+| | `udf_js` | Internal user-defined functions defined in JavaScript; see [Using UDFs](#using-udfs) |
+| `mozfun` | | The primary home for user-defined functions; see [Using UDFs](#using-udfs) |
+| `moz-fx-data-bq-` | | Some teams have specialized needs and can be provisioned a team-specific project |
+| `moz-fx-data-shared-prod` | | All production data including full pings and derived datasets defined in [bigquery-etl](https://github.com/mozilla/bigquery-etl) |
+| | `_live` | See _live datasets_ below |
+| | `_stable` | See _stable datasets_ below |
+| | `_derived` | See _derived datasets_ below |
+| | `_external` | Tables that reference external resources; these may be native BigQuery tables populated by a job that queries an third-party API, or they may be [federated data sources](https://docs.cloud.google.com/bigquery/docs/external-data-sources) that pull data from other GCP services like GCS at query time. |
+| | `backfill` | Temporary staging area for back-fills |
+| | `blpadi` | Blocklist ping derived data(_restricted_) |
+| | `payload_bytes_raw` | Raw JSON payloads as received from clients, used for reprocessing scenarios, a.k.a. "landfill" (_restricted_) |
+| | `payload_bytes_error` | `gzip`-compressed JSON payloads that were rejected in some phase of the pipeline; particularly useful for investigating schema validation errors |
+| | `tmp` | Temporary staging area for parquet data loads |
+| | `validation` | Temporary staging area for validation |
+| `moz-fx-data-derived-datasets` | | Legacy project that was a precursor to `mozdata` |
+| `moz-fx-data-shar-nonprod-efed` | | Non-production data produced by stage ingestion infrastructure |

 ### Table Layout and Naming

diff --git a/src/datasets/fxa.md b/src/datasets/fxa.md
index 61af46b37..d6173843c 100644
--- a/src/datasets/fxa.md
+++ b/src/datasets/fxa.md
@@ -20,8 +20,6 @@ The [Mozilla accounts documentation](https://mozilla.github.io/ecosystem-platfor
   - Requires FxA.
 - [AMO](https://addons.mozilla.org/en-US/firefox/)
   - For developer accounts; not required by end-users to use or download addons.
-- [Pocket](https://getpocket.com/login/?ep=1)
-  - FxA is an optional authentication method among others.
 - [Monitor](https://monitor.firefox.com)
   - Required to receive email alerts. Not required for email scans.
 - [Relay](https://relay.firefox.com/)