@yon3zu yon3zu commented Aug 22, 2025

Description of changes:
This pull request adds the xReverseLabs Domain Dataset to the AWS Open Data Registry.

Dataset details:

  • Daily Domain Dump Dataset (plain text, one domain per line)
  • Domain by Date Full Data (historical daily unique domains)
  • Domain by Extension Dataset (domains grouped by TLD, .txt.gz format)
  • Forward DNS (FDNS) Dataset (JSON records of DNS resolutions)
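For anyone reading along, here is a rough sketch of how the three formats listed above might be read in Python. It runs on made-up in-memory samples rather than real files, and the FDNS field names (`name`, `type`, `value`) and the one-JSON-object-per-line layout are assumptions for illustration, not the documented schema; please check the documentation for the actual layout.

```python
import gzip
import io
import json
from collections import Counter


def iter_domains(lines):
    """Yield normalized domains from an iterable of text lines (one domain per line)."""
    for line in lines:
        domain = line.strip().lower().rstrip(".")
        if domain:
            yield domain


def count_by_tld(domains):
    """Count domains by their last label (a rough stand-in for the TLD)."""
    return Counter(d.rsplit(".", 1)[-1] for d in domains)


def read_gzip_domains(raw_bytes):
    """Decompress a .txt.gz payload and return its domains as a list."""
    with gzip.open(io.BytesIO(raw_bytes), mode="rt", encoding="utf-8") as fh:
        return list(iter_domains(fh))


def iter_fdns_records(lines):
    """Parse newline-delimited JSON records, skipping blank lines."""
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line)


# Made-up sample data standing in for real dataset files (all hypothetical).
sample_dump = ["Example.com\n", "\n", "example.org.\n", "test.com\n"]
sample_gz = gzip.compress(b"alpha.net\nbeta.net\n")
# Assumed record shape; the real FDNS fields may differ.
sample_fdns = ['{"name": "example.com", "type": "a", "value": "192.0.2.1"}\n']

domains = list(iter_domains(sample_dump))
tld_counts = count_by_tld(domains)
gz_domains = read_gzip_domains(sample_gz)
fdns_records = list(iter_fdns_records(sample_fdns))
```

For real files, the same helpers should work by passing an open file handle to `iter_domains` or `iter_fdns_records`, which avoids loading a full daily dump into memory.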

Repository with example notebooks:
https://github.com/xReverseLabs/open-data-examples

Documentation:
https://opendata.xreverselabs.org/about.php

License: CC-BY 4.0

This dataset supports threat intelligence, academic research, and large-scale DNS measurements.

@pschmied
Contributor

Hi @yon3zu, thank you for opening this pull request, and apologies for the delay in our review. This is a good start to proceed with onboarding. Two items:

  1. Can you ensure that you have permitted us, as upstream maintainers, to modify your pull request per these instructions? https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/allowing-changes-to-a-pull-request-branch-created-from-a-fork

We will need to ensure that your branch is up-to-date with any other changes that have happened in the Registry in the meantime.

  2. Your notebook tutorial is a good start. Before launch, we'll ask that you expand on at least one of the ideas in the last section. We'd like to see enough of a problem description there that someone could read your problem statement and run with it. The current text is too sparse and ambiguous for a reader to understand the problem or challenge you are issuing.
