SSBD:downloads

ssbd-000026

SSBD:downloads provides download links, folder guides, and command examples for SSBD datasets. This page is designed for researchers who want to download data without advanced command-line knowledge.

All data can be used according to their respective licenses. Please refer to SSBD:repository and SSBD:database for license details.

To avoid excessive load on the download server, please keep the parallel download settings moderate, for example -x 4 -s 4 -j 2. Start with the folder guide below, then choose either a fast download method or a verified download method.

How to download and verify files
  • Use aria2c if you want faster downloads.
  • Use wget -x -nH if you want to keep the folder structure from the server.
  • Use sha256sum or md5sum to check that downloaded files match the published checksums.

Important: aria2c downloads files into the current folder and does not recreate the original folder paths from the URLs. Because of this, checksum verification may fail when the checksum file includes paths.

If you want checksum verification to work reliably, use wget with directory creation enabled and run the checksum command from the dataset directory.

Loading dataset metadata ...

What is in each folder?

If both bundled files and individual files are available, bundled files are usually easier to download and manage.

Project names and URLs

Metadata and checksums

Bundles

Individual files

Command examples

Fast download

Use this method when download speed is the priority.

aria2c -c -x 4 -s 4 -j 2 --min-split-size=100M -i source-manifest.txt
aria2c -c -x 4 -s 4 -j 2 --min-split-size=100M -i raw_image-manifest.txt

Verified download

Use this method when you want to preserve folder paths for checksum verification.

wget -x -nH -c -i source-manifest.txt
wget -x -nH -c -i raw_image-manifest.txt

Check downloaded files

Run checksum verification from the dataset directory so that the paths in the checksum file match your downloaded files.

sha256sum -c --ignore-missing sha256sum.txt

File tree

Folders load on demand.

Loading .tree/root.json ...