With the current RTX 3060 GPU, training on the full dataset may take a month or more. Debugging will be even harder, which is not conducive to quick iterative learning. Reports from runs with roughly 12x more GPU compute show the tokenizer learning more granular forms, which affects the subsequent learning process.
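A back-of-envelope estimate makes the gap concrete. The throughput numbers below are illustrative assumptions (roughly 15 frames/s through the tokenizer on one RTX 3060, and a ~12x effective speedup from the larger cluster), not measurements:

```python
# Back-of-envelope wall-clock estimate. All throughput figures are
# assumed for illustration, not measured.

def training_days(total_frames: int, epochs: int, frames_per_sec: float) -> float:
    """Days of wall-clock time for `epochs` passes over the dataset."""
    seconds = total_frames * epochs / frames_per_sec
    return seconds / 86_400  # seconds per day

FULL_DATASET = 4_350_893  # frames in our full MineRL dataset

# Assumed: ~15 frames/s sustained on a single RTX 3060.
single_gpu = training_days(FULL_DATASET, epochs=10, frames_per_sec=15)
# Assumed: ~12x effective speedup on the 8x V100 setup.
cluster = training_days(FULL_DATASET, epochs=10, frames_per_sec=15 * 12)
print(f"~{single_gpu:.0f} days on one GPU vs ~{cluster:.1f} days on the cluster")
```

Under these assumptions the single-GPU run lands around a month, consistent with the estimate above, while the cluster finishes in a few days.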
- Reference: https://github.com/HKimiwada/Dreamer4
- Trained on 8x 16GB V100 GPUs using data from zhwang4ai/OpenAI-Minecraft-Contractor (pixels per frame are no different from the current config)

| Implementation | Source | Target Domain | GPU Requirements |
|---|---|---|---|
| Dreamer4 | V100 impl | Minecraft/Atari | 8× 16GB V100 |
| dreamer4-experiments | lucidrains | Generic | Not specified |
| dreamer4 | Nicklas Hansen | DMControl (30 tasks) | 8× 24GB RTX 3090 |

| Dataset | Frames | Resolution | Storage | Trajectories |
|---|---|---|---|---|
| Our MineRL (full) | 4,350,893 | 64×64 | 52GB | 759 |
| Our MineRL (subset) | 71,279 | 64×64 | 864MB | 10 |
| Hansen DMControl | 3,600,000 | 128×128 | 350GB (processed) | 7,200 |
| OpenAI Contractor | Millions | 360×640 | Large | Thousands |
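A quick sanity check on the storage column: uncompressed RGB at 1 byte per channel nearly reproduces the on-disk sizes (the DMControl figure of 350GB presumably includes processed extras beyond raw frames; that is an assumption):

```python
def raw_storage_gb(frames: int, height: int, width: int, channels: int = 3) -> float:
    """Uncompressed RGB storage in GB (1 GB = 1e9 bytes, 1 byte/channel)."""
    return frames * height * width * channels / 1e9

# Our full MineRL set: ~53.5 GB raw, close to the 52GB on disk.
minerl_full = raw_storage_gb(4_350_893, 64, 64)
# Hansen DMControl: ~177 GB raw; the 350GB figure is the processed total.
dmcontrol = raw_storage_gb(3_600_000, 128, 128)
print(f"MineRL: {minerl_full:.1f} GB raw, DMControl: {dmcontrol:.1f} GB raw")
```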
It did learn to predict pixels by epoch 9.

Dreamer 4 absorbs the majority of its knowledge from unlabeled videos, and requires only a small number of videos paired with actions.
MineRL is fully labeled (every frame has expert actions). You're wasting compute feeding high-entropy action embeddings when the model needs unlabeled diversity to learn physics. The 2B parameters will overfit to the 759 trajectories instead of learning generalizable world simulation.
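One way to approximate the paper's mostly-unlabeled data mix with a fully labeled corpus is to simply drop the action labels for most trajectories during preprocessing. This is a hypothetical sketch, not part of any Dreamer 4 codebase; `split_action_labels` and the 5% fraction are illustrative choices:

```python
import random

def split_action_labels(trajectory_ids, labeled_fraction=0.05, seed=0):
    """Keep action labels for only a small fraction of trajectories;
    treat the rest as unlabeled video. Hypothetical preprocessing step
    mimicking Dreamer 4's unlabeled-heavy data mix."""
    rng = random.Random(seed)  # fixed seed so the split is reproducible
    ids = list(trajectory_ids)
    rng.shuffle(ids)
    n_labeled = max(1, int(len(ids) * labeled_fraction))
    labeled = set(ids[:n_labeled])
    return {tid: (tid in labeled) for tid in ids}

# With our 759 trajectories, 5% keep their expert actions.
mask = split_action_labels(range(759), labeled_fraction=0.05)
print(sum(mask.values()), "of", len(mask), "trajectories keep action labels")
```

The model would then see the remaining 95% as action-free video, which is closer to the regime the paper trains in than feeding every frame its expert action.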
The paper uses 360×640 (VPT) or at least 128×128 (DMControl). At 64×64:
- Inventory/crafting UI is illegible (critical for diamonds)
- Block textures blur together (diamond ore vs stone)
- The mouse cursor is invisible (needed for action grounding)
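The texture-blurring point is easy to demonstrate with average-pooling, which is what a naive 2x downscale does. A minimal sketch (the synthetic "ore vein" frame is invented for illustration):

```python
import numpy as np

def downsample(frame: np.ndarray, factor: int) -> np.ndarray:
    """Average-pool an HxWxC frame by `factor` in each spatial dimension."""
    h, w, c = frame.shape
    return frame.reshape(h // factor, factor, w // factor, factor, c).mean(axis=(1, 3))

# A 1-pixel-wide distinctive feature (e.g. an ore fleck) at 128x128.
frame = np.zeros((128, 128, 3))
frame[:, 64, 0] = 1.0

small = downsample(frame, 2)  # 64x64
# The feature's contrast is halved: it blends with the neighboring column.
print(small.shape, small[0, 32, 0])
```

Going from 128×128 to 64×64 already halves the contrast of single-pixel detail; the fine features that distinguish diamond ore from stone degrade the same way.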




