IntuiTF: MLLM-Guided Transfer Function Optimization for Direct Volume Rendering

Table of Contents

About The Project
Getting Started
- Prerequisites
- Installation
Usage
Evaluation
BibTeX

We present IntuiTF, a novel framework that bridges the semantic gap between user intent and transfer function design in Direct Volume Rendering. Our system allows users to intuitively design transfer function by specify visualization goals through multimodal interactions, built upon two key innovations: (1) an evolution-based TF space explorer that effectively navigates the vast parameter space, and (2) a MLLM-guided human-aligned evaluator that provides intelligent visual guidance. Through these components, users can intuitively express their visualization goals while the system automatically optimizes TFs to match their intent. Our framework demonstrates superior generalizability across various volumetric datasets. Please checkout more cases in our supplementary materials.

About The Project

IntuiTF is an intelligent transfer function (TF) design framework that bridges the semantic gap between user visual intent and transfer function design in Direct Volume Rendering. By leveraging Multi-modal Large Language Models (MLLMs), our system makes volumetric data visualization more intuitive and efficient.

Here's what makes IntuiTF special:

🧬 Evolution-Driven Explorer: Employs genetic algorithms with efficient TF encoding to systematically generate, mutate, and select diverse transfer function candidates across the vast parameter space.
🤖 Human-Aligned Evaluator: Leverages Multi-modal Large Language Models to provide visual quality assessment that aligns with human judgment and user intent, replacing static heuristics with intelligent evaluation.
🎯 Intent-Driven Optimization: Bridges the semantic gap between high-level user visualization goals and low-level transfer function parameters through multimodal interaction and automated optimization.
🌐 Generalized Framework: Demonstrates effectiveness across diverse volumetric datasets and visualization scenarios without requiring domain-specific priors or volume-specific training.

Compared to existing approaches, IntuiTF offers:

Exploration of unknown volumes from the start
Reduced semantic gap between user intent and TF design
Automated optimization guided by human-aligned evaluation
Enhanced generalizability across diverse datasets

We welcome suggestions and contributions! Feel free to fork the repo, create a pull request, or open an issue.

(back to top)

🚀 Getting Started

Follow these steps to set up TFEvolve locally.

🛠️ Prerequisites

Make sure you have the following installed:

Python 3.8.8
CUDA (for GPU acceleration)
Conda
DiffDVR (https://github.com/shamanDevel/DiffDVR)

📦 Installation

Clone the repository:

git clone https://github.com/wyysteelhead/TFevolve.git
cd TFevolve

Create and activate the conda environment:

conda create -n tfevolve python=3.8.8
conda activate tfevolve

Install DiffDVR first:

git clone https://github.com/shamanDevel/DiffDVR.git
cd DiffDVR
# Follow DiffDVR installation instructions
cd ..

Install dependencies:

pip install -r requirements.txt

(back to top)

🧑‍💻 Usage

You can run the genetic optimization directly using the genetic.py script. Here's the complete command structure:

python genetic_optimize/genetic.py \
    --base_url [YOUR_API_URL] \
    --api_key [YOUR_API_KEY] \
    --config_file "[/path/to/config.json]" \
    --prompt_folder ./prompt \
    --population_size 50 \
    --generations 30 \
    --save_interval 1 \
    --instruct_number "[task_name]" \
    --save_path "[/path/to/save/results]" \
    --quality_metrics "16,11,14" \
    --text_metrics "5" \
    --intent_interval "16" \
    --bg_color "(255,255,255)" \
    --style_image "[/path/to/style/image.png]" \
    --model_name [MODEL_NAME]

Command Parameters

--base_url: API endpoint URL
--api_key: Your API key
--config_file: Path to the dataset configuration file, our example config file locates in ./diffdvr/config-files
--prompt_folder: Directory containing prompt templates and instructions (our settings locates in ./prompt)
--population_size: Size of the genetic algorithm population
--generations: Number of generations to run
--save_interval: How often to save results (default: 1)
--instruct_number: The text instruction to use. Our instructions are kept in ./prompt/instructions.json. To customize your own instructon, add a new record in instructions.json, and call the unique id of the record
--save_path: Where to save the results
--quality_metrics: Quality assessment metrics defined in TFEvolve/prompt/aspects.json (comma-separated, default: "16,11,14")
--text_metrics: Text-based metrics (default: "5")
--intent_interval: Start round of intent alignment (default: "16")
--bg_color: Background color in RGB format (default: "(255,255,255)")
--style_image: Path to style reference image
--model_name: Name of the model to use (default: "gemini-2.0-flash-001")

Example Configurations

Image-based transfer function design:

python genetic_optimize/genetic.py \
    --config_file "./diffdvr/config-files/head.json" \
    --population_size 50 \
    --generations 30 \
    --save_interval 5 \
    --save_path "./results/head_image" \
    --style_image "./prompt/head.png"
    --quality_metrics "16,11,14" \
    --intent_interval "16"

Text-based transfer function design:

python genetic_optimize/genetic.py \
    --config_file "./diffdvr/config-files/engine.json" \
    --population_size 50 \
    --generations 30 \
    --save_interval 5 \
    --save_path "./results/engine_text" \
    --instruct_number "engine1" \
    --quality_metrics "16,11,14" \
    --text_metrics "5" \
    --intent_interval "16"

Note: The script is typically run in a tmux session for long-running tasks. You can use the task scripts which handle this automatically, or set up your own tmux session:

# Create a new tmux session
tmux new-session -d -s "task_name"

# Send the command to the tmux session
tmux send-keys -t "task_name" "conda activate genetic" C-m
tmux send-keys -t "task_name" "python genetic_optimize/genetic.py [parameters...]" C-m

(back to top)

Project Structure

TFEvolve/
├── diffdvr/              # DiffDVR integration and configuration
│   ├── config-files/     # Configuration files for different datasets
│   ├── parametrizations.py # Parameter definitions
│   ├── settings.py       # Global settings
│   └── utils.py         # Utility functions
├── genetic.py           # Main entry point for genetic algorithm
├── genetic_optimize/    # Core genetic optimization implementation
│   ├── api/            # LLM API integration
│   │   ├── llmapi.py   # Base LLM API interface
│   │   ├── openai.py   # OpenAI API implementation
│   │   └── gemini.py   # Google Gemini API implementation
│   ├── config/         # Configuration management
│   │   ├── config_manager.py # Configuration handling
│   │   └── default_config.json # Default configuration
│   ├── eval/           # Evaluation and ranking modules
│   │   ├── metric_eval.py # Baseline metric-based evaluation
│   │   ├── llm_eval.py # LLM-based evaluation
│   │   └── elo_rating.py # ELO rating system for ranking
│   ├── states/         # State management and configuration
│   │   ├── bound.py    # Boundary settings of rendering parameters
│   │   ├── evaluation_state.py # Evaluation result state
│   │   ├── gaussian.py # Gaussian state
│   │   └── genetic_config.py # Genetic algorithm parameter state
│   ├── utils/          # Utility functions
│   │   ├── file_utils.py # File operations
│   │   ├── image_utils.py # Image processing
│   │   └── thread.py   # Threading utilities
│   └── visualize/      # Visualization tools
│       └── gaussian_visualizer.py # Gaussian visualization
├── prompt/             # Prompt templates and assets
│   ├── instructions.json # Instructions examples
│   └── prompt_format.txt # Standard format of prompt
├── volumes/           # Volume data files, currently support cvol volume files
└── requirements.txt   # Project dependencies

Component Relationships

Core Components:
- genetic.py: The main implementation file that contains the genetic algorithm for transfer function optimization
- genetic_optimize/: Contains the genetic optimization implementation
- diffdvr/: Integration with DiffDVR for differentiable volume rendering
Task Scripts:
- scripts/tasks/: Contains shell scripts for different use cases
- Each script:
  - Sets up the environment
  - Runs the genetic optimization
  - Saves results

(back to top)

✅ TODO

Here are some planned features and improvements for TFEvolve:

Add support for additional volumetric data formats

(back to top)

📬 Contact

Project Lead: Yiyao Wang Email: wangyiyao@zju.edu.cn

Project Link: https://github.com/wyysteelhead/TFevolve

(back to top)

📚 BibTeX

If you use TFEvolve in your research, please cite our work:

@misc{wang2025intuiTF,
  title        = {IntuiTF: MLLM-Guided Transfer Function Optimization for Direct Volume Rendering},
  author       = {Wang, Yiyao and Pan, Bo and Wang, Ke and Liu, Han and Mao, Jinyuan and Liu, Yuxin and Zhu, Minfeng and Zhang, Bo and Chen, Weifeng and Huang, Xiuqi and Chen, Wei},
  year         = {2025},
  publisher    = {arXiv},
  howpublished = {\url{https://github.com/wyysteelhead/IntuiTF}},
  doi = {10.48550/arXiv.2506.18407}
}

(back to top)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

IntuiTF: MLLM-Guided Transfer Function Optimization for Direct Volume Rendering

About The Project

🚀 Getting Started

🛠️ Prerequisites

📦 Installation

🧑‍💻 Usage

Command Parameters

Example Configurations

Project Structure

Component Relationships

✅ TODO

📬 Contact

📚 BibTeX

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
.vscode		.vscode
anari		anari
diffdvr		diffdvr
genetic_optimize		genetic_optimize
images		images
prompt		prompt
volumes		volumes
.gitignore		.gitignore
README.md		README.md
genetic.py		genetic.py
requirements.txt		requirements.txt

wyysteelhead/IntuiTF

Folders and files

Latest commit

History

Repository files navigation

IntuiTF: MLLM-Guided Transfer Function Optimization for Direct Volume Rendering

About The Project

🚀 Getting Started

🛠️ Prerequisites

📦 Installation

🧑‍💻 Usage

Command Parameters

Example Configurations

Project Structure

Component Relationships

✅ TODO

📬 Contact

📚 BibTeX

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages