Skip to content

Commit

Permalink
update gh-page data
Browse files Browse the repository at this point in the history
  • Loading branch information
dbaranchuk committed Dec 2, 2024
1 parent 3822653 commit c96965e
Show file tree
Hide file tree
Showing 10 changed files with 46 additions and 16 deletions.
60 changes: 45 additions & 15 deletions index.html
Original file line number Diff line number Diff line change
Expand Up @@ -110,16 +110,26 @@ <h1 class="title is-1 publication-title">Switti: Designing Scale-Wise Transforme
</a>
</span>

<!-- ArXiv abstract Link -->
<span class="link-block">
<a href="https://huggingface.co/spaces/dbaranchuk/Switti" target="_blank"
class="external-link button is-normal is-rounded is-dark">
<span class="icon">
<span>&#129303;</span>
</span>
<span>Demo</span>
</a>
</span>

<!-- ArXiv abstract Link -->
<!-- <span class="link-block">
<a href="TODO" target="_blank"
class="external-link button is-normal is-rounded is-dark">
<span class="icon">
<i class="ai ai-arxiv"></i>
</span>
<span>arXiv</span>
</a>
</span>
</span> -->
</div>
</div>
</div>
Expand Down Expand Up @@ -152,11 +162,11 @@ <h2 class="title is-3">Abstract</h2>
<p>
This work presents Switti, a scale-wise transformer for text-to-image generation.
Starting from existing next-scale prediction AR models, we first explore them for T2I generation and propose architectural modifications to improve their convergence and overall performance.
We then observe that self-attention maps of the pretrained scale-wise AR model exhibit weak dependence on preceding scales.
Based on this insight, we propose a non-AR counterpart facilitating ~11% faster sampling and lower memory usage while achieving similar generation performance.
Furthermore, we reveal that classifier-free guidance at high-resolution scales is often unnecessary and can even degrade performance.
We then observe that self-attention maps of our pretrained scale-wise AR model exhibit weak dependence on preceding scales.
Based on this insight, we propose a non-AR counterpart facilitating ~11% faster sampling and lower memory usage while also achieving slightly better generation quality.
Furthermore, we reveal that classifier-free guidance at high-resolution scales is often unnecessary and can even degrade performance.
By disabling guidance at these scales, we achieve an additional sampling acceleration of ~20% and improve the generation of fine-grained details.
Extensive human preference studies and automated evaluations confirm that Switti outperforms existing T2I AR models and competes with state-of-the-art T2I diffusion models while being up to 7x faster.
Extensive human preference studies and automated evaluations show that Switti outperforms existing T2I AR models and competes with state-of-the-art T2I diffusion models while being up to 7x faster.
</p>
</div>
</div>
Expand All @@ -168,24 +178,29 @@ <h2 class="title is-3">Abstract</h2>
<!-- Image carousel -->
<section class="hero is-small">
<div class="hero-body">
<!-- <h2 class="subtitle has-text-centered"> -->
<!-- <font size="20"> SDXL</font> -->
</h2>
<div class="container">
<div id="results-carousel" class="carousel results-carousel">
<div class="item">
<!-- Your image here -->
<img src="static/images/main_image_grid_different_models.png" alt="MY ALT TEXT"/>
<h2 class="subtitle has-text-centered">
<img src="static/images/gh-page-image_grid_different_models_2.png" alt="MY ALT TEXT"/>
<!-- <h2 class="subtitle has-text-centered">
Qualitative comparison of Switti against competing AR and diffusion-based models.
</h2>
</h2> -->
</div>
<div class="item">
<!-- Your image here -->
<img src="static/images/supp_image_grid_different_models.png" alt="MY ALT TEXT"/>
<h2 class="subtitle has-text-centered">
<img src="static/images/gh-page-image_grid_different_models_0.png" alt="MY ALT TEXT"/>
<!-- <h2 class="subtitle has-text-centered">
Qualitative comparison of Switti against competing AR and diffusion-based models.
</h2>
</h2> -->
</div>
<div class="item">
<!-- Your image here -->
<img src="static/images/gh-page-image_grid_different_models_1.png" alt="MY ALT TEXT"/>
<!-- <h2 class="subtitle has-text-centered">
Qualitative comparison of Switti against competing AR and diffusion-based models.
</h2> -->
</div>
</div>
</div>
Expand All @@ -208,14 +223,29 @@ <h2 class="subtitle has-text-centered">
</div>
</div>
</section>
<br><br>
<section class="hero teaser">
<div class="container is-max-desktop" >
<h2 class="subtitle has-text-centered">
<font size="20"> Inference performance evaluation</font>
</h2>
<div class="container" align="center">
<img src="static/images/inference_results.png" alt="MY ALT TEXT" width="50%" />
<h2 class="subtitle has-text-centered">
Comparison of models’ 512×512 image generation time.
</h2>
</div>
</div>
</section>
</section>
<br><br>
<section class="hero teaser">
<div class="container is-max-desktop" >
<h2 class="subtitle has-text-centered">
<font size="20"> Automated Metrics</font>
</h2>
<div class="container" align="center">
<img src="static/images/metrics_results.png" alt="MY ALT TEXT" width="70%" />
<img src="static/images/switti_metrics_results.png" alt="MY ALT TEXT" width="100%" />
<h2 class="subtitle has-text-centered">
Quantitative comparison of Switti to competing AR and diffusion-based models.
</h2>
Expand Down
2 changes: 1 addition & 1 deletion static/css/fontawesome.all.min.css

Large diffs are not rendered by default.

Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added static/images/inference_results.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file removed static/images/main_image_grid_different_models.png
Binary file not shown.
Binary file removed static/images/metrics_results.png
Binary file not shown.
Binary file removed static/images/supp_image_grid_different_models.png
Binary file not shown.
Binary file added static/images/switti_metrics_results.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.

0 comments on commit c96965e

Please sign in to comment.