Currently, the framework is designed for text-based poster image generation.
However, I am wondering if there is a way to extend this framework so that when an image is provided as input, the framework only renders the text in an aesthetically pleasing style without altering or changing the original image content itself.