12 billion parameter rectified flow transformer capable of generating an image based on a text description while following the structure of a given input image.
12 billion parameter rectified flow transformer capable of generating an image based on a text description while following the structure of a given input image.