ByteDance researchers up image generation efficiency through compression findings

Researchers at ByteDance have published 1.58-bit FLUX, a new approach to compressing AI image generation models that aims to address the deployment challenges of current text-to-image models.

In the published paper, the authors state that while many popular text-to-image models have “demonstrated remarkable generative capabilities,” their immense parameter counts and high memory requirements pose challenges for deployment.

They highlight this as a particular barrier on resource-constrained devices such as mobile platforms.

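To overcome this, the team quantized the weights of the FLUX model to just three values (-1, 0 and +1), which reduced storage almost eightfold. A weight that can take one of three values carries log2(3) ≈ 1.58 bits of information, which is where the name comes from.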
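To make the idea of three-valued weights concrete, here is a minimal Python sketch. The article does not describe the exact rounding rule the ByteDance team used, so the "absmean" rule below (scale by the mean absolute weight, then round and clamp) is borrowed from the BitNet b1.58 line of work purely as an assumption. The function names and sample weights are hypothetical.

```python
import math


def ternary_quantize(weights):
    """Map floats to codes in {-1, 0, +1} plus one shared scale.

    Assumption: uses an absmean rule (divide by the mean absolute value,
    round, clamp). This is an illustration, not the paper's method.
    """
    scale = sum(abs(w) for w in weights) / len(weights)
    if scale == 0.0:
        return [0] * len(weights), 0.0
    codes = [max(-1, min(1, round(w / scale))) for w in weights]
    return codes, scale


def dequantize(codes, scale):
    """Recover approximate weights from ternary codes and the shared scale."""
    return [c * scale for c in codes]


# Information content of one three-valued weight, in bits.
BITS_PER_TERNARY_WEIGHT = math.log2(3)

# Hypothetical sample weights, chosen only for illustration.
weights = [0.42, -0.07, -0.55, 0.02, 0.31, -0.29]
codes, scale = ternary_quantize(weights)
approx = dequantize(codes, scale)

print(codes)                               # every entry is -1, 0 or +1
print(round(BITS_PER_TERNARY_WEIGHT, 3))   # about 1.585
```

Each weight is reduced to a sign (or zero), and a single floating-point scale per group of weights restores the overall magnitude. The detail lost in that rounding is the price paid for the smaller model.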

“This work introduced 1.58-bit FLUX, in which 99.5% of the transformer parameters are quantized to 1.58 bits. With our custom computation kernels, 1.58-bit FLUX achieves a 7.7× reduction in model storage and more than a 5.1× reduction in inference memory usage,” stated the researchers in the paper.
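The quoted storage figure can be sanity-checked with some rough arithmetic. The sketch below assumes the original weights are stored in 16 bits and that each three-valued weight is packed into 2 bits (four per byte), with the remaining 0.5% of parameters left at 16 bits. None of these storage details are stated in the article, so this is a back-of-envelope estimate rather than the paper's own accounting, and all function names are hypothetical.

```python
def storage_reduction(quantized_fraction, original_bits=16, packed_bits=2):
    """Estimate the storage reduction factor.

    Assumptions: 16-bit original weights, 2 bits per quantized weight,
    and unquantized weights kept at their original width.
    """
    compressed_bits = (quantized_fraction * packed_bits
                       + (1 - quantized_fraction) * original_bits)
    return original_bits / compressed_bits


def pack_ternary(codes):
    """Pack codes from {-1, 0, +1} into bytes, four codes per byte."""
    packed = bytearray()
    for i in range(0, len(codes), 4):
        byte = 0
        for j, code in enumerate(codes[i:i + 4]):
            byte |= (code + 1) << (2 * j)  # store -1, 0, +1 as 0, 1, 2
        packed.append(byte)
    return bytes(packed)


def unpack_ternary(packed, count):
    """Inverse of pack_ternary; count drops the padding in the last byte."""
    codes = []
    for byte in packed:
        for j in range(4):
            codes.append(((byte >> (2 * j)) & 0b11) - 1)
    return codes[:count]


ratio = storage_reduction(0.995)
print(round(ratio, 2))  # roughly 7.7 under the stated assumptions

sample = [1, 0, -1, 0, 1, -1]
packed = pack_ternary(sample)
print(len(packed), unpack_ternary(packed, len(sample)) == sample)
```

Under these assumptions the estimate lands close to the 7.7× quoted by the researchers. Packing at 2 bits rather than the theoretical 1.58 bits per weight trades a little storage for much simpler decoding.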

Images from the compressed model found to be comparable to the full model

Image: Two side-by-side AI-generated images of a cat, created with the prompt 'A cat made of sea water walking in a library.'

Despite the aggressive compression, benchmark evaluations suggest the generated images maintain high visual quality and are comparable to those from the full model.

The researchers hope the work “inspires the community to develop more robust models for mobile devices.”

While the team has made substantial headway on issues that could otherwise become widespread bottlenecks in real-world use, they have highlighted some limitations which they aim to address in future work.

One is limited improvement in speed. “Although 1.58-bit FLUX reduces model size and memory consumption, its latency is not significantly improved due to the absence of activation quantization and lack of further optimized kernel implementations.

“Given our promising results, we hope to inspire the community to develop custom kernel implementation for 1.58-bit models.”

Another highlighted limitation concerns visual quality. The 1.58-bit FLUX model can generate vivid images that closely align with the text prompt, but the researchers say it “still lags behind the original FLUX model in rendering fine details at very high resolution. We aim to address this gap in future research.”

Featured image credit: Linked research paper

Sophie Atkinson
Tech Journalist