Latest Posts
It seamlessly replaces traditional MLPs, reducing FLOPs and parameter count.
Its research results have been selected for NeurIPS 2024.
Highly aesthetic editing with precise response to any command. Now open to testing!
Research by members of the post-1995 and post-2000 generations
Decoupling control flow and computing flow for both flexibility and efficiency.
SIA Lab aims to achieve breakthroughs in foundational large model technologies and build industrial applications through effective industry-academia-research collaboration.
Comprehensive efforts have gone into the research and development of the foundation models, which offer unique advantages across a range of business scenarios.
With a unified and flexible framework, it supports score-to-song conversion, controllable generation, music and lyrics editing, low-barrier voice cloning, and more.
"Pragmatic" and "technology-focused" are the team's fundamental principles.
ByteCheckpoint makes saving up to 529.22 times faster and loading up to 3.51 times faster.
Daily token usage of the Doubao large model exceeds 50 billion.
This paper proposes IR-QLoRA, a novel method that uses information retention to make quantized LLMs fine-tuned with LoRA highly accurate. It is the first work to introduce an information-theoretic perspective to this problem, using theories related to information entropy to examine and measure the quantization of large models. The paper has been selected for an oral presentation at ICML 2024. The first author is an intern with the speech group of ByteDance's Doubao Team (Seed) and a candidate in the ByteDance Scholars Program. Faculty and students from the State Key Laboratory of Complex & Critical Software Environment at Beihang University also contributed to the research.
The first author of the paper is a team intern.
Volcano Engine unleashes the power of large models to support academic frontier exploration
The technology has been in use for some time and has been well received by users in real-world scenarios.
Generative AI further enables the integration of industry, academia, and research.