Cache-to-Cache(C2C): Direct Semantic Communication Between Large Language Models via KV-Cache Fusion
Can large language models collaborate without sending a single token of text? a team of researchers from Tsinghua University, Infinigence AI, The Chinese University of Hong Kong, Shanghai AI ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results