Anthropic i brev: Alibaba har försökt kopiera Claude
Det amerikanska AI‑bolaget Anthropic anklagar den kinesiska teknikjätten Alibaba för att ha försökt kopiera egenskaper hos språkmodellen Claude genom så kallad AI-destillering, rapporterar Bloomberg.
Enligt ett brev till amerikanska senatorer och Vita huset ska Alibabas forskningslabb Qwen AI ha använt nästan 25 000 falska konton för att genomföra 28,8 miljoner interaktioner med modellen mellan april och juni.
Anthropic uppger att det är det största kända försöket hittills av ett kinesiskt företag att utnyttja amerikanska AI‑system. Tidigare har kinesiska Deepseek anklagats för liknande metoder.
bakgrund
AI-destillering
Wikipedia (en)
In machine learning, knowledge distillation or model distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have more knowledge capacity than small models, this capacity might not be fully utilized. It can be just as computationally expensive to evaluate a model even if it utilizes little of its knowledge capacity. Knowledge distillation transfers knowledge from a large model to a smaller one without loss of validity. As smaller models are less expensive to evaluate, they can be deployed on less powerful hardware (such as a mobile device).
There is also a less common technique called Reverse Knowledge Distillation, where knowledge is transferred from a smaller model to a larger one.
Model distillation is not to be confused with model compression, which describes methods to decrease the size of a large model itself, without training a new model. Model compression generally preserves the architecture and the nominal parameter count of the model, while decreasing the bits-per-parameter.
Knowledge distillation has been successfully used in several applications of machine learning such as object detection, acoustic models, and natural language processing.
Recently, it has also been introduced to graph neural networks applicable to non-grid data.
Omni är politiskt obundna och oberoende. Vi strävar efter att ge fler perspektiv på nyheterna. Har du frågor eller synpunkter kring vår rapportering? Kontakta redaktionen