Concrete ML v1.2.0: Hybrid Deployment and Inference Speed Improvements

October 17, 2023
Andrei Stoian

This new version of Concrete ML adds support for hybrid deployment and K-nearest neighbor classification. Hybrid deployment with Fully Homomorphic Encryption (FHE) improves on-premise deployment by moving parts of the model to remote FHE computation, in order to protect model intellectual property (IP), ensure license compliance, and facilitate usage monitoring. Concrete ML v1.2 also adds improvements to the built-in neural networks, making them 10x faster out-of-the-box.

On-premise hybrid deployment

Large Language Models (LLMs) can enable large productivity increases when unleashed on confidential data that companies store in their knowledge bases. Many companies have policies forbidding their employees from using cloud-based LLMs, since doing so may leak such confidential data. On the other hand, developers of proprietary LLMs want to ensure their model IP is protected. Indeed, since LLMs are the result of extensive training and optimization processes, the weights and biases of an LLM are intrinsic to its value, performance, and identity. Protecting them is akin to safeguarding the intellectual, ethical, and economic interests embedded in the model.

Concrete ML introduces hybrid on-premise deployment for both LLMs and regular CNNs (Convolutional Neural Networks), which allows a model to be deployed partly on-premise and partly in the cloud with FHE. Such a configuration affords the best of both worlds to all stakeholders: protecting both data confidentiality and model IP. The hybrid deployment use case example shows this scenario in action: some linear layers of the LLM are executed with FHE on the server side, so the client never obtains those weights. Having clients make a request for each generated token makes billing easy and facilitates license compliance monitoring.
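The split can be pictured with a minimal sketch, assuming a toy two-layer model: the client runs most layers locally, while one linear layer's weights stay on the server. All class and function names below are hypothetical, and the FHE encryption of the activations sent to the server is omitted; in the real deployment those activations would be ciphertexts.

```python
# Hypothetical sketch of a hybrid model split (pure Python, no FHE).
# In production, the value sent to the server would be encrypted, and the
# server-side matrix product would run homomorphically on ciphertexts.

def matvec(weights, x):
    """Multiply a weight matrix (list of rows) by a vector."""
    return [sum(w * v for w, v in zip(row, x)) for row in weights]

class RemoteLinearLayer:
    """Server-side layer: its weights never leave the server."""
    def __init__(self, weights):
        self._weights = weights  # private model IP

    def forward(self, x):
        # Under FHE, this computation runs on encrypted inputs.
        return matvec(self._weights, x)

class HybridModel:
    """Client-side model that delegates one layer to the server."""
    def __init__(self, local_weights, remote_layer):
        self.local_weights = local_weights
        self.remote = remote_layer

    def forward(self, x):
        h = matvec(self.local_weights, x)   # runs on-premise
        h = [max(0.0, v) for v in h]        # ReLU, on-premise
        return self.remote.forward(h)       # one round-trip per inference

server_layer = RemoteLinearLayer([[0.5, -1.0], [1.0, 1.0]])
model = HybridModel([[1.0, 0.0], [0.0, 1.0]], server_layer)
print(model.forward([2.0, -3.0]))  # → [1.0, 2.0]
```

Because every inference requires a server round-trip for the remote layer, the server can naturally meter usage per request, which is what makes per-token billing and license monitoring straightforward.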

K-nearest neighbor (KNN) classification

KNN is a simple non-parametric machine learning model that proves very useful in many applications. Furthermore, the same underlying algorithm can perform similarity search when a threshold on the distance is applied instead of predicting class labels. By using Programmable Bootstrapping, TFHE can support top-k selection on encrypted distances. Thus, the full KNN algorithm is performed on encrypted data, and the model's training data is not exposed to the risk of leakage. See this notebook for a demo of the KNN classifier.
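The computation that runs under encryption can be sketched in cleartext as follows: squared distances to each training point, top-k selection, then a majority vote. In the encrypted setting, the top-k step is what requires programmable bootstrapping; this plain-Python sketch simply sorts.

```python
# Cleartext sketch of the KNN classification steps that Concrete ML
# evaluates under FHE. Under encryption, selecting the k smallest
# distances relies on programmable bootstrapping; here we just sort.
from collections import Counter

def knn_predict(train_x, train_y, query, k=3):
    # Squared Euclidean distance from the query to every training sample.
    dists = [sum((a - b) ** 2 for a, b in zip(x, query)) for x in train_x]
    # Top-k selection: indices of the k smallest distances.
    top_k = sorted(range(len(dists)), key=lambda i: dists[i])[:k]
    # Majority vote over the labels of the k nearest neighbors.
    votes = Counter(train_y[i] for i in top_k)
    return votes.most_common(1)[0][0]

train_x = [(0.0, 0.0), (0.1, 0.2), (1.0, 1.0), (0.9, 1.1)]
train_y = [0, 0, 1, 1]
print(knn_predict(train_x, train_y, (0.95, 1.0)))  # → 1
```

Replacing the vote with a distance threshold on the top-k results turns the same routine into a similarity search, as noted above.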

Optimized neural networks

Right bit-shift has been implemented in Concrete with a low-level cryptographic primitive: it divides encrypted input values by power-of-two scalars at a much smaller cost than a full high-precision PBS. In turn, quantization-aware training for neural networks can constrain quantization scales so that re-quantization between layers reduces to exactly this kind of division. Concrete ML now combines these two features and applies them under the hood to built-in neural networks. Results show that this optimization reduces inference time by an order of magnitude while preserving model accuracy.
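The arithmetic behind this optimization can be illustrated without any cryptography: dividing an integer accumulator by a power of two, with rounding, is just a right bit-shift. The function below is a hypothetical plain-integer illustration, not Concrete's actual primitive.

```python
# Why power-of-two quantization scales are cheap: dividing an integer
# accumulator by 2**shift (with rounding) is a right bit-shift, which is
# far cheaper in FHE than a full high-precision PBS. Plain-integer sketch.

def requantize_shift(acc, shift):
    """Divide an accumulator by 2**shift using a rounded right shift."""
    half = 1 << (shift - 1)       # add half the divisor to round to nearest
    return (acc + half) >> shift

acc = 1234                        # e.g. a wide intermediate accumulator
print(requantize_shift(acc, 6))   # → 19
print(round(acc / 64))            # → 19, same result via float division
```

Quantization-aware training that forces scales to powers of two is what guarantees every re-quantization step in the network can take this cheap form.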

New development pipeline

To speed up bug fixes and releases, Concrete ML development has moved to the public repository. Developers can thus work with the latest code, which includes bug fixes requested through the community forums.
