Concrete-ML allows you to compile a torch model to its FHE counterpart.
This process executes most of the concepts described in the documentation on how to use quantization and triggers the compilation to be able to run the model over homomorphically encrypted data.
Note that the architecture of the neural network passed to be compiled must respect some hard constraints given by FHE. Please read the our detailed documentation on these limitations.
Once your model is trained, you can simply call the compile_torch_model
function to execute the compilation.
You can then call quantized_numpy_module.forward_fhe.encrypt_run_decrypt()
to have the FHE inference.
Now your model is ready to infer in FHE settings.
fhe_prediction
contains the clear quantized output. The user can now dequantize the output to get the actual floating point prediction as follows:
If you want to see more compilation examples, you can check out the Fully Connected Neural Network
Our torch conversion pipeline uses ONNX and an intermediate representation. We refer the user to the Concrete ML ONNX operator reference for more information.
The following operators in torch will be exported as Concrete-ML compatible ONNX operators:
Operators that take an encrypted input and unencrypted constants:
Note that the equivalent versions from torch.functional
are also supported.