Embedded code from a convolutional neural network

Let's train a neural network for a toy task - predicting geometric shapes, and check whether it can be compiled into C code, and then into a binary library to use in blocks or as part of another project.

Introduction

In this practical guide, we will train a neural network to recognize geometric shapes on a toy dataset, and then export the trained model to C code, compile it into a shared library, and test the possibility of integration into third-party projects or C code blocks in your project.

During the conversion, you will notice a slight loss of accuracy. This may be caused by differences in the implementation of some operations in Julia and in C (for example, batch normalization) or simple rounding of coefficients when translated into code, but it opens the way to deployment on embedded systems.

Preparation

At this stage, we download the necessary libraries, fix a random number generator, create a synthetic dataset of squares, circles, and triangles, and then visualize sample images of each class.

The generation of a controlled, balanced dataset with known properties (64×64 size, normalization to the range [-1.1]) allows you to check each stage of the pipeline in isolation without the influence of external factors.

Let's install the necessary libraries and initialize the random number generator so that our experiment is easily reproducible.:

# Installing the necessary packages
# Pkg.add(["Flux", "BSON", "ImageTransformations"])

using Random
Random.seed!(5);

Synthesizing a data set

Let's create a toy dataset consisting of three classes. Some of the objects are placed in the "unknown" folder, that is, their class, although it is written in the file name, will be unknown to the system. You can call this a validation dataset. The rest - training and test - are arranged in the appropriate folders.

include("$(@__DIR__)/_scripts/generate_shape_dataset.jl")
generate_shape_dataset(samples_per_class=200, test_samples=30, img_size=64)

Dataset generated:
  200 images of each class for training
  30 test images
  Image size: 64 x 64

At this stage, we try to generate a fairly diverse dataset (with triangle rotations), but at the same time do not overcomplicate the code, for example, we did not do augmentation in the learning process. Overall, this stage turned out to be the least problematic.

A look at the training dataset

Here are sample objects from our training dataset:

include("$(@__DIR__)/_scripts/show_dataset_samples.jl")
DATA_DIR = "$(@__DIR__)/training data";
gr()
show_dataset_samples(DATA_DIR, samples_per_class=10)

Model training and analysis

Here we start the learning process of a convolutional neural network, save the history of metrics, analyze the dynamics of accuracy and loss, and display a mosaic of predictions on test images.

Monitoring precision/recall metrics by class and early stopping for validation accuracy helps to detect overfitting in time and select the best model for subsequent export.

include("$(@__DIR__)/_scripts/train_model.jl");
DATA_DIR = "$(@__DIR__)/training data";
model, classes = train_model(DATA_DIR; epochs=100, imsize=64, batch_size=32, lr=0.0005, test_split=0.25, patience_limit=8);

Batch size: 32, Learning rate: 0.0005
Percentage of the test sample: 25.0%
Найдено классов: 3: ["square", "a circle", "triangle"]

=== Class distribution ===

Total images: 600 (64×64)
  Square: 200 images (33.3%)
  Circle: 200 images (33.3%)
  Triangle: 200 images (33.3%)

=== Data separation ===
  Training: 450 (75.0%)
  Test cases: 150 (25.0%)
Model parameters: 16035

=== Training ===
  Epoch 1/100, Train Loss: 1.2839, Train Acc: 41.3%, Test Acc: 39.3% ★ (precision/recall by class: square: 31.8%/42.0%, Circle: 45.7%/32.0%, Triangle: 49.0%/48.0%)
  Epoch 2/100, Train Loss: 1.1397, Train Acc: 46.2%, Test Acc: 46.7% ★ (precision/recall by class: square: 41.9%/52.0%, Circle: 41.9%/26.0%, Triangle: 45.6%/52.0%)
  Epoch 3/100, Train Loss: 1.0425, Train Acc: 63.8%, Test Acc: 58.0% ★ (precision/recall by class: square: 46.6%/54.0%, Circle: 34.0%/36.0%, Triangle: 59.0%/46.0%)
  Epoch 4/100, Train Loss: 0.9699, Train Acc: 65.8%, Test Acc: 55.3% (precision/recall by class: square: 62.3%/66.0%, Circle: 45.7%/42.0%, Triangle: 60.8%/62.0%)
  Epoch 5/100, Train Loss: 0.9363, Train Acc: 71.3%, Test Acc: 72.0% ★ (precision/recall by class: square: 59.3%/64.0%, Circle: 38.6%/34.0%, Triangle: 53.8%/56.0%)
  Epoch 6/100, Train Loss: 0.862, Train Acc: 73.1%, Test Acc: 67.3% (precision/recall by class: square: 65.1%/82.0%, Circle: 53.2%/50.0%, Triangle: 55.0%/44.0%)
  Epoch 7/100, Train Loss: 0.7955, Train Acc: 82.4%, Test Acc: 77.3% ★ (precision/recall by class: square: 68.7%/92.0%, Circle: 52.5%/42.0%, Triangle: 72.1%/62.0%)
  Epoch 8/100, Train Loss: 0.7538, Train Acc: 78.7%, Test Acc: 78.0% ★ (precision/recall by class: square: 74.2%/92.0%, Circle: 65.7%/46.0%, Triangle: 67.9%/72.0%)
  Epoch 9/100, Train Loss: 0.6834, Train Acc: 75.8%, Test Acc: 75.3% (precision/recall by class: square: 73.0%/92.0%, circle: 46.9%/46.0%, triangle: 57.9%/44.0%)
  Epoch 10/100, Train Loss: 0.6379, Train Acc: 86.4%, Test Acc: 82.7% ★ (precision/recall by class: square: 79.7%/94.0%, Circle: 71.1%/54.0%, Triangle: 71.7%/76.0%)
  Epoch 11/100, Train Loss: 0.609, Train Acc: 84.0%, Test Acc: 83.3% ★ (precision/recall by class: square: 87.5%/98.0%, Circle: 70.7%/58.0%, Triangle: 71.7%/76.0%)
  Epoch 12/100, Train Loss: 0.5567, Train Acc: 82.0%, Test Acc: 82.7% (precision/recall by class: square: 92.6%/100.0%, Circle: 63.8%/60.0%, Triangle: 67.3%/66.0%)
  Epoch 13/100, Train Loss: 0.5446, Train Acc: 65.8%, Test Acc: 63.3% (precision/recall by class: square: 92.3%/96.0%, Circle: 70.8%/68.0%, Triangle: 76.0%/76.0%)
  Epoch 14/100, Train Loss: 0.5065, Train Acc: 79.8%, Test Acc: 82.0% (precision/recall by class: square: 94.2%/98.0%, Circle: 68.5%/74.0%, Triangle: 77.3%/68.0%)
  Epoch 15/100, Train Loss: 0.4701, Train Acc: 88.9%, Test Acc: 86.0% ★ (precision/recall by class: square: 90.7%/98.0%, Circle: 76.1%/70.0%, triangle: 80.0%/80.0%)
  Epoch 16/100, Train Loss: 0.433, Train Acc: 67.8%, Test Acc: 66.0% (precision/recall by class: square: 94.1%/96.0%, Circle: 73.6%/78.0%, Triangle: 78.3%/72.0%)
  Epoch 17/100, Train Loss: 0.4185, Train Acc: 90.2%, Test Acc: 88.7% ★ (precision/recall by class: square: 96.2%/100.0%, Circle: 77.1%/74.0%, triangle: 78.0%/78.0%)
  Epoch 18/100, Train Loss: 0.3876, Train Acc: 95.3%, Test Acc: 92.7% ★ (precision/recall by class: square: 98.0%/98.0%, Circle: 77.8%/84.0%, Triangle: 82.6%/76.0%)
  Epoch 19/100, Train Loss: 0.3864, Train Acc: 94.2%, Test Acc: 92.7% (precision/recall by class: square: 94.0%/94.0%, Circle: 76.4%/84.0%, triangle: 84.4%/76.0%)
  Epoch 20/100, Train Loss: 0.3226, Train Acc: 94.9%, Test Acc: 91.3% (precision/recall by class: square: 90.6%/96.0%, Circle: 77.8%/84.0%, Triangle: 88.4%/76.0%)
  Epoch 21/100, Train Loss: 0.276, Train Acc: 86.0%, Test Acc: 86.0% (precision/recall by class: square: 94.1%/96.0%, Circle: 77.1%/74.0%, triangle: 80.4%/82.0%)
  Epoch 22/100, Train Loss: 0.2853, Train Acc: 91.8%, Test Acc: 89.3% (precision/recall by class: square: 94.3%/100.0%, Circle: 84.0%/84.0%, Triangle: 87.2%/82.0%)
  Epoch 23/100, Train Loss: 0.255, Train Acc: 82.7%, Test Acc: 79.3% (precision/recall by class: square: 98.0%/98.0%, Circle: 87.2%/82.0%, Triangle: 83.0%/88.0%)
  Epoch 24/100, Train Loss: 0.2077, Train Acc: 100.0%, Test Acc: 94.7% ★ (precision/recall by class: square: 92.3%/96.0%, Circle: 79.2%/76.0%, Triangle: 84.0%/84.0%)

   Early stop to achieve 100% accuracy on the training dataset

The best model loaded (Test Acc: 94.7%)

=== Results ===
  Best accuracy on the test: 94.7%
  Train/test accuracy: 94.9% / 89.3%
  ✓ No retraining (5.6% gap)
The training is completed! 🚀
The model is saved in model.bson

Let's look at the quality of the training conducted:

include("$(@__DIR__)/_scripts/analyze_training_log.jl")
gr()
df, classes, p = analyze_training_log("training_log.txt")
display(p)

It is interesting to interpret each graph separately. For example, precision grew almost equally for all classes, but the recall score immediately became better for squares, and was always behind for triangles, remaining not the highest by the end of the learning process.

We did not continue training after achieving 100% quality on the test, because there was no point in comparing implementations with each other. But we definitely should have generated more objects for the dataset, since, on average, by the end of training, the model accurately identified squares and circles, but out of the five proposed triangles, on average, one of them "did not notice". Although those that she marked as triangles were indeed triangles (the network showed more "false positive" errors for the "circle" class).

Forecasts from the Julia (Flux) neural network

include("$(@__DIR__)/_scripts/simple_mosaic.jl")
UNKNOWN_DIR = "$(@__DIR__)/unknown";
gr()
plot(create_simple_mosaic(UNKNOWN_DIR, imsize=64))

We see pretty good predictions, but this is not so much the result of successful training as the result of long-term work by the designer. The most time-consuming was the selection of the network architecture (number of layers, channels, use of BatchNorm and Dropout) and hyperparameters (learning rate, batch size, augmentation) in order to achieve stable convergence and avoid overfitting on a limited data set. As a result, for example, augmentation was moved to the dataset generating function to simplify the example, as well as due to the fact that this procedure is needed only for triangles.

Export to C and testing

Now we convert the preprocessed images to binary format, generate the C code of the neural network, compile it into an executable file, and visualize the predictions obtained from the C implementation. We knowingly assume that the code will work on platforms where there is no PNG library. Therefore, we convert images to binary format using a separate script. These binary files contain matrices, the elements of which include each color channel of each pixel, represented by a single UInt8 number.

include("$(@__DIR__)/_scripts/convert_png_to_rgb8.jl")
convert_png_to_rgb8("$(@__DIR__)/unknown", "$(@__DIR__)/unknown_rgb8", 64)

Now that we have a dataset with binary images ready, we can download the already trained model and translate it into C code. The key requirement for successful export is full alignment of data formats (RGB8 for images, HWC order of coefficients) and the order of weight traversal between Julia and C, which is achieved by explicit control of indexing and normalization at all stages.

include("$(@__DIR__)/_scripts/generate_cnn_code.jl")

using Flux, BSON
BSON.@load "$(@__DIR__)/model.bson" model classes
model = Flux.testmode!(model)

# Generating the library and the main program
generate_shared_lib(model, 64, length(classes))
generate_main_program(64, length(classes))

Generated neural_net.c and neural_net.h
Generated main.c

We will compile the neural network itself into a library. We also generated the main program, which feeds images from the "unknown_rgb8" folder to the neural network and processes the classification results.

;gcc -shared -fPIC neural_net.c -o libneuralnet.so -lm

;gcc main.c -o classify_unknown -ldl -lm

Interestingly, to run this neural network, we don't need any libraries, either Julia or C. It runs on any system that has a C compiler.

;./classify_unknown

File                 Prediction      Confidence
------------------------------------------------
circle_009.rgb circle 0.983
circle_010.rgb circle 0.998
circle_011.rgb circle 0.955
circle_012.rgb circle 0.993
circle_015.rgb circle 0.997
circle_016.rgb circle 0.964
circle_017.rgb circle 0.966
circle_020.rgb circle 0.943
circle_024.rgb circle 0.996
circle_025.rgb circle 1.000
circle_026.rgb circle 0.999
square_001.rgb square 0.701
square_003.rgb square 0.920
square_004.rgb square 0.739
square_005.rgb square 0.815
square_008.rgb square 0.923
square_013.rgb square 0.681
square_014.rgb square 0.743
square_018.rgb square 0.904
square_019.rgb square 0.937
square_021.rgb square 0.739
square_029.rgb square 0.817
triangle_002.rgb triangle 0.664
triangle_006.rgb triangle 0.626
triangle_007.rgb triangle 0.584
triangle_022.rgb circle 0.511
triangle_023.rgb circle 0.754
triangle_027.rgb triangle 0.664
triangle_028.rgb triangle 0.973
triangle_030.rgb circle 0.778
circle_014.rgb circle 0.999
circle_018.rgb circle 0.929
square_009.rgb circle 0.529
square_010.rgb square 0.921
square_015.rgb square 0.992
square_020.rgb square 0.926
square_023.rgb circle 0.567
square_024.rgb square 0.668
square_028.rgb square 0.879
square_030.rgb square 0.927
triangle_001.rgb triangle 0.702
triangle_008.rgb circle 0.564
triangle_011.rgb circle 0.580
triangle_012.rgb triangle 0.6666
triangle_013.rgb triangle 0.698
triangle_021.rgb triangle 0.626
triangle_029.rgb circle 0.707
circle_001.rgb circle 0.948
circle_002.rgb circle 0.990
circle_003.rgb circle 0.992
circle_004.rgb circle 1.000
circle_005.rgb circle 0.793
circle_007.rgb circle 0.985
circle_021.rgb circle 0.995
circle_022.rgb circle 0.912
circle_023.rgb circle 0.973
circle_028.rgb circle 0.989
circle_029.rgb circle 0.948
circle_030.rgb circle 0.992
square_002.rgb circle 0.498
square_007.rgb circle 0.649
square_016.rgb circle 0.715
square_026.rgb square 0.729
square_027.rgb square 0.868
triangle_003.rgb triangle 0.665
triangle_004.rgb triangle 0.558
triangle_009.rgb circle 0.810
triangle_010.rgb triangle 0.539
triangle_014.rgb triangle 0.922
triangle_016.rgb circle 0.707
triangle_017.rgb circle 0.564
triangle_020.rgb circle 0.510
triangle_025.rgb circle 0.497
triangle_026.rgb triangle 0.834

When transferring the model to C, we had to solve several non-trivial tasks: manually implementing convolutions and BatchNorm without third—party libraries, reducing all operations to a single HWC format, accurately reproducing the order of weight traversal (especially critical for multi-channel layers), as well as working with binary image files due to the lack of a PNG library in the target environment - all these The difficulties were successfully overcome.

Forecasts from the neural network in C

include("$(@__DIR__)/_scripts/create_mosaic_from_c_predictions.jl")
run(pipeline(`./classify_unknown`, stdout="pred.txt"))
UNKNOWN_DIR = "$(@__DIR__)/unknown";
gr()
mosaic_grouped = create_mosaic_from_c_predictions("is unknown", "pred.txt", max_images=8)

Warning: detected a stack overflow; program state may be corrupted, so further execution might be unreliable.

Despite these difficulties, we have demonstrated a full working pipeline, proving that exporting neural networks from Julia to C is possible even with limited resources of the target platform.

include("$(@__DIR__)/_scripts/predict_to_csv.jl")
UNKNOWN_DIR = "$(@__DIR__)/unknown";
predict_to_csv(UNKNOWN_DIR, confidence_threshold=0.4, output_csv="$(@__DIR__)/predictions.csv")
run(pipeline(`./classify_unknown`, stdout="pred.txt"))
include("$(@__DIR__)/_scripts/compare_c_and_julia.jl")
df = compare_c_and_julia()
sort(df)

Processed files: 74
  Square: 24
  Circle: 25
  Triangle: 25

=== Comparison of C and Julia ===
Total files: 74
Matching predictions: 58
Accuracy: 78.38%

Statistics of the difference in confidence:
  Average difference: 0.1674
  Max difference: 0.4689
  Min difference: 0.0056

Conclusion

We have shown how to go through the full cycle of creating a program with a neural network inside: from creating a dataset and training a model on Julia to exporting to C and checking performance, which confirms the fundamental possibility of using the generated code far beyond the Engee engineering platform.

Row	File	C_Prediction	C_Confidence	BaseName	Файл	Julia_Prediction	Julia_Confidence	Вероятность_квадрат	Вероятность_круг	Вероятность_треугольник	Match	Confidence_Diff
	String	String	Float64	String	String31	String31	Float64	Float64	Float64	Float64	Bool	Float64
1	circle_001.rgb	круг	0.948	circle_001	circle_001.png	круг	0.814261	0.0364875	0.814261	0.149251	true	0.133738
2	circle_002.rgb	круг	0.99	circle_002	circle_002.png	круг	0.943021	0.0138141	0.943021	0.0431649	true	0.0469789
3	circle_003.rgb	круг	0.992	circle_003	circle_003.png	круг	0.919059	0.0113165	0.919059	0.0696241	true	0.0729406
4	circle_004.rgb	круг	1.0	circle_004	circle_004.png	круг	0.983557	0.00701332	0.983557	0.00943003	true	0.0164434
5	circle_005.rgb	круг	0.793	circle_005	circle_005.png	круг	0.579309	0.0514736	0.579309	0.369217	true	0.213691
6	circle_007.rgb	круг	0.985	circle_007	circle_007.png	круг	0.911108	0.0521951	0.911108	0.0366972	true	0.0738923
7	circle_009.rgb	круг	0.983	circle_009	circle_009.png	круг	0.82299	0.0369218	0.82299	0.140089	true	0.16001
8	circle_010.rgb	круг	0.998	circle_010	circle_010.png	круг	0.956312	0.00526625	0.956312	0.0384219	true	0.0416882
9	circle_011.rgb	круг	0.955	circle_011	circle_011.png	круг	0.530945	0.00444727	0.530945	0.464608	true	0.424055
10	circle_012.rgb	круг	0.993	circle_012	circle_012.png	круг	0.936452	0.0160009	0.936452	0.047547	true	0.056548
11	circle_014.rgb	круг	0.999	circle_014	circle_014.png	круг	0.961115	0.00656026	0.961115	0.032325	true	0.0378853
12	circle_015.rgb	круг	0.997	circle_015	circle_015.png	круг	0.953765	0.00563846	0.953765	0.040597	true	0.0432354
13	circle_016.rgb	круг	0.964	circle_016	circle_016.png	круг	0.863125	0.0549023	0.863125	0.0819724	true	0.100875
⋮	⋮	⋮	⋮	⋮	⋮	⋮	⋮	⋮	⋮	⋮	⋮	⋮
63	triangle_016.rgb	круг	0.707	triangle_016	triangle_016.png	треугольник	0.887947	0.00269611	0.109357	0.887947	false	0.180947
64	triangle_017.rgb	круг	0.564	triangle_017	triangle_017.png	треугольник	0.88178	0.0117523	0.106468	0.88178	false	0.31778
65	triangle_020.rgb	круг	0.51	triangle_020	triangle_020.png	треугольник	0.978909	0.000415554	0.020676	0.978909	false	0.468909
66	triangle_021.rgb	треугольник	0.626	triangle_021	triangle_021.png	треугольник	0.998094	1.40282e-5	0.0018915	0.998094	true	0.372094
67	triangle_022.rgb	круг	0.511	triangle_022	triangle_022.png	треугольник	0.942509	0.00172809	0.0557631	0.942509	false	0.431509
68	triangle_023.rgb	круг	0.754	triangle_023	triangle_023.png	треугольник	0.804048	0.021133	0.174819	0.804048	false	0.0500482
69	triangle_025.rgb	круг	0.497	triangle_025	triangle_025.png	треугольник	0.824758	0.025768	0.149474	0.824758	false	0.327758
70	triangle_026.rgb	треугольник	0.834	triangle_026	triangle_026.png	треугольник	0.955393	0.000491825	0.0441154	0.955393	true	0.121393
71	triangle_027.rgb	треугольник	0.664	triangle_027	triangle_027.png	треугольник	0.885634	0.00767521	0.106691	0.885634	true	0.221634
72	triangle_028.rgb	треугольник	0.973	triangle_028	triangle_028.png	треугольник	0.993726	7.13982e-5	0.00620212	0.993726	true	0.0207265
73	triangle_029.rgb	круг	0.707	triangle_029	triangle_029.png	треугольник	0.887947	0.00269611	0.109357	0.887947	false	0.180947
74	triangle_030.rgb	круг	0.778	triangle_030	triangle_030.png	треугольник	0.827115	0.00261272	0.170272	0.827115	false	0.0491154