🤖 Machine Learning -
Testing

Updated at 2024-12-23 03:13

This note is about testing machine learning code in general, check out LLM evals notes for more in-depth pondering on model evaluation.

Unit tests can save weeks of time in machine learning projects. It can take days before you notice that something is not converging properly. And without tests, it can be hard concluding when and where was the issue introduced.

Writing good test coverage for normal code is hard enough, now add in the non-deterministic nature of machine learning, and you have a real challenge.

Most frameworks don't validate your neural network architecture. This stems from two the reason; 1) the frameworks are also used for research where new approaches are being explored and 2) even many of them are stable, they are still fairly new, rapidly developed tools.

The most frequent source of bugs is copy-pasting code. Especially in the non-data science code. Most data scientists are not software engineers, so they frequently use online solutions without fully understanding what is happening. You can mitigate this by researching what the different functions actually do, at least on the high level.

Possibly applicable approaches to testing:

Check that the network parameters change a bit after a mock image training.
If you have network parameters that shouldn't change, check they don't.
Check that loss is never 0%.
Check that accuracy is never 100%.
Seed your random generator so the tests are deterministic.

🤖 Machine Learning - Testing

Sources

🤖 Machine Learning -
Testing