In machine learning, we use the term hyperparameter to distinguish these values from standard model parameters. So, it is worth first understanding what those are.
A machine learning model is the definition of a mathematical formula with a number of parameters that need to be learned from the data. That is the crux of machine learning: fitting a model to the data. This is done through a process known as model training. In other words, by training a model with existing data, we are able to fit the model parameters.
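To make "fitting the model parameters" concrete, here is a minimal sketch. It assumes the simplest possible model, a straight line y = w * x + b, and uses ordinary least squares as the training step. The function name `fit_line` and the toy data are made up for illustration.

```python
def fit_line(xs, ys):
    """Fit y = w * x + b by ordinary least squares and return the learned (w, b)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Slope = covariance(x, y) / variance(x); the intercept follows from the means.
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    w = cov_xy / var_x
    b = mean_y - w * mean_x
    return w, b


# Toy data generated from y = 2x + 1 (an assumption made for this example).
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]

# "Training" recovers the parameters w and b from the data.
w, b = fit_line(xs, ys)
print(w, b)
```

Here w and b are the model parameters: nobody chose them by hand, they came out of the data.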
However, there is another kind of parameter that cannot be directly learned from the regular training process. These parameters express "higher-level" properties of the model such as its complexity or how fast it should learn. They are called hyperparameters. Hyperparameters are usually fixed before the actual training process begins.
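The difference can be sketched with a small gradient descent loop. This is only an illustration under assumed names (`train`, `learning_rate`, `n_epochs`) and toy data. The point is that `learning_rate` and `n_epochs` are passed in before training starts, while w and b are what training produces.

```python
def train(xs, ys, learning_rate, n_epochs):
    """Fit y = w * x + b by gradient descent on the mean squared error.

    learning_rate and n_epochs are hyperparameters: fixed before training.
    w and b are parameters: updated from the data during training.
    """
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(n_epochs):
        # Gradients of the mean squared error with respect to w and b.
        grad_w = sum(2 * (w * x + b - y) * x for x, y in zip(xs, ys)) / n
        grad_b = sum(2 * (w * x + b - y) for x, y in zip(xs, ys)) / n
        w -= learning_rate * grad_w
        b -= learning_rate * grad_b
    return w, b


# Toy data generated from y = 2x + 1 (an assumption made for this example).
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]

w, b = train(xs, ys, learning_rate=0.05, n_epochs=2000)
print(w, b)
```

Nothing inside the loop ever changes `learning_rate`. If it is too small, training crawls; if it is too large, the updates overshoot and can diverge. The training process itself gives no direct way to learn a good value.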
So, how are hyperparameters decided? That is probably beyond the scope of this question, but suffice it to say that, broadly speaking, this is done by setting different values for those hyperparameters, training different models, and deciding which ones work best by testing them on data that was not used for training.
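That try-and-test loop can be sketched as follows. The model here is an assumption chosen for brevity: a k-nearest-neighbors regressor, where the number of neighbors k is the hyperparameter. The helper names, the candidate values, and the toy data are all made up for illustration.

```python
def knn_predict(train_xs, train_ys, x, k):
    """Predict y at x as the mean target of the k nearest training points."""
    order = sorted(range(len(train_xs)), key=lambda i: abs(train_xs[i] - x))
    nearest = order[:k]
    return sum(train_ys[i] for i in nearest) / k


def validation_error(train_xs, train_ys, val_xs, val_ys, k):
    """Mean squared error on held-out data for a given value of k."""
    errors = [
        (knn_predict(train_xs, train_ys, x, k) - y) ** 2
        for x, y in zip(val_xs, val_ys)
    ]
    return sum(errors) / len(errors)


# Toy data: roughly y = x with a little noise (an assumption for this example).
train_xs = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0]
train_ys = [0.1, 0.9, 2.2, 2.8, 4.1, 5.2, 5.9, 7.1]

# Held-out points that the model never sees when making its neighbor table.
val_xs = [1.5, 3.5, 5.5]
val_ys = [1.5, 3.5, 5.5]

# Try several hyperparameter values, test each one, keep the best.
candidates = [1, 2, 4, 8]
scores = {k: validation_error(train_xs, train_ys, val_xs, val_ys, k) for k in candidates}
best_k = min(scores, key=scores.get)
print(scores)
print(best_k)
```

In practice, libraries automate this loop (grid search, random search, and so on), but the idea is the same: one model per candidate value, scored on held-out data.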
So, to summarize. Hyperparameters:
Some examples of hyperparameters: