Data is everything we perceive, describe.
For example, the population of Turkey is a data. The population of Germany, the population of the world, simply dogs, cats, houses, schools are all data.
Subcategories of data:
- Numeric Data
- Categorial Data
Data Science is collected under 3 main headings:
It is a communication tool used to tell our requests to the computer. Deep Learning is a sub-branch of Machine Learning and Machine Learning is a sub-branch of Data Science.
Machine Learning has 2 areas:
If e=0 in known data, there is no such thing as e0 in unknown data. The main purpose is to minimize the errors that will arise from the unseen data while training the machine in the "train". So, how can we do this?
The answer is: I just train the model on the "train" validation and set the hyperparameters of the model. But since I did this hyperparameter update according to its good performance on validation, my model starts to overfit the validation set, albeit indirectly. So I need data that it has never seen to test it implicitly as well.
In short 🤐, I split the "train" into 3:
Bias is when the model systematically discriminates. Models carry the ideas of the people who created them. That's why every model is as objective as its designer. (Look "overstimate and "understimate") 👉https://www.statisticshowto.com/what-is-bias/
Author of this article ✍: Merve Sena Çınar
Follow me on LinkedIn 💁♀️ https://www.linkedin.com/in/mervesenacinar/