Commit

consistent renamings
florian-huber committed Jun 24, 2024
1 parent 0667f5b commit 6c0300b
Showing 32 changed files with 3,098 additions and 2,646 deletions.
9 changes: 8 additions & 1 deletion .gitignore
@@ -1,4 +1,11 @@
_build
datasets_local
material_and_raw
.ipynb_checkpoints
.ipynb_checkpoints

# Exclude too big datasets (they have download code in the notebooks)
datasets/madrid_reviews.csv
datasets/tripadvisor_hotel_reviews.csv
notebooks/word2vec_madrid_reviews.model

lib/
54 changes: 27 additions & 27 deletions _toc.yml
@@ -4,51 +4,51 @@ parts:
- caption: Introduction
numbered: true # Only applies to chapters in Part 1.
chapters:
- file: book/intro
- file: book/01_intro_data_science
- file: book/02_data_science_ethics_society
- file: book/03_use_of_this_book
- file: book/01_intro
- file: book/02_what_is_data_science
- file: book/03_data_science_ethics_society
- file: book/04_use_of_this_book
- caption: Data Science Basics
numbered: true
chapters:
- file: book/04_data_and_types
- file: book/05_data_information_knowledge
- file: book/06_data_science_workflow
- file: book/05_data_and_types
- file: book/06_data_information_knowledge
- file: book/07_data_science_workflow
- caption: Data Acquisition and First Exploration
numbered: true
chapters:
- file: book/07_data_acquisition_and_preparation
- file: notebooks/live_coding_03_data_preparation
- file: notebooks/live_coding_04_distributions_statistical_measures
- file: book/08_data_acquisition_and_preparation
- file: notebooks/09_data_preparation
- file: notebooks/10_distributions_statistical_measures
- caption: In-depth Data Exploration
numbered: true
chapters:
- file: notebooks/live_coding_05_correlation_analysis
- file: notebooks/live_coding_06_clustering
- file: notebooks/live_coding_06b_introduction_outlier_detection
- file: notebooks/live_coding_07_dimensionality_reduction
- file: notebooks/11_correlation_analysis
- file: notebooks/12_clustering
- file: notebooks/13_introduction_outlier_detection
- file: notebooks/14_dimensionality_reduction
- caption: Supervised Machine Learning
numbered: true
chapters:
- file: notebooks/live_coding_08_machine_learning
- file: notebooks/live_coding_09_machine_learning_algorithms
- file: notebooks/live_coding_09b_machine_learning_algorithms_2
- file: notebooks/live_coding_09c_machine_learning_algorithms_3
- file: notebooks/live_coding_09d_machine_learning_techniques
- file: notebooks/live_coding_09e_machine_learning_ensembles
- file: notebooks/15_machine_learning
- file: notebooks/16_machine_learning_algorithms
- file: notebooks/17_machine_learning_algorithms_2
- file: notebooks/18_machine_learning_algorithms_3
- file: notebooks/19_machine_learning_techniques
- file: notebooks/20_machine_learning_ensembles
- caption: Working with Text Data
numbered: true
chapters:
- file: notebooks/live_coding_10_working_with_text_data
- file: notebooks/live_coding_11_NLP_2_tokenization
- file: notebooks/live_coding_11_NLP_3_tfifd_and_machine_learning
- file: notebooks/live_coding_12_NLP_4_ngrams_word_vectors
- file: notebooks/21_working_with_text_data
- file: notebooks/22_NLP_2_tokenization
- file: notebooks/23_NLP_3_tfifd_and_machine_learning
- file: notebooks/24_NLP_4_ngrams_word_vectors
- caption: Look at the Networks
numbered: true
chapters:
- file: notebooks/live_coding_13_graphs
- file: notebooks/live_coding_14_graph_visualization
- file: notebooks/live_coding_14_graphs_part2
- file: notebooks/25_graphs
- file: notebooks/26_graph_visualization
- file: notebooks/27_graphs_communities
- caption: Next Steps
numbered: true
chapters:
14 changes: 14 additions & 0 deletions book/01_intro.md
@@ -0,0 +1,14 @@
## Introduction: Data Science for Not-Yet-Scientists

In today's world, data is generated at an unprecedented pace, and our ability to harness it is changing the way we live, work, and even think. Data science, the interdisciplinary field that blends statistics, computer science, and domain-specific knowledge, empowers us to extract insights from this vast ocean of data. As data science becomes increasingly essential across industries and sectors, there is a growing need for skilled professionals who can make sense of data and transform it into actionable information. This book is designed to give you a broad yet practical, hands-on tour through the full spectrum of data science approaches.

There are numerous data science books, courses, and materials available, catering to different levels of expertise and backgrounds. However, many of these resources assume a strong foundation in computer science, math, or quantitative scientific disciplines. This is because, until very recently, the typical path to becoming a data scientist was a career shift from one of these fields (which also holds for the author of this book). But more and more universities and higher-education programs are now aiming to train a new generation of data scientists: students who have little prior IT-related training and may not hold a prior scientific degree. This book is for them! It fills this gap by providing a comprehensive, hands-on introduction to data science for those who are just starting their journey or considering a career in this fascinating domain.

The book was designed to be understandable to new undergraduate students, with only basic Python programming and math skills as requirements. You don't need to be an expert in computer science or have a strong background in statistics to grasp the concepts and techniques covered in this book. But we hope that this book will also be helpful for people making a career switch, researchers wanting to deepen (or broaden) their data science skill set, and people in industry who feel the need to move beyond the limits that non-scripting software tools place on data analysis.

Throughout the chapters, you'll find many Python code examples and exercises that will help you develop a deep understanding of data science concepts and techniques. By working through these practical examples, you'll be able to apply your newly acquired knowledge to real-world situations, making you better equipped to tackle data-driven challenges in your chosen field.

We firmly believe that becoming proficient in data science is within reach for anyone who possesses a combination of intellectual curiosity, a passion for learning, and a knack for logical puzzles or detective work. Additionally, a basic affinity for math and statistics goes a long way.

But enough of the introduction. Let's dive in and learn how to do the detective work of a data scientist to extract new knowledge from complex data.

File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
16 changes: 0 additions & 16 deletions book/intro.md

This file was deleted.

File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.