Details

Data Science Fundamentals with R, Python, and Open Data




1st ed.

by: Marco Cremonini

112,99 €

Publisher: Wiley
Format: PDF
Published: 22.03.2024
ISBN/EAN: 9781394213252
Language: English
Number of pages: 480

DRM-protected eBook; to read it you need, for example, Adobe Digital Editions and an Adobe ID.

Descriptions

<b>Data Science Fundamentals with R, Python, and Open Data</b> <p> <b>Introduction to the essential concepts and techniques of R and Python needed to start data science projects</b> <p>Organized with a strong focus on open data, <i>Data Science Fundamentals with R, Python, and Open Data</i> discusses concepts, techniques, tools, and first steps to carry out data science projects, with a focus on Python and RStudio, reflecting a clear industry trend towards the integration of the two. The text examines intricacies and inconsistencies often found in real data, explaining how to recognize them and guiding readers through possible solutions, and enables readers to handle real data confidently and apply transformations to reorganize, index, aggregate, and elaborate it. <p>The book is supported by a companion website hosting supplementary material, including the datasets used in the examples and complete running code (R scripts and Jupyter notebooks) for all examples. Exam-style and multiple-choice questions support the readers’ active learning. Each chapter presents one or more case studies.
<p>Written by a highly qualified academic, <i>Data Science Fundamentals with R, Python, and Open Data</i> discusses sample topics such as: <ul><li>Data organization and operations on data frames, covering reading CSV datasets and common errors, and slicing, creating, and deleting columns in R</li><li>Logical conditions and row selection, covering selection of rows with logical conditions and operations on dates, strings, and missing values</li><li>Pivoting operations and wide form-long form transformations, indexing by groups with multiple variables, and indexing by group and aggregations</li><li>Conditional statements and iterations, multicolumn functions and operations, data frame joins, and handling data in list/dictionary format</li></ul> <p><i>Data Science Fundamentals with R, Python, and Open Data</i> is a highly accessible learning resource for students from heterogeneous disciplines where data science and quantitative, computational methods are gaining popularity, including hard sciences not closely related to computer science and medical fields using stochastic and quantitative models.
<p>Preface xiii</p> <p>About the Companion Website xvii</p> <p>Introduction xix</p> <p><b>1 Open-Source Tools for Data Science 1</b></p> <p>1.1 R Language and RStudio 1</p> <p>1.2 Python Language and Tools 5</p> <p>1.3 Advanced Plain Text Editor 8</p> <p>1.4 CSV Format for Datasets 8</p> <p><b>2 Simple Exploratory Data Analysis 13</b></p> <p>2.1 Missing Values Analysis 13</p> <p>2.2 R: Descriptive Statistics and Utility Functions 15</p> <p>2.3 Python: Descriptive Statistics and Utility Functions 17</p> <p><b>3 Data Organization and First Data Frame Operations 23</b></p> <p>3.1 R: Read CSV Datasets and Column Selection 24</p> <p>3.2 R: Rename and Relocate Columns 36</p> <p>3.3 R: Slicing, Column Creation, and Deletion 38</p> <p>3.4 R: Separate and Unite Columns 45</p> <p>3.5 R: Sorting Data Frames 49</p> <p>3.6 R: Pipe 55</p> <p>3.7 Python: Column Selection 59</p> <p>3.8 Python: Rename and Relocate Columns 67</p> <p>3.9 Python: NumPy Slicing, Selection with Index, Column Creation and Deletion 69</p> <p>3.10 Python: Separate and Unite Columns 81</p> <p>3.11 Python: Sorting Data Frame 85</p> <p><b>4 Subsetting with Logical Conditions 99</b></p> <p>4.1 Logical Operators 99</p> <p>4.2 R: Row Selection 101</p> <p><b>5 Operations on Dates, Strings, and Missing Values 127</b></p> <p>5.1 R: Operations on Dates and Strings 129</p> <p>5.2 R: Handling Missing Values and Data Type Transformations 141</p> <p>5.3 R: Example with Dates, Strings, and Missing Values 154</p> <p>5.4 Python: Operations on Dates and Strings 165</p> <p>5.5 Python: Handling Missing Values and Data Type Transformations 173</p> <p>5.6 Python: Examples with Dates, Strings, and Missing Values 182</p> <p><b>6 Pivoting and Wide-long Transformations 195</b></p> <p>6.1 R: Pivoting 197</p> <p>6.2 Python: Pivoting 202</p> <p><b>7 Groups and Operations on Groups 221</b></p> <p>7.1 R: Groups 222</p> <p>7.2 Python: Groups 244</p> <p><b>8 Conditions and Iterations 271</b></p> <p>8.1 R: Conditions and Iterations 272</p>
<p>8.2 Python: Conditions and Iterations 284</p> <p><b>9 Functions and Multicolumn Operations 307</b></p> <p>9.1 R: User-defined Functions 308</p> <p>9.2 R: Multicolumn Operations 316</p> <p>9.3 Python: User-defined and Lambda Functions 330</p> <p><b>10 Join Data Frames 347</b></p> <p>10.1 Basic Concepts 348</p> <p>10.2 Python: Join Operations 369</p> <p><b>11 List/Dictionary Data Format 393</b></p> <p>11.1 R: List Data Format 395</p> <p>11.2 R: JSON Data Format and Use Cases 410</p> <p>11.3 Python: Dictionary Data Format 422</p> <p>Questions 443</p> <p>Index 447</p>
<p><b>Marco Cremonini</b> is an Assistant Professor in the Department of Social and Political Sciences at the University of Milan, Italy. He is an Academic Editor and Board Member of PLOS ONE, and his current research interests focus on computational network and agent-based models of propagation and behavior.

You might also be interested in these products:

Modeling Uncertainty
by: Moshe Dror, Pierre L'Ecuyer, Ferenc Szidarovszky
PDF ebook
236,81 €
Level Crossing Methods in Stochastic Models
by: Percy H. Brill
PDF ebook
203,29 €
Continuous Bivariate Distributions
by: N. Balakrishnan, Chin Diew Lai
PDF ebook
128,39 €