Data Science Quick Reference Manual – Methodological Aspects, Data Acquisition, Management and Cleaning

Data Science Quick Reference Manual – Methodological Aspects, Data Acquisition, Management and Cleaning
Author :
Publisher : Mario Capurso
Total Pages : 228
Release :
ISBN-10 :
ISBN-13 :
Rating : 4/5 ( Downloads)

Book Synopsis Data Science Quick Reference Manual – Methodological Aspects, Data Acquisition, Management and Cleaning by : Mario A. B. Capurso

Download or read book Data Science Quick Reference Manual – Methodological Aspects, Data Acquisition, Management and Cleaning written by Mario A. B. Capurso and published by Mario Capurso. This book was released on with total page 228 pages. Available in PDF, EPUB and Kindle. Book excerpt: This work follows the 2021 curriculum of the Association for Computing Machinery for specialists in Data Sciences, with the aim of producing a manual that collects notions in a simplified form, facilitating a personal training path starting from specialized skills in Computer Science or Mathematics or Statistics. It has a bibliography with links to quality material but freely usable for your own training and contextual practical exercises. First of a series of books, it covers methodological aspects, data acquisition, management and cleaning. It describes the CRISP DM methodology, the working phases, the success criteria, the languages and the environments that can be used, the application libraries. Since this book uses Orange for the application aspects, its installation and widgets are described. Dealing with data acquisition, the book describes data sources, the acceleration techniques, the discretization methods, the security standards, the types and representations of the data, the techniques for managing corpus of texts such as bag-of-words, word-count , TF-IDF, n-grams, lexical analysis, syntactic analysis, semantic analysis, stop word filtering, stemming, techniques for representing and processing images, sampling, filtering, web scraping techniques. Examples are given in Orange. Data quality dimensions are analysed, and then the book considers algorithms for entity identification, truth discovery, rule-based cleaning, missing and repeated value handling, categorical value encoding, outlier cleaning, and errors, inconsistency management, scaling, integration of data from various sources and classification of open sources, application scenarios and the use of databases, datawarehouses, data lakes and mediators, data schema mapping and the role of RDF, OWL and SPARQL, transformations. Examples are given in Orange. The book is accompanied by supporting material and it is possible to download the project samples in Orange and sample data.


Data Science Quick Reference Manual – Methodological Aspects, Data Acquisition, Management and Cleaning Related Books

Data Science Quick Reference Manual – Methodological Aspects, Data Acquisition, Management and Cleaning
Language: en
Pages: 228
Authors: Mario A. B. Capurso
Categories: Computers
Type: BOOK - Published: - Publisher: Mario Capurso

DOWNLOAD EBOOK

This work follows the 2021 curriculum of the Association for Computing Machinery for specialists in Data Sciences, with the aim of producing a manual that colle
Data Science Quick Reference Manual Exploratory Data Analysis, Metrics, Models
Language: en
Pages: 323
Authors: Mario A. B. Capurso
Categories: Computers
Type: BOOK - Published: 2023-08-23 - Publisher: Mario Capurso

DOWNLOAD EBOOK

This work follows the 2021 curriculum of the Association for Computing Machinery for specialists in Data Sciences, with the aim of producing a manual that colle
Data Science Quick Reference Manual – Deep Learning
Language: en
Pages: 261
Authors: Mario A. B. Capurso
Categories: Computers
Type: BOOK - Published: 2023-09-04 - Publisher: Mario Capurso

DOWNLOAD EBOOK

This work follows the 2021 curriculum of the Association for Computing Machinery for specialists in Data Sciences, with the aim of producing a manual that colle
Data Science Quick Reference Manual - Modeling and Machine Learning
Language: en
Pages: 191
Authors: Mario A. B. Capurso
Categories: Computers
Type: BOOK - Published: 2023-08-31 - Publisher: Mario Capurso

DOWNLOAD EBOOK

This work follows the 2021 curriculum of the Association for Computing Machinery for specialists in Data Sciences, with the aim of producing a manual that colle
Data Science Quick Reference Manual - Advanced Machine Learning and Deployment
Language: en
Pages: 278
Authors: Mario A. B. Capurso
Categories: Computers
Type: BOOK - Published: 2023-09-08 - Publisher: Mario Capurso

DOWNLOAD EBOOK

This work follows the 2021 curriculum of the Association for Computing Machinery for specialists in Data Sciences, with the aim of producing a manual that colle