现在位置 >首页 >
所有关于数据分析的文章
Description:
Object Oriented Data Analysis is a framework that facilitates inter-disciplinary research through new terminology for discussing the often many possible approaches to the analysis of complex data. Such data are naturally arising in a wide variety of areas. This book aims to provide ways of thinking that enable the making of sensible choices.
The main points are illustrated with many real data examples, based on the authors’ personal experiences, which h...
Description:
Take a journey toward discovering, learning, and using Apache Spark 3.0. In this book, you will gain expertise on the powerful and efficient distributed data processing engine inside of Apache Spark; its user-friendly, comprehensive, and flexible programming model for processing data in batch and streaming; and the scalable machine learning algorithms and practical utilities to build machine learning applications.
Beginning Apache Spark 3 begins by expl...
Description:
Data is a fantastic raw resource for powering change in an organization, but all too often the people working in those organizations don’t have the necessary skills to communicate with data effectively. With this practical book, subject matter experts will learn ways to develop strong, persuasive points when presenting data to different groups in their organizations.
Author Carl Allchin shows anyone how to find data sources and develop data analytics, ...
Book Description:
Data pipelines are the foundation for success in data analytics and machine learning. Moving data from many diverse sources and processing it to provide context is the difference between having data and actually gaining value from it. This pocket reference defines data pipelines and explains how they work in today's modern data stack.
You'll learn common considerations and key decision points when implementing pipelines, such as data pipeline...
Book Description:
Get a 360-degree view of how the journey of data analytics solutions has evolved from monolithic data stores and enterprise data warehouses to data lakes and modern data warehouses. You will
This book includes comprehensive coverage of how:
To architect data lake analytics solutions by choosing suitable technologies available on Microsoft Azure
The advent of microservices applications covering ecommerce or modern solutions built on Io...
内容简介:
本书由Python pandas项目创始人Wes McKinney亲笔撰写,详细介绍利用Python进行操作、处理、清洗和规整数据等方面的具体细节和基本要点。第2版针对Python 3.6进行全面修订和更新,涵盖新版的pandas、NumPy、IPython和Jupyter,并增加大量实际案例,可以帮助你高效解决一系列数据分析问题。
第2版中的主要更新包括:
• 所有的代码,包括把Python的教程更新到了Python 3.6版本(第1版中使用的是Python 2.7)
• 更新了Python第三方发布版Anaconda和其他所需Python包的安装指引
• 更新pa...
内容简介:
本书以类似“章回小说”的活泼形式,生动地向读者展现优秀的数据分析人员应知应会的技术:数据分析基本步骤、实验方法、最优化方法、假设检验方法、贝叶斯统计方法、主观概率法、启发法、直方图法、回归法、误差处理、相关数据库、数据整理技巧;正文以后,意犹未尽地以三篇附录介绍数据分析十大要务、R工具及ToolPak工具,在充分展现目标知识以外,为读者搭建了走向深入研究的桥梁。
本书构思跌宕起伏,行文妙趣横生,无论是职场老手,还是业界新人,无论是字斟句酌,还是信手翻阅,相信都...
Book Description:
Gain a thorough understanding of supervised learning algorithms by developing use cases with Python. You will study supervised learning concepts, Python code, datasets, best practices, resolution of common issues and pitfalls, and practical knowledge of implementing algorithms for structured as well as text and images datasets.
You’ll start with an introduction to machine learning, highlighting the differences between supervised, semi-supervised and u...
Book Description:
Business users familiar with Base SAS programming can now learn Python by example. You will learn via examples that map SAS programming constructs and coding patterns into their Python equivalents. Your primary focus will be on pandas and data management issues related to analysis of data.
It is estimated that there are three million or more SAS users worldwide today. As the data science landscape shifts from using SAS to open source software such a...
Book Description:
Build the foundational data science skills necessary to work with and better understand complex data science algorithms. This example-driven book provides complete Python coding examples to complement and clarify data science concepts, and enrich the learning experience. Coding examples include visualizations whenever appropriate. The book is a necessary precursor to applying and implementing machine learning algorithms.
The book is self-contained. Al...