All articles about data analysis
Book Description: Data pipelines are the foundation for success in data analytics and machine learning. Moving data from many diverse sources and processing it to provide context is the difference between having data and actually gaining value from it. This pocket reference defines data pipelines and explains how they work in today’s modern data stack.
You’ll learn common considerations and key decision points when implementing pipelines, such as data pipe...
Book Description: Get a 360-degree view of how the journey of data analytics solutions has evolved from monolithic data stores and enterprise data warehouses to data lakes and modern data warehouses. You will
This book includes comprehensive coverage of how: To architect data lake analytics solutions by choosing suitable technologies available on Microsoft Azure
The advent of microservices applications covering ecommerce or modern solutions built on IoT and how real-...
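Book Description: Written by Wes McKinney, creator of the Python pandas project, this book covers the details and essentials of manipulating, processing, cleaning, and wrangling data with Python. The second edition has been fully revised and updated for Python 3.6, covers the latest versions of pandas, NumPy, IPython, and Jupyter, and adds many practical case studies to help you solve a broad range of data analysis problems efficiently.
Major updates in the second edition include:
• All code, including the Python tutorial, updated for Python 3.6 (the first edition used Python 2.7)
• Updated installation instructions for the Anaconda Python distribution and other required Python packages
• The pandas library updated to the 2017...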
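Book Description: In a lively style reminiscent of a serialized novel, this book presents the techniques every good data analyst should know: the basic steps of data analysis, experimental methods, optimization, hypothesis testing, Bayesian statistics, subjective probability, heuristics, histograms, regression, error handling, relational databases, and data-cleaning techniques. After the main text, three appendices cover the top ten essentials of data analysis, the R tool, and the ToolPak tool, giving readers a bridge to deeper study beyond the core material.
The book is full of twists and its writing is witty and engaging. Whether you are a seasoned professional or new to the field, whether you read every word closely or just leaf through it, you will surely be able to follow the...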
Book Description:
Gain a thorough understanding of supervised learning algorithms by developing use cases with Python. You will study supervised learning concepts, Python code, datasets, best practices, resolution of common issues and pitfalls, and practical knowledge of implementing algorithms for structured as well as text and image datasets.
You’ll start with an introduction to machine learning, highlighting the differences between supervised, semi-supervised and unsu...
Book Description:
Business users familiar with Base SAS programming can now learn Python by example. You will learn via examples that map SAS programming constructs and coding patterns into their Python equivalents. Your primary focus will be on pandas and data management issues related to analysis of data.
It is estimated that there are three million or more SAS users worldwide today. As the data science landscape shifts from using SAS to open source software such as Py...
Book Description:
Build the foundational data science skills necessary to work with and better understand complex data science algorithms. This example-driven book provides complete Python coding examples to complement and clarify data science concepts, and enrich the learning experience. Coding examples include visualizations whenever appropriate. The book is a necessary precursor to applying and implementing machine learning algorithms. The book is self-contained. All ...
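Book Description: Want to get more out of the data you have? Or repeat the same analysis across thousands of files? How can non-programmers with no coding experience learn, in the shortest possible time, to analyze data with Python, one of today's most popular languages?
Clinton Brownley, a data expert at Facebook, can help with these questions. In this book, readers will learn basic Python programming, write scripts that process data in spreadsheets and databases, and use Python modules to parse files, group data, and produce statistics.
Learn the basic syntax, then create and run your own Python scripts
Read and parse CSV files
Read multiple Excel worksheets and workbooks
Perform data...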
Book Description:
Many analysts are too concerned with tools and techniques for cleansing, modeling, and visualizing datasets and not concerned enough with asking the right questions. In this practical guide, data strategy consultant Max Shron shows you how to put the why before the how, through an often-overlooked set of analytical skills.
Thinking with Data helps you learn techniques for turning data into knowledge you can use. You’ll learn a framework for defining your...
Book Description:
A fresh look at visualization from the author of Visualize This. Whether it’s statistical charts, geographic maps, or the snappy graphical statistics you see on your favorite news sites, the art of data graphics or visualization is fast becoming a movement of its own. In Data Points: Visualization That Means Something, author Nathan Yau presents an intriguing complement to his bestseller Visualize This, this time focusing on the graphics side of data anal...