统计学习
Introduction to Linear Algebra 豆瓣 谷歌图书
9.6 (7 个评分) 作者: Gilbert Strang 出版社: Wellesley-Cambridge Press 2016 - 8
Linear algebra is something all mathematics undergraduates and many other students, in subjects ranging from engineering to economics, have to learn. The fifth edition of this hugely successful textbook retains all the qualities of earlier editions, while at the same time seeing numerous minor improvements and major additions. The latter include: • A new chapter on singular values and singular vectors, including ways to analyze a matrix of data • A revised chapter on computing in linear algebra, with professional-level algorithms and code that can be downloaded for a variety of languages • A new section on linear algebra and cryptography • A new chapter on linear algebra in probability and statistics. A dedicated and active website also offers solutions to exercises as well as new exercises from many different sources (including practice problems, exams, and development of textbook examples), plus codes in MATLAB®, Julia, and Python.
Information Theory, Inference and Learning Algorithms 豆瓣 Goodreads
Information Theory, Inference & Learning Algorithms
10.0 (5 个评分) 作者: David J. C. MacKay 出版社: Cambridge University Press 2003 - 10
Information theory and inference, taught together in this exciting textbook, lie at the heart of many important areas of modern technology - communication, signal processing, data mining, machine learning, pattern recognition, computational neuroscience, bioinformatics and cryptography. The book introduces theory in tandem with applications. Information theory is taught alongside practical communication systems such as arithmetic coding for data compression and sparse-graph codes for error-correction. Inference techniques, including message-passing algorithms, Monte Carlo methods and variational approximations, are developed alongside applications to clustering, convolutional codes, independent component analysis, and neural networks. Uniquely, the book covers state-of-the-art error-correcting codes, including low-density-parity-check codes, turbo codes, and digital fountain codes - the twenty-first-century standards for satellite communications, disk drives, and data broadcast. Richly illustrated, filled with worked examples and over 400 exercises, some with detailed solutions, the book is ideal for self-learning, and for undergraduate or graduate courses. It also provides an unparalleled entry point for professionals in areas as diverse as computational biology, financial engineering and machine learning.
非线性最优化基础 豆瓣
作者: [日] Masao Fukushima 译者: 林贵华 出版社: 科学出版社 2011 - 5
《非线性最优化基础》从凸分析的观点全面系统地介绍了非线性最优化的基本理论,是国际著名优化专家Masao Fulkushima教授的最新力作。书中不仅详尽透彻地讲解了(光滑与非光滑优化问题、半定规划问题等)各类优化问题的最优性理论、稳定性理论、灵敏度分析、对偶性理论以及相关的凸分析基础等,还深入介绍了变分不等式问题、非线性互补问题以及均衡约束数学规划问题等均衡问题的最新结果。
《非线性最优化基础》既可作为相关专业高年级本科生和研究生的教材,也可作为相关科研人员的参考书。
Bayesian Reasoning and Machine Learning 豆瓣 Goodreads
作者: David Barber 出版社: Cambridge University Press 2011 - 3
Machine learning methods extract value from vast data sets quickly and with modest resources. They are established tools in a wide range of industrial applications, including search engines, DNA sequencing, stock market analysis, and robot locomotion, and their use is spreading rapidly. People who know the methods have their choice of rewarding jobs. This hands-on text opens these opportunities to computer science students with modest mathematical backgrounds. It is designed for final-year undergraduates and master's students with limited background in linear algebra and calculus. Comprehensive and coherent, it develops everything from basic reasoning to advanced techniques within the framework of graphical models. Students learn more than a menu of techniques, they develop analytical and problem-solving skills that equip them for the real world. Numerous examples and exercises, both computer based and theoretical, are included in every chapter. Resources for students and instructors, including a MATLAB toolbox, are available online.
An Introduction to Statistical Learning 豆瓣 Goodreads
9.8 (12 个评分) 作者: Gareth James / Daniela Witten 出版社: Springer 2013 - 8
An Introduction to Statistical Learning provides an accessible overview of the field of statistical learning, an essential toolset for making sense of the vast and complex data sets that have emerged in fields ranging from biology to finance to marketing to astrophysics in the past twenty years. This book presents some of the most important modeling and prediction techniques, along with relevant applications. Topics include linear regression, classification, resampling methods, shrinkage approaches, tree-based methods, support vector machines, clustering, and more. Color graphics and real-world examples are used to illustrate the methods presented. Since the goal of this textbook is to facilitate the use of these statistical learning techniques by practitioners in science, industry, and other fields, each chapter contains a tutorial on implementing the analyses and methods presented in R, an extremely popular open source statistical software platform. Two of the authors co-wrote The Elements of Statistical Learning (Hastie, Tibshirani and Friedman, 2nd edition 2009), a popular reference book for statistics and machine learning researchers. An Introduction to Statistical Learning covers many of the same topics, but at a level accessible to a much broader audience. This book is targeted at statisticians and non-statisticians alike who wish to use cutting-edge statistical learning techniques to analyze their data. The text assumes only a previous course in linear regression and no knowledge of matrix algebra.
The Elements of Statistical Learning 豆瓣 Goodreads
9.8 (10 个评分) 作者: Trevor Hastie / Robert Tibshirani 出版社: Springer 2009 - 10
During the past decade there has been an explosion in computation and information technology. With it have come vast amounts of data in a variety of fields such as medicine, biology, finance, and marketing. The challenge of understanding these data has led to the development of new tools in the field of statistics, and spawned new areas such as data mining, machine learning, and bioinformatics. Many of these tools have common underpinnings but are often expressed with different terminology. This book describes the important ideas in these areas in a common conceptual framework. While the approach is statistical, the emphasis is on concepts rather than mathematics. Many examples are given, with a liberal use of color graphics. It is a valuable resource for statisticians and anyone interested in data mining in science or industry. The book's coverage is broad, from supervised learning (prediction) to unsupervised learning. The many topics include neural networks, support vector machines, classification trees and boosting---the first comprehensive treatment of this topic in any book. This major new edition features many topics not covered in the original, including graphical models, random forests, ensemble methods, least angle regression & path algorithms for the lasso, non-negative matrix factorization, and spectral clustering. There is also a chapter on methods for "wide" data (p bigger than n), including multiple testing and false discovery rates.
支持向量机 豆瓣
作者: 邓乃扬 / 田英杰 出版社: 科学出版社 2009 - 8
《支持向量机:理论、算法与拓展》以分类问题(模式识别、判别分析)和回归问题为背景,介绍支持向量机的基本理论、方法和应用。特别强调对所讨论的问题和处理方法的实质进行直观的解释和说明,因此具有很强的可读性。为使具有一般高等数学知识的读者能够顺利阅读,书中首先介绍了最优化的基础知识。《支持向量机:理论、算法与拓展》可作为理工类、管理学等专业的高年级本科生、研究生和教师的教材或教学参考书,也可供相关领域的科研人员和实际工作者阅读参考。
金融时间序列分析 豆瓣 Goodreads
作者: Ruey S.Tsay 译者: 王辉 / 潘家柱 出版社: 人民邮电出版社 2009 - 6
本书全面阐述了金融时间序列,并主要介绍了金融时间序列理论和方法的当前研究热点和一些最新研究成果,尤其是风险值计算、高频数据分析、随机波动率建模和马尔科夫链蒙特卡罗方法等方面。此外,本书还系统阐述了金融计量经济模型及其在金融时间序列数据和建模中的应用,所有模型和方法的运用均采用实际金融数据,并给出了所用计算机软件的命令。较之第1版,本版主要在新的发展和实证分析方面进行了更新,新增了状态空间模型和Kalman滤波以及S-Plus命令等内容。 本书可作为时间序列分析的教材,也适用于商学、经济学、数学和统计学专业对金融的计量经济学感兴趣的高年级本科生和研究生,同时,也可作为商业、金融、保险等领域专业人士的参考书。
Probabilistic Graphical Models 豆瓣
作者: Daphne Koller / Nir Friedman 出版社: The MIT Press 2009 - 7
Most tasks require a person or an automated system to reason--to reach conclusions based on available information. The framework of probabilistic graphical models, presented in this book, provides a general approach for this task. The approach is model-based, allowing interpretable models to be constructed and then manipulated by reasoning algorithms. These models can also be learned automatically from data, allowing the approach to be used in cases where manually constructing a model is difficult or even impossible. Because uncertainty is an inescapable aspect of most real-world applications, the book focuses on probabilistic models, which make the uncertainty explicit and provide models that are more faithful to reality. Probabilistic Graphical Models discusses a variety of models, spanning Bayesian networks, undirected Markov networks, discrete and continuous models, and extensions to deal with dynamical systems and relational data. For each class of models, the text describes the three fundamental cornerstones: representation, inference, and learning, presenting both basic concepts and advanced techniques. Finally, the book considers the use of the proposed framework for causal reasoning and decision making under uncertainty. The main text in each chapter provides the detailed technical development of the key ideas. Most chapters also include boxes with additional material: skill boxes, which describe techniques; case study boxes, which discuss empirical cases related to the approach described in the text, including applications in computer vision, robotics, natural language understanding, and computational biology; and concept boxes, which present significant concepts drawn from the material in the chapter. Instructors (and readers) can group chapters in various combinations, from core topics to more technically advanced material, to suit their particular needs.
Foundations of Statistical Natural Language Processing 豆瓣
作者: Christopher D. Manning / Hinrich Schütze 出版社: The MIT Press 1999 - 6
Statistical approaches to processing natural language text have become dominant in recent years. This foundational text is the first comprehensive introduction to statistical natural language processing (NLP) to appear. The book contains all the theory and algorithms needed for building NLP tools. It provides broad but rigorous coverage of mathematical and linguistic foundations, as well as detailed discussion of statistical methods, allowing students and researchers to construct their own implementations. The book covers collocation finding, word sense disambiguation, probabilistic parsing, information retrieval, and other applications.
统计学习理论的本质 豆瓣
作者: [美] Vladimir N. Vapnik 译者: 张学工 出版社: 清华大学出版社 2000
本书介绍了统计学习理论和支持向量机的关键思想、结论和方法,以及该领域的最新进展。统计学习理论是针对小样本情况研究统计学习规律的理论,是传统统计学的重要发展和补充。其核心思想是通过控制学习机器的容量实现对推广能力的控制。由Springer-Verlag出版社授权出版。
统计推断 豆瓣
Statistical Inference
作者: [美] George Casella / [美] Roger L. Berger 出版社: 机械工业出版社 2012 - 1
本书从概率论的基础开始,通过例子与习题的旁征博引,引进了大量近代统计处理的新技术和一些国内同类教材中不能见而广为使用的分布。其内容包括工科概率论入门、经典统计和现代统计的基础,又加进了不少近代统计中数据处理的实用方法和思想,例如:Bootstrap再抽样法、刀切(Jackknife)估计、EM算法、Logistic回归、稳健(Robust)回归、Markov链、Monte Carlo方法等。它的统计内容与国内流行的教材相比,理论较深,模型较多,案例的涉及面要广,理论的应用面要丰富,统计思想的阐述与算法更为具体。
本书可作为工科、管理类学科专业本科生、研究生的教材或参考书,也可供教师、工程技术人员自学之用。