統計學
Visualizing Data 豆瓣
作者: William S. Cleveland Hobart Press 1993 - 3
Visualizing Data is about visualization tools that provide deep insight into the structure of data. There are graphical tools such as coplots, multiway dot plots,and the equal count algorithm. There are fitting tools such as loess and bisquare that fit equations, nonparametric curves,and nonparametric surfaces to data.
But the book is much more than just a compendium of useful tools. It conveys a strategy for data analysis that stresses the use of visualization to thoroughly study the structure of data and to check the validity of statistical models fitted to data. The result of the tools and the strategy is a vast increase in what you can learn from your data. The book demonstrates this by reanalyzing many data sets from the scientific literature, revealing missed effects and inappropriate models fitted to data.
Statistical Pattern Recognition 豆瓣
作者: Andrew R. Webb / Keith D. Copsey Wiley 2011 - 11
Statistical pattern recognition relates to the use of statistical techniques for analysing data measurements in order to extract information and make justified decisions. It is a very active area of study and research, which has seen many advances in recent years. Applications such as data mining, web searching, multimedia data retrieval, face recognition, and cursive handwriting recognition, all require robust and efficient pattern recognition techniques. This third edition provides an introduction to statistical pattern theory and techniques, with material drawn from a wide range of fields, including the areas of engineering, statistics, computer science and the social sciences. The book has been updated to cover new methods and applications, and includes a wide range of techniques such as Bayesian methods, neural networks, support vector machines, feature selection and feature reduction techniques.Technical descriptions and motivations are provided, and the techniques are illustrated using real examples. Statistical Pattern Recognition, 3rd Edition:* Provides a self-contained introduction to statistical pattern recognition.* Includes new material presenting the analysis of complex networks.* Introduces readers to methods for Bayesian density estimation.* Presents descriptions of new applications in biometrics, security, finance and condition monitoring.* Provides descriptions and guidance for implementing techniques, which will be invaluable to software engineers and developers seeking to develop real applications* Describes mathematically the range of statistical pattern recognition techniques.* Presents a variety of exercises including more extensive computer projects. The in-depth technical descriptions make the book suitable for senior undergraduate and graduate students in statistics, computer science and engineering. Statistical Pattern Recognition is also an excellent reference source for technical professionals. Chapters have been arranged to facilitate implementation of the techniques by software engineers and developers in non-statistical engineering fields. www.wiley.com/go/statistical-pattern-recognition
Trust in Numbers 豆瓣
作者: Theodore M. Porter Princeton University Press 2000 - 8
This investigation of the overwhelming appeal of quantification in the modern world discusses the development of cultural meanings of objectivity over two centuries. How are we to account for the current prestige and power of quantitative methods? The usual answer is that quantification is seen as desirable in social and economic investigation as a result of its successes in the study of nature. Theodore Porter is not content with this. Why should the kind of success achieved in the study of stars, molecules, or cells be an attractive model for research on human societies? he asks. And, indeed, how should we understand the pervasiveness of quantification in the sciences of nature? In his view, we should look in the reverse direction: comprehending the attractions of quantification in business, government, and social research will teach us something new about its role in psychology, physics, and medicine.
Drawing on a wide range of examples from the laboratory and from the worlds of accounting, insurance, cost-benefit analysis, and civil engineering, Porter shows that it is "exactly wrong" to interpret the drive for quantitative rigor as inherent somehow in the activity of science except where political and social pressures force compromise. Instead, quantification grows from attempts to develop a strategy of impersonality in response to pressures from outside. Objectivity derives its impetus from cultural contexts, quantification becoming most important where elites are weak, where private negotiation is suspect, and where trust is in short supply.
Statistics on the Table 豆瓣
作者: Stephen M. Stigler Harvard University Press 2002 - 10
This lively collection of essays examines in witty detail the history of some of the concepts involved in bringing statistical argument "to the table," and some of the pitfalls that have been encountered. The topics range from seventeenth-century medicine and the circulation of blood, to the cause of the Great Depression and the effect of the California gold discoveries of 1848 upon price levels, to the determinations of the shape of the Earth and the speed of light, to the meter of Virgil's poetry and the prediction of the Second Coming of Christ. The title essay tells how the statistician Karl Pearson came to issue the challenge to put "statistics on the table" to the economists Marshall, Keynes, and Pigou in 1911. The 1911 dispute involved the effect of parental alcoholism upon children, but the challenge is general and timeless: important arguments require evidence, and quantitative evidence requires statistical evaluation. Some essays examine deep and subtle statistical ideas such as the aggregation and regression paradoxes; others tell of the origin of the Average Man and the evaluation of fingerprints as a forerunner of the use of DNA in forensic science. Several of the essays are entirely nontechnical; all examine statistical ideas with an ironic eye for their essence and what their history can tell us about current disputes.
Statistical Rethinking 豆瓣
作者: Richard McElreath Chapman and Hall/CRC 2015
Statistical Rethinking: A Bayesian Course with Examples in R and Stan builds readers’ knowledge of and confidence in statistical modeling. Reflecting the need for even minor programming in today’s model-based statistics, the book pushes readers to perform step-by-step calculations that are usually automated. This unique computational approach ensures that readers understand enough of the details to make reasonable choices and interpretations in their own modeling work.
The text presents generalized linear multilevel models from a Bayesian perspective, relying on a simple logical interpretation of Bayesian probability and maximum entropy. It covers from the basics of regression to multilevel models. The author also discusses measurement error, missing data, and Gaussian process models for spatial and network autocorrelation.
By using complete R code examples throughout, this book provides a practical foundation for performing statistical inference. Designed for both PhD students and seasoned professionals in the natural and social sciences, it prepares them for more advanced or specialized statistical modeling.
Introduction to Statistical Decision Theory 豆瓣
作者: John Pratt / Howard Raiffa The MIT Press 2008 - 1
The Bayesian revolution in statistics - where statistics is integrated with decision making in areas such as management, public policy, engineering, and clinical medicine - is here to stay. Introduction to Statistical Decision Theory states the case and in a self-contained, comprehensive way shows how the approach is operational and relevant for real-world decision making under uncertainty.Starting with an extensive account of the foundations of decision theory, the authors develop the intertwining concepts of subjective probability and utility. They then systematically and comprehensively examine the Bernoulli, Poisson, and Normal (univariate and multivariate) data generating processes. For each process they consider how prior judgments about the uncertain parameters of the process are modified given the results of statistical sampling, and they investigate typical decision problems in which the main sources of uncertainty are the population parameters. They also discuss the value of sampling information and optimal sample sizes given sampling costs and the economics of the terminal decision problems.Unlike most introductory texts in statistics, Introduction to Statistical Decision Theory integrates statistical inference with decision making and discusses real-world actions involving economic payoffs and risks. After developing the rationale and demonstrating the power and relevance of the subjective, decision approach, the text also examines and critiques the limitations of the objective, classical approach.
贝叶斯方法与科学合理性 豆瓣
作者: 陈晓平 人民出版社 2010 - 2
康德说,休谟把他从独断论的迷梦中惊醒。笔者认为,如果休谟能够看到康德的书,他也许会说:康德把他从经验论的泥潭中拯救。在笔者看来,这两位最伟大的哲学家,其哲学思想是相互补充的。 本书是在贝叶斯理论的框架内,采纳并改进康德的先验范畴,进而给出休谟问题的一种解决。本书对若干有关科学合理性的疑难问题给以回应,其中包括反事实条件句与科学定律、分析与综合、还原与突现以及迪昂-奎因问题等。略
The History of Econometric Ideas 豆瓣
作者: Mary S. Morgan Cambridge University Press 2008 - 1
The History of Econometric Ideas covers the period from the late nineteenth century to the middle of the twentieth century, illustrating how economists first learned to harness statistical methods to measure and test the "laws" of economics. Though scholarly, Dr. Morgan's book is very accessible; it does not require a high level of prior statistical knowledge, and will be of interest to practicing statisticians and economists.
Data Mining, Fourth Edition: Practical Machine Learning Tools and Techniques (Morgan Kaufmann Series in Data Management Systems) 豆瓣
作者: Ian H. Witten / Eibe Frank Morgan Kaufmann 2016
Data Mining: Practical Machine Learning Tools and Techniques, Fourth Edition, offers a thorough grounding in machine learning concepts, along with practical advice on applying these tools and techniques in real-world data mining situations. This highly anticipated fourth edition of the most acclaimed work on data mining and machine learning teaches readers everything they need to know to get going, from preparing inputs, interpreting outputs, evaluating results, to the algorithmic methods at the heart of successful data mining approaches.
Extensive updates reflect the technical changes and modernizations that have taken place in the field since the last edition, including substantial new chapters on probabilistic methods and on deep learning. Accompanying the book is a new version of the popular WEKA machine learning software from the University of Waikato. Authors Witten, Frank, Hall, and Pal include today's techniques coupled with the methods at the leading edge of contemporary research.
Provides a thorough grounding in machine learning concepts, as well as practical advice on applying the tools and techniques to data mining projectsPresents concrete tips and techniques for performance improvement that work by transforming the input or output in machine learning methodsIncludes a downloadable WEKA software toolkit, a comprehensive collection of machine learning algorithms for data mining tasks-in an easy-to-use interactive interfaceIncludes open-access online courses that introduce practical applications of the material in the book
Statistical Learning from a Regression Perspective (Springer Series in Statistics) 豆瓣
作者: Richard A. Berk Springer 2008 - 7
Statistical Learning from a Regression Perspective considers statistical learning applications when interest centers on the conditional distribution of the response variable, given a set of predictors, and when it is important to characterize how the predictors are related to the response. As a first approximation, this is can be seen as an extension of nonparametric regression. Among the statistical learning procedures examined are bagging, random forests, boosting, and support vector machines. Response variables may be quantitative or categorical. Real applications are emphasized, especially those with practical implications. One important theme is the need to explicitly take into account asymmetric costs in the fitting process. For example, in some situations false positives may be far less costly than false negatives. Another important theme is to not automatically cede modeling decisions to a fitting algorithm. In many settings, subject-matter knowledge should trump formal fitting criteria. Yet another important theme is to appreciate the limitation of one's data and not apply statistical learning procedures that require more than the data can provide. The material is written for graduate students in the social and life sciences and for researchers who want to apply statistical learning procedures to scientific and policy problems. Intuitive explanations and visual representations are prominent. All of the analyses included are done in R.
Multivariate Dependencies 豆瓣
作者: D.R. Cox / N. Wermuth Chapman & Hall/CRC 1996 - 3
Large observational studies involving research questions that require the measurement of several features on each individual arise in many fields including the social and medical sciences. This book sets out both the general concepts and the more technical statistical issues involved in analysis and interpretation. Numerous illustrative examples are described in outline and four studies are discussed in some detail. The use of graphical representations of dependencies and independencies among the features under study is stressed, both to incorporate available knowledge at the planning stage of an analysis and to summarize aspects important for interpretation after detailed statistical analysis is complete. This book is aimed at research workers using statistical methods as well as statisticians involved in empirical research.
Principles of Statistical Inference 豆瓣
作者: D. R. Cox Cambridge University Press 2006 - 8
In this definitive book, D. R. Cox gives a comprehensive and balanced appraisal of statistical inference. He develops the key concepts, describing and comparing the main ideas and controversies over foundational issues that have been keenly argued for more than two-hundred years. Continuing a sixty-year career of major contributions to statistical thought, no one is better placed to give this much-needed account of the field. An appendix gives a more personal assessment of the merits of different ideas. The content ranges from the traditional to the contemporary. While specific applications are not treated, the book is strongly motivated by applications across the sciences and associated technologies. The mathematics is kept as elementary as feasible, though previous knowledge of statistics is assumed. The book will be valued by every user or student of statistics who is serious about understanding the uncertainty inherent in conclusions from statistical analyses.
The Nature of Statistical Learning Theory 豆瓣
作者: Vladimir Vapnik Springer 1999 - 11
The aim of this book is to discuss the fundamental ideas which lie behind the statistical theory of learning and generalization. It considers learning as a general problem of function estimation based on empirical data. Omitting proofs and technical details, the author concentrates on discussing the main results of learning theory and their connections to fundamental problems in statistics. This second edition contains three new chapters devoted to further development of the learning theory and SVM techniques. Written in a readable and concise style, the book is intended for statisticians, mathematicians, physicists, and computer scientists.