Statistics
Causal Inference 豆瓣 谷歌图书
作者: Hernán MA / Robins JM 出版社: Boca Raton: Chapman & Hall/CRC 2020
Causal inference is a complex scientific task that relies on evidence from multiple sources and a variety of methodological approaches. By providing a cohesive presentation of concepts and methods that are currently scattered across journals in several disciplines, Causal Inference: What If provides an introduction to causal inference for scientists who design studies and analyze data. The book is divided into three parts of increasing difficulty: causal inference without models, causal inference with models, and causal inference from complex longitudinal data.FEATURES:
- Emphasizes taking the causal question seriously enough to articulate it with sufficient precision
- Shows that causal inference from observational data relies on subject-matter knowledge and therefore cannot be reduced to a collection of recipes for data analysis
- Describes causal diagrams, both directed acyclic graphs and single-world intervention graphs
- Explains various data analysis approaches to estimate causal effects from individual-level data, including the g-formula, inverse probability weighting, g-estimation, instrumental variable estimation, outcome regression, and propensity score adjustment
- Includes software and real data examples, as well as 'Fine Points' and 'Technical Points' throughout to elaborate on certain key topicsCausal Inference: What If has been written for all scientists that make causal inferences, including epidemiologists, statisticians, psychologists, economists, sociologists, political scientists, computer scientists, and more. The book is substantially class-tested, as it has been used in dozens of universities to teach courses on causal inference at graduate and advanced undergraduate level.
Analyzing Linguistic Data 豆瓣
作者: R. H. Baayen 出版社: Cambridge University Press 2008 - 3
Statistical analysis is a useful skill for linguists and psycholinguists, allowing them to understand the quantitative structure of their data. This textbook provides a straightforward introduction to the statistical analysis of language. Designed for linguists with a non-mathematical background, it clearly introduces the basic principles and methods of statistical analysis, using 'R', the leading computational statistics programme. The reader is guided step-by-step through a range of real data sets, allowing them to analyse acoustic data, construct grammatical trees for a variety of languages, quantify register variation in corpus linguistics, and measure experimental data using state-of-the-art models. The visualization of data plays a key role, both in the initial stages of data exploration and later on when the reader is encouraged to criticize various models. Containing over 40 exercises with model answers, this book will be welcomed by all linguists wishing to learn more about working with and presenting quantitative data.
句法计量分析 豆瓣
作者: [德]莱因哈德·科勒 译者: 章红新 / 丛进 出版社: 商务印书馆 2019 - 3
本书阐释了构建符合科学哲学的语言学理论的主要思路,介绍了句法层面的定量概念与属性及其研究方法,在此基础上探讨了频率、复杂度、位置、深度、长度、信息量以及若干总藏之间等句法属性的关系,以及语言需求对于这些属性和关系的影响,从而建立了句法子系统协同模型。
本书将句法研究和计量语言学两个领域结合起来进行研究,用大量的事例说明计量模型和方法也可以应用到句法研究领域,为句法计量研究提供了新的思路,可供对句法或计量语言学感兴趣的研究者参阅。
2020年1月3日 已读
少数会从科学哲学引入的语言学领域,关注到这个领域学者重视Mario Bunge的系统了。自己以前对协同语言学的语言是一个自组织系统有误解,觉得过于偏重语言独立性,现在知道了原来它是用边界条件作为系统需求,而且分类已经比较细了。中间的篇幅相当于研究简报,研究都很统计学,取径则多用门策拉-阿尔特曼定律。借哥德尔配数函数方法做语言分形维度让人眼前一亮,风格学当如此,当然还提到词性的秩次也不错。开头吐槽二元论和集合论等影响下的“绝对静止观”确是20世纪语言学的大环境和缺陷,语料充裕领域从质性走向量化是必须的,觉得科勒基于其“定义清楚”做的动链单位是区别于旧的组合结构模式单位(如小句、短语)的计量好工具,其介绍的“跨范畴和谐”还有把配价看成非恒定特征(如补足语与附加语由计量属性替代)都是二元好解药。
Statistics Syntax Ynemlophics lib nobutdunbuy
Understanding Advanced Statistical Methods 豆瓣
作者: Peter Westfall / Kevin S. S. Henning 出版社: Chapman and Hall/CRC 2013 - 5
Providing a much-needed bridge between elementary statistics courses and advanced research methods courses, Understanding Advanced Statistical Methods helps students grasp the fundamental assumptions and machinery behind sophisticated statistical topics, such as logistic regression, maximum likelihood, bootstrapping, nonparametrics, and Bayesian methods. The book teaches students how to properly model, think critically, and design their own studies to avoid common errors. It leads them to think differently not only about math and statistics but also about general research and the scientific method. With a focus on statistical models as producers of data, the book enables students to more easily understand the machinery of advanced statistics. It also downplays the "population" interpretation of statistical models and presents Bayesian methods before frequentist ones. Requiring no prior calculus experience, the text employs a "just-in-time" approach that introduces mathematical topics, including calculus, where needed. Formulas throughout the text are used to explain why calculus and probability are essential in statistical modeling. The authors also intuitively explain the theory and logic behind real data analysis, incorporating a range of application examples from the social, economic, biological, medical, physical, and engineering sciences. Enabling your students to answer the why behind statistical methods, this text teaches them how to successfully draw conclusions when the premises are flawed. It empowers them to use advanced statistical methods with confidence and develop their own statistical recipes. Ancillary materials are available on the book's website.
Graphical Models, Exponential Families, and Variational Inference 豆瓣
作者: Martin J Wainwright / Michael I Jordan 出版社: Now Publishers Inc 2008
The formalism of probabilistic graphical models provides a unifying framework for capturing complex dependencies among random variables, and building large-scale multivariate statistical models. Graphical models have become a focus of research in many statistical, computational and mathematical fields, including bioinformatics, communication theory, statistical physics, combinatorial optimization, signal and image processing, information retrieval and statistical machine learning. Many problems that arise in specific instances-including the key problems of computing marginals and modes of probability distributions-are best studied in the general setting. Working with exponential family representations, and exploiting the conjugate duality between the cumulant function and the entropy for exponential families, Graphical Models, Exponential Families and Variational Inference develops general variational representations of the problems of computing likelihoods, marginal probabilities and most probable configurations. It describes how a wide variety of algorithms- among them sum-product, cluster variational methods, expectation-propagation, mean field methods, and max-product-can all be understood in terms of exact or approximate forms of these variational representations. The variational approach provides a complementary alternative to Markov chain Monte Carlo as a general source of approximation methods for inference in large-scale statistical models.
The Book of Why 豆瓣
作者: Judea Pearl / Dana Mackenzie 出版社: Allen Lane 2018 - 5
A Turing Award-winning computer scientist and statistician shows how understanding causality has revolutionized science and will revolutionize artificial intelligence
"Correlation is not causation." This mantra, chanted by scientists for more than a century, has led to a virtual prohibition on causal talk. Today, that taboo is dead. The causal revolution, instigated by Judea Pearl and his colleagues, has cut through a century of confusion and established causality--the study of cause and effect--on a firm scientific basis. His work explains how we can know easy things, like whether it was rain or a sprinkler that made a sidewalk wet; and how to answer hard questions, like whether a drug cured an illness. Pearl's work enables us to know not just whether one thing causes another: it lets us explore the world that is and the worlds that could have been. It shows us the essence of human thought and key to artificial intelligence. Anyone who wants to understand either needs The Book of Why.
2018年11月17日 已读
科学法是贝叶斯定理的一次应用。因果图形式化因果结构,do算子对有向无环图中指向X的有向边全部切断。由于变量不能全部观测,用前门准则来控制无法观察到的混杂因素,与RCT目标一致;若变量集合Z相对于(X,Y)满足后门准则,则X到Y因果可识别。感觉这些都是对相关性不能解决以及解决起来复杂的问题透明优化。反事实算法则扩宽数据解答问题的范围,NIE形式化间接影响。结构因果模型很大的一个优点就是对于线性非线性函数、离散或连续变量都有效。作者太卖关子,前几章讲统计学史,旧故事很多,7-9章是干货。思路是经典宏观实践的,因果哲学讲得很浅。但是应用领域极为广泛,毕竟是对相关性大改良,文科也能用呐。不知道因果模型处理相互干涉和叠加态什么的会怎么样。可能要读Causality一书才能深刻了解本书数学化的严格证明。
AI Causality JudeaPearl Judea_Pearl Reason
贝叶斯统计 豆瓣
作者: 茆诗松 出版社: 中国统计出版社 1999 - 1
《高等院校统计学专业规划教材•贝叶斯统计》共六章,可分二部分。前三章围绕先验分布介绍贝叶斯推断方法。后三章围绕损失函数介绍贝叶斯决策方法。阅读这些内容仅需要概率统计基本知识就够了。《高等院校统计学专业规划教材•贝叶斯统计》力图用生动有趣的例子来说明贝叶斯统计的基本思想和基本方法,尽量使读者对贝叶斯统计产生兴趣,引发读者使用贝叶方法去认识和解决实际问题的愿望。进而去丰富和发展贝叶斯统计。假如学生的兴趣被钓出来,愿望被引出来,那么讲授这一门课的目的也基本达到了。
Most Honourable Remembrance 豆瓣
作者: Andrew I. Dale 出版社: Springer 2003
"Interesting and useful as all this will be for anyone interested in knowing more about Bayes, this is just part of the riches contained in this book ...Beyond doubt this book is a work of the highest quality in terms of the scholarship it displays, and should be regarded as a must for every mathematical library." --MAA ONLINE
行为统计学基础 豆瓣
作者: 理查德·P·鲁尼 译者: 王星 出版社: 中国人民大学 2007 - 6
对统计学的学习来说,最主要的是掌握统计思想,理解相关的统计原理,能够根据实际情境提出解决问题的一个或几个合适方案,并懂得选择其中的最优。因此适合非统计专业学生的统计学理想教材,应该是能兼顾专业特点、深入浅出阐述统计学基本原理和方法,同时在轻快风趣的讲述中激发读者的学习兴趣,培养统计思维,并辅之例题分析,对使用中容易发生的错误加以提醒,切实提高学生应用统计方法分析解决实际问题的能力。《行为统计学基础》(第9版)正是这样一本非常出色的教材。本书写作风格轻松活泼,语言流畅易懂,数学深入浅出,读者在学习和阅读时不会感到枯燥乏味。
本书是心理和教育统计学方面的一本优秀的基础教材,对于在社会科学领域中的广大研究人员来说,也是一本不可多得的重要参考书
Causal Inference in Statistics, Social, and Biomedical Sciences 豆瓣
作者: Guido W. Imbens / Donald B. Rubin 出版社: Cambridge University Press 2015 - 3
Most questions in social and biomedical sciences are causal in nature: what would happen to individuals, or to groups, if part of their environment were changed? In this groundbreaking text, two world-renowned experts present statistical methods for studying such questions. This book starts with the notion of potential outcomes, each corresponding to the outcome that would be realized if a subject were exposed to a particular treatment or regime. In this approach, causal effects are comparisons of such potential outcomes. The fundamental problem of causal inference is that we can only observe one of the potential outcomes for a particular subject. The authors discuss how randomized experiments allow us to assess causal effects and then turn to observational studies. They lay out the assumptions needed for causal inference and describe the leading analysis methods, including, matching, propensity-score methods, and instrumental variables. Many detailed applications are included, with special focus on practical aspects for the empirical researcher.
The History of Statistics 豆瓣
作者: Stephen M. Stigler 出版社: Belknap Press 1990 - 3
Review
Journal of Modern History : The book is a pleasure to read: the prose sparkles; the protagonists are vividly drawn; the illustrations are handsome and illuminating; the insights plentiful and sharp. This will remain the definitive work on the early development of mathematical statistics for some time to come.
--Lorraine J. Daston
Science : An exceptionally searching, almost loving, study of the relevant inspirations and aberrations of its principal characters James Bernoulli, de Moivre, Bayes, Laplace, Gauss, Quetelet, Lexis, Galton, Edgeworth, and Pearson, not neglecting a grand supporting cast...The definitive record of an intellectual Golden Age, an overoptimistic climb to a height not to be maintained.
--M. Stone
New York Times Book Review : One is tempted to say that the history of statistics in the nineteenth century will be associated with the name Stigler.
--Morris Kline
Contemporary Psychology : In this tour de force of careful scholarship, Stephen Stigler has laid bare the people, ideas, and events underlying the development of statistics...He has written an important and wonderful book...Sometimes Stigler's prose is so evocative it is almost poetic.
--Howard Wainer
Review
Stigler's book exhibits a rare combination of mastery of technical materials, sensitivity to conceptual milieu, and near exhaustive familiarity with primary sources. An exemplary study
--Lorraine Daston
Causality 豆瓣
作者: Judea Pearl 出版社: Cambridge University Press 2009 - 9
Written by one of the preeminent researchers in the field, this book provides a comprehensive exposition of modern analysis of causation. It shows how causality has grown from a nebulous concept into a mathematical theory with significant applications in the fields of statistics, artificial intelligence, economics, philosophy, cognitive science, and the health and social sciences. Judea Pearl presents and unifies the probabilistic, manipulative, counterfactual, and structural approaches to causation and devises simple mathematical tools for studying the relationships between causal connections and statistical associations. Cited in more than 2,100 scientific publications, it continues to liberate scientists from the traditional molds of statistical thinking. In this revised edition, Judea Pearl elucidates thorny issues, answers readers' questions, and offers a panoramic view of recent advances in this field of research. Causality will be of interest to students and professionals in a wide variety of fields. Dr Judea Pearl has received the 2011 Rumelhart Prize for his leading research in Artificial Intelligence (AI) and systems from The Cognitive Science Society.
Time Series Analysis 豆瓣
作者: George E. P. Box / Gwilym M. Jenkins 出版社: Wiley 2008 - 6
A modernized new edition of one of the most trusted books on time series analysis. Since publication of the first edition in 1970, Time Series Analysis has served as one of the most influential and prominent works on the subject. This new edition maintains its balanced presentation of the tools for modeling and analyzing time series and also introduces the latest developments that have occurred n the field over the past decade through applications from areas such as business, finance, and engineering. The Fourth Edition provides a clearly written exploration of the key methods for building, classifying, testing, and analyzing stochastic models for time series as well as their use in five important areas of application: forecasting; determining the transfer function of a system; modeling the effects of intervention events; developing multivariate dynamic models; and designing simple control schemes. Along with these classical uses, modern topics are introduced through the book's new features, which include: A new chapter on multivariate time series analysis, including a discussion of the challenge that arise with their modeling and an outline of the necessary analytical tools New coverage of forecasting in the design of feedback and feedforward control schemes A new chapter on nonlinear and long memory models, which explores additional models for application such as heteroscedastic time series, nonlinear time series models, and models for long memory processes Coverage of structural component models for the modeling, forecasting, and seasonal adjustment of time series A review of the maximum likelihood estimation for ARMA models with missing values Numerous illustrations and detailed appendices supplement the book,while extensive references and discussion questions at the end of each chapter facilitate an in-depth understanding of both time-tested and modern concepts. With its focus on practical, rather than heavily mathematical, techniques, Time Series Analysis , Fourth Edition is the upper-undergraduate and graduate levels. this book is also an invaluable reference for applied statisticians, engineers, and financial analysts.
点击链接进入中文版:
时间序列分析:预测与控制
Statistical Rethinking 豆瓣
作者: Richard McElreath 出版社: Chapman and Hall/CRC 2015
Statistical Rethinking: A Bayesian Course with Examples in R and Stan builds readers’ knowledge of and confidence in statistical modeling. Reflecting the need for even minor programming in today’s model-based statistics, the book pushes readers to perform step-by-step calculations that are usually automated. This unique computational approach ensures that readers understand enough of the details to make reasonable choices and interpretations in their own modeling work.
The text presents generalized linear multilevel models from a Bayesian perspective, relying on a simple logical interpretation of Bayesian probability and maximum entropy. It covers from the basics of regression to multilevel models. The author also discusses measurement error, missing data, and Gaussian process models for spatial and network autocorrelation.
By using complete R code examples throughout, this book provides a practical foundation for performing statistical inference. Designed for both PhD students and seasoned professionals in the natural and social sciences, it prepares them for more advanced or specialized statistical modeling.
Probability Theory 豆瓣 Goodreads
Probability Theory: The Logic of Science
作者: E. T. Jaynes 出版社: Cambridge University Press 2003 - 6
The standard rules of probability can be interpreted as uniquely valid principles in logic. In this book, E. T. Jaynes dispels the imaginary distinction between 'probability theory' and 'statistical inference', leaving a logical unity and simplicity, which provides greater technical power and flexibility in applications. This book goes beyond the conventional mathematics of probability theory, viewing the subject in a wider context. New results are discussed, along with applications of probability theory to a wide variety of problems in physics, mathematics, economics, chemistry and biology. It contains many exercises and problems, and is suitable for use as a textbook on graduate level courses involving data analysis. The material is aimed at readers who are already familiar with applied mathematics at an advanced undergraduate level or higher. The book will be of interest to scientists working in any area where inference from incomplete information is necessary.
Bayesian Reasoning and Machine Learning 豆瓣 Goodreads
作者: David Barber 出版社: Cambridge University Press 2011 - 3
Machine learning methods extract value from vast data sets quickly and with modest resources. They are established tools in a wide range of industrial applications, including search engines, DNA sequencing, stock market analysis, and robot locomotion, and their use is spreading rapidly. People who know the methods have their choice of rewarding jobs. This hands-on text opens these opportunities to computer science students with modest mathematical backgrounds. It is designed for final-year undergraduates and master's students with limited background in linear algebra and calculus. Comprehensive and coherent, it develops everything from basic reasoning to advanced techniques within the framework of graphical models. Students learn more than a menu of techniques, they develop analytical and problem-solving skills that equip them for the real world. Numerous examples and exercises, both computer based and theoretical, are included in every chapter. Resources for students and instructors, including a MATLAB toolbox, are available online.
An Introduction to Statistical Learning 豆瓣 Goodreads
9.8 (12 个评分) 作者: Gareth James / Daniela Witten 出版社: Springer 2013 - 8
An Introduction to Statistical Learning provides an accessible overview of the field of statistical learning, an essential toolset for making sense of the vast and complex data sets that have emerged in fields ranging from biology to finance to marketing to astrophysics in the past twenty years. This book presents some of the most important modeling and prediction techniques, along with relevant applications. Topics include linear regression, classification, resampling methods, shrinkage approaches, tree-based methods, support vector machines, clustering, and more. Color graphics and real-world examples are used to illustrate the methods presented. Since the goal of this textbook is to facilitate the use of these statistical learning techniques by practitioners in science, industry, and other fields, each chapter contains a tutorial on implementing the analyses and methods presented in R, an extremely popular open source statistical software platform. Two of the authors co-wrote The Elements of Statistical Learning (Hastie, Tibshirani and Friedman, 2nd edition 2009), a popular reference book for statistics and machine learning researchers. An Introduction to Statistical Learning covers many of the same topics, but at a level accessible to a much broader audience. This book is targeted at statisticians and non-statisticians alike who wish to use cutting-edge statistical learning techniques to analyze their data. The text assumes only a previous course in linear regression and no knowledge of matrix algebra.