楼主: 小二来耕地
跳转到指定楼层
上一主题 下一主题
收起左侧

[统计--就业] data analyst面试统计知识总结~求大米

   
全局:
What is the Central Limit Theorem and why is it important?
“Suppose that we are interested in estimating the average height among all people. Collecting data for every person in the world is impossible. While we can’t obtain a height measurement from everyone in the population, we can still sample some people. The question now becomes, what can we say about the average height of the entire population given a single sample. The Central Limit Theorem addresses this question exactly.” Read more here.
What is sampling? How many sampling methods do you know?
“Data sampling is a statistical analysis technique used to select, manipulate and analyze a representative subset of data points to identify patterns and trends in the larger data set being examined.” Read the full answer here.
What is the difference between type I vs type II error?
“A type I error occurs when the null hypothesis is true, but is rejected. A type II error occurs when the null hypothesis is false, but erroneously fails to be rejected.” Read the full answer here.
What is linear regression? What do the terms p-value, coefficient, and r-squared value mean? What is the significance of each of these components?
. ----A linear regression is a good tool for quick predictive analysis: for example, the price of a house depends on a myriad of factors, such as its size or its location. In order to see the relationship between these variables, we need to build a linear regression, which predicts the line of best fit between them and can help conclude whether or not these two factors have a positive or negative relationship. Read more here and here.
. ----What are the assumptions required for linear regression?
There are four major assumptions: 1. There is a linear relationship between the dependent variables and the regressors, meaning the model you are creating actually fits the data, 2. The errors or residuals of the data are normally distributed and independent from each other, 3. There is minimal multicollinearity between explanatory variables, and 4. Homoscedasticity. This means the variance around the regression line is the same for all values of the predictor variable.
What is a statistical interaction?
”Basically, an interaction is when the effect of one factor (input variable) on the dependent variable (output variable) differs among levels of another factor.” Read more here.
What is selection bias?
“Selection (or ‘sampling’) bias occurs in an ‘active,’ sense when the sample data that is gathered and prepared for modeling has characteristics that are not representative of the true, future population of cases the model will see. That is, active selection bias occurs when a subset of the data are systematically (i.e., non-randomly) excluded from analysis.” Read more here.
What is an example of a data set with a non-Gaussian distribution?
“The Gaussian distribution is part of the Exponential family of distributions, but there are a lot more of them, with the same sort of ease of use, in many cases, and if the person doing the machine learning has a solid grounding in statistics, they can be utilized where appropriate.” Read more here.
What is the Binomial Probability Formula?
“The binomial distribution consists of the probabilities of each of the possible numbers of successes on N trials for independent events that each have a probability of π (the Greek letter pi) of occurring.” Read more here.
回复

使用道具 举报

🔗
angle22 2019-3-13 04:01:53 | 只看该作者
全局:
很有用,为朋友收藏起来!
回复

使用道具 举报

🔗
LuLukid 2019-3-25 12:33:41 | 只看该作者
本楼:
全局:
mark mark
回复

使用道具 举报

🔗
siizhuoo 2019-3-26 00:57:56 | 只看该作者
全局:
哇!楼主好赞!!!!
回复

使用道具 举报

全局:
请问楼主准备这些面试的时候用什么复习书比较合适啊 谢谢~~~~~
回复

使用道具 举报

🔗
ihatetaitea 2019-3-27 06:47:55 | 只看该作者
本楼:
全局:
谢谢分享
回复

使用道具 举报

🔗
 楼主| 小二来耕地 2019-3-28 04:23:25 | 只看该作者
全局:
pixie1994 发表于 2019-3-27 05:02
请问楼主准备这些面试的时候用什么复习书比较合适啊 谢谢~~~~~

我用了国内的统计书,然后还有美国研究生上课老师发的讲义~还有就是网上搜哈哈
回复

使用道具 举报

🔗
jackie_hu 2019-3-30 12:35:56 | 只看该作者
本楼:
全局:
xiexie lz!
回复

使用道具 举报

🔗
Sherry4869 2019-5-4 05:25:31 | 只看该作者
全局:
mark, mark, thank you!
回复

使用道具 举报

🔗
可乐甜了 2019-5-9 07:07:00 | 只看该作者
全局:
楼主有心了,能整理出来还把它放在网上,谢谢~
回复

使用道具 举报

您需要登录后才可以回帖 登录 | 注册账号
隐私提醒:
  • ☑ 禁止发布广告,拉群,贴个人联系方式:找人请去🔗同学同事飞友,拉群请去🔗拉群结伴,广告请去🔗跳蚤市场,和 🔗租房广告|找室友
  • ☑ 论坛内容在发帖 30 分钟内可以编辑,过后则不能删帖。为防止被骚扰甚至人肉,不要公开留微信等联系方式,如有需求请以论坛私信方式发送。
  • ☑ 干货版块可免费使用 🔗超级匿名:面经(美国面经、中国面经、数科面经、PM面经),抖包袱(美国、中国)和录取汇报、定位选校版
  • ☑ 查阅全站 🔗各种匿名方法

本版积分规则

>
快速回复 返回顶部 返回列表