May 2009 - May 2017 论坛八周年-你的足迹,我的骄傲



查看: 3187|回复: 8

[DataScience] 【转载】如何成为一个数据科学家:Becoming a Data Scientist – Curriculum via Metr

[复制链接] |试试Instant~ |关注本帖
EroicaCMCS 发表于 2013-12-29 12:58:38 | 显示全部楼层 |阅读模式


您需要 登录 才可以下载或查看,没有帐号?获取更多干活,快来注册

本帖最后由 EroicaCMCS 于 2013-12-29 13:04 编辑

最近看到一张图,road to data scientist:
. 1point 3acres 璁哄潧


Becoming a Data Scientist – Curriculum via Metromap
8 Jul 2013 by Swami Chandrasekaran.鏈枃鍘熷垱鑷1point3acres璁哄潧

Data Science, Machine Learning, Big Data analytics, Cognitive Computing …. well all of us have been avalanched with articles, skills demand info graph’s and point of views on these topics (yawn!). One thing is for sure; you cannot become a data scientist overnight. Its a journey, for sure a challenging one. But how do you go about becoming one? Where to start? When do you start seeing light at the end of the tunnel? What is the learning roadmap? What tools and techniques do I need to know? How will you know when you have achieved your goal?
Given how critical visualization is for data science, ironically I was not able to find (except for a few), pragmatic and yet visual representation of what it takes to become a data scientist. So here is my modest attempt at creating a curriculum, a learning plan that one can use in this becoming a data scientist journey. I took inspiration from the metro maps and used it to depict the learning path. I organized the overall plan progressively into the following areas / domains,
  • Fundamentals
  • Statistics
  • Programming
  • Machine Learning
  • Text Mining / Natural Language Processing
  • Data Visualization
  • Big Data
  • Data Ingestion
  • Data Munging
  • Toolbox
Each area  / domain is represented as a “metro line”, with the stations depicting the topics you must learn / master / understand in a progressive fashion. The idea is you pick a line, catch a train and go thru all the stations (topics) till you reach the final destination (or) switch to the next line. I have progressively marked each station (line) 1 thru 10 to indicate the order in which you travel. You can use this as an individual learning plan to identify the areas you most want to develop and the acquire skills. By no means this is the end; but a solid start. Feel free to leave your comments and constructive feedback.
PS: I did not want to impose the use of any commercial tools in this plan. I have based this plan on tools/libraries available as open source for the most part. If you have access to a commercial software such as IBM SPSS or SAS Enterprise Miner, by all means go for it. The plan still holds good.
PS: I originally wanted to create an interactive visualization using D3.js or InfoVis. But wanted to get this out quickly. Maybe I will do an interactive map in the next iteration.






FTD2014 发表于 2013-12-29 14:17:56 | 显示全部楼层
good stuff!
回复 支持 反对

使用道具 举报

readman 发表于 2013-12-29 15:17:34 | 显示全部楼层
回复 支持 反对

使用道具 举报

粮果 发表于 2013-12-29 20:04:23 | 显示全部楼层
回复 支持 反对

使用道具 举报

 楼主| EroicaCMCS 发表于 2013-12-29 22:27:24 | 显示全部楼层
回复 支持 反对

使用道具 举报

nibuxing 发表于 2013-12-29 22:37:21 | 显示全部楼层
很清晰,和doing data science这本书基本一致,但要涉及的面太广了,还在努力学习中。
回复 支持 反对

使用道具 举报

小K 发表于 2013-12-30 04:58:20 | 显示全部楼层
很强大! 不过就是感觉侧重有不同。做text mining的跟做统计的基本不是对着同一种职业的样子。。。。
回复 支持 反对

使用道具 举报

readman 发表于 2013-12-30 09:09:45 | 显示全部楼层
回复 支持 反对

使用道具 举报

yifan66 发表于 2014-1-3 16:40:55 | 显示全部楼层
回复 支持 反对

使用道具 举报



一亩三分地推荐上一条 /5 下一条

手机版|小黑屋|一亩三分地论坛声明 ( 沪ICP备11015994号 )

custom counter

GMT+8, 2017-5-29 03:46

Powered by Discuz! X3

© 2001-2013 Comsenz Inc. Design By HUXTeam

快速回复 返回顶部 返回列表