公卫人

[Repost] Google's Culturomics Goes Live

iavjssssmqee posted on 2010-12-19 21:01:11

http://www.culturomics.org/
On December 16th, 2010, a team spanning the Cultural Observatory, Harvard, Encyclopaedia Britannica, the American Heritage Dictionary, and Google published a paper describing the Culturomics approach online in the journal Science, and at the same time launched the world's first real-time culturomic browser on Google Labs.

In short: on December 16, 2010, the world's first real-time culturomics browser went live.

See the attachments for the related Science news coverage and the Research Article.

A few excerpts follow:

The resulting corpus contains over 500 billion words, in English (361 billion), French (45 billion), Spanish (45 billion), German (37 billion), Chinese (13 billion), Russian (35 billion), and Hebrew (2 billion). The oldest works were published in the 1500s. The early decades are represented by only a few books per year, comprising several hundred thousand words. By 1800, the corpus grows to 60 million words per year; by 1900, 1.4 billion; and by 2000, 8 billion.
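
As a quick sanity check on the excerpt above, the per-language figures can be added up to see how they relate to the "over 500 billion" total. This is only a sketch: the names `words_billion`, `total_billion`, and `shares_percent` are illustrative, and the numbers are copied from the quoted passage rather than computed from the corpus itself.

```python
# Word counts per language, in billions, copied from the quoted excerpt.
# Variable names are illustrative; they do not come from the paper.
words_billion = {
    "English": 361,
    "French": 45,
    "Spanish": 45,
    "German": 37,
    "Chinese": 13,
    "Russian": 35,
    "Hebrew": 2,
}

# Sum of the quoted per-language figures.
total_billion = sum(words_billion.values())

# Each language's share of that sum, as a percentage rounded to one decimal.
shares_percent = {
    lang: round(100 * count / total_billion, 1)
    for lang, count in words_billion.items()
}

print(f"Total: {total_billion} billion words")
for lang, pct in sorted(shares_percent.items(), key=lambda kv: -kv[1]):
    print(f"{lang:8s} {pct:5.1f}%")
```

By these figures the languages sum to 538 billion words, and English accounts for roughly two thirds of the corpus.
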
The corpus cannot be read by a human. If you tried to read only the entries from the year 2000 alone, at the reasonable pace of 200 words/minute, without interruptions for food or sleep, it would take eighty years. The sequence of letters is one thousand times longer than the human genome: if you wrote it out in a straight line, it would reach to the moon and back 10 times over [8].
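
The "eighty years" figure can be reproduced from the numbers quoted in these excerpts (8 billion words for the year 2000, 200 words per minute). A minimal sketch, assuming a year of 365.25 days; the constant names are illustrative and not taken from the paper:

```python
# Figures quoted in the excerpts.
WORDS_IN_2000 = 8_000_000_000   # "by 2000, 8 billion" words per year
WORDS_PER_MINUTE = 200          # "reasonable pace of 200 words/minute"

# Assumption: an average year of 365.25 days, reading around the clock.
MINUTES_PER_YEAR = 60 * 24 * 365.25

# Total reading time in minutes, then converted to years.
minutes = WORDS_IN_2000 / WORDS_PER_MINUTE
years = minutes / MINUTES_PER_YEAR

print(f"{years:.1f} years of nonstop reading")
```

This comes out at roughly 76 years, which is in line with the "eighty years" in the excerpt.
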

Science-2010-Michel-science[1].1199644.part1.rar

500 KB, downloads: 19

Science-2010-Michel-science[1].1199644.part2.rar

500 KB, downloads: 1

Science-2010-Michel-science[1].1199644.part3.rar

316.76 KB, downloads: 1

iavjssssmqee (original poster) posted on 2010-12-19 21:02:21
NEWS OF THE WEEK
DIGITAL DATA
Google Opens Books to New Cultural Studies

See the attachment.

Science-2010-Bohannon-1600.pdf

180 KB, downloads: 1

iavjssssmqee (original poster) posted on 2010-12-19 21:02:47
More in-depth coverage:

[Science] Culturomics
http://news.dxy.cn/bbs/topic/19019603?tpg=3&age=0