Aug 4, 2007 - Why I am writing blog posts in English

Comments

Why I am writing blog posts in English.

The reason is elementary, my dear Watson:

  1. “The British are coming!” (Just kidding)

  2. I want to improve my English

  3. While we wait for the babel fish, I have do something.

“You forgot Poland.” — George W. Bush

====Chinese Version==中文版==

我为什么用英文写作?

最近间或写一些英文的文章, 一些读者来信表示不快. 因为我的母语是中文, Blog 也大部分是朋友在看, 用英文写作给他们的阅读也带来了麻烦. 那么, 为什么我要用英文写作呢? 我可以用两句英语解释清楚, 因为美国朋友很欢迎我用英文写作. 但我必须费点功夫告诉我的中文读者, 我这样做不是瞎折腾, 是有我自己的原因的.

  1. 我想通过写作提高英语水平

我 的英语不好, 这一点我很小就知道了. 高中时候, 英语没有城市里的小孩好; 大学时候, 英语没有那些刻苦准备GT的同学好; 四六级过是过了, 也不能算好; 跌跌撞撞到了美国, 更加知道自己英语很糟糕. 因此我觉得要有意识的提高自己的英语. 而英文写作很有帮助.

我从去年开 始订阅了<时代>, 新单词新句型记录了很多, 可惜从来没有实际用过, 转眼即忘. 英文写作能让我牢记单词和句型, 用多了还能信手拈来, 脱口而出. 如果您细心看, 就觉得我用的句型大部分会来自最近几期<时代>上的文章, 引文也来自互联网上最新的一些资讯. 我觉得模仿是比较有效的方法. 而且既然写出来给人看, 就要多次检查, 防止语法错误等等. 这样战战兢兢好几次, 现在也能独立于语法检查器写出没错误的句子了, 算是不小的进步了.

  1. 我想让更多的人了解我, 和我交流

IT 圈子中, 大家都知道王建硕(Jiansuo Wang), 毛向辉 (Issac Mao). 他们都在坚持写英文的Blog. 他们的英语水平或许都不算特别好, 不过他们都有很多的国际读者. 思想是核心, 语言是承载. 可惜在Babel Fish 到来之前, 我们还都必须受制于语言. 用英语写作, 不是我不爱母语, 而是我想更多的把我用母语表达的意思用另一种语言, 传给更多的人. 至于写英文是否崇洋媚外, 我倒不想辩论. 我尽量做到行文不中英夹杂, 因为我觉得夹杂中英文才是对母语最大的玷污. 如果您觉得看英文不舒服, 略过去就是了. 以后我尽量每周写一两篇英文的, 大家可以捧捧场, 也可以喝个倒彩. 我倒是希望通过写作英文Blog, 我的国际读者相对多一点, 读者之间的交流更加多元一点.

小提示: 您可以使用 Google Translate 把英文翻译过来. 虽然质量不够高, 差不多也能知道大致的意思了.

Aug 2, 2007 - To do or not to do

Comments

–My thoughts on making choices.

I have to admit that starting an article with the cliché “to be or not to be” 1 is somehow awkward. But actually, this is the topic of this article. No, I am not trying to answer the question asked by Hamlet. Everyone asks the same question towards different things over and over again everyday, and so do I. Several bits and pieces came to my mind recently, so I just record them down. Instead of getting the answer to the ultimate question of life, the universe and everything, which is 42 2, here, I want to figure out my principles in making choices.

**

Thought #0: Making choice is a choice, or why only the paranoid survives.**

Lots of people won’t make decisions unless they have sufficient information. But in the real world, information is necessary, but never sufficient, for making decisions. The idea never making decision before you get sufficient information is common but misleading. The long period of making decision finally hurts the outcome of that decision for lacking of time in implement it.

In making choices, our goal is to choose the best one. However, usually there is no obvious superior as the world is complicated, that’s the reason why information is needed to distinguish all the alternatives. Keep in mind that information should be helpful instead of delusive. Sometimes, conflicted information will make people lost and let the decision-making procedure be very painful. Therefore, in the decision-making process, one still makes decisions like ‘whether I should take this information’ or ‘whether I should wait for more information’. I would call this procedure “meta-decision making”.

My idea here is at any time, never let the meta-decision making take up the actual decision making time. I’ve seen more than once that someone staggered at the opportunity and hesitated before deciding. Needless to say, they finally make no better decision than roulette. There is a famous saying that only the paranoid survives. The paranoids usually make decision at the very beginning and hold on straight to the end. There is no meta-decision making for them. Sometimes they make worse decision, but hopefully, they can make superior decisions surprisingly, and they survive. Thus, please focus on the decision making itself and do not let the meta-decision murder your decision. Be aware of them, they may kill your decision.

**

Thought #1: Occam’s razor, or why more is less.**

Once upon a time in my life, I had three pretty good offers, and I have to choose one among them finally. Frankly speaking, I’ve never imaged that. Anyway, I had to choose one. I began to realize that the more is not always the better. Sometimes, we do need an Occam’s razor 3. Why is that?

The reason was because I felt satisfied with any of the choices, which means I could not simply nuke any one of them. Actually I shouldn’t always mention my achievements in the past, but please allow me to explain it in brief. My first choice is attending Peking University for the graduate study. Before taking the graduate entrance exam, I just want a try. I didn’t want anything more than an exam score. The thing turned out to be amazing that I ranked the 1st among all the students in that major–one of my favorite majors–Bio informatics. My second choice is Google China. At that time, Tina (I guess she is a senior assistant to Dr. Kai-fu Lee) told me that I have a probability of 99.9% to get an offer from Google China. In the meanwhile, I got the offer here, Washington University. Well, for some distinguished students, probably they can withdraw all of these and choose Stanford or MIT. However, for me, all these three are really really good — I had my beloved girlfriend studying in Peking University at that time; Google China was (and is) shining and flourishing; I wanted to stay in Beijing as lots of my relatives and friends were there; I wanted to have my own start up in Zhongguancun with some friends there and Web 2.0 was a buzzword at that time; USA is a free land and the major is computer science, my dreaming major; My advisor was (and is) doing excellent research work in his field; professors at Peking University were quite nice to me; to stay in Beijing would be definitely better for my parents; Gee, tons of pros and cons in my mind at that time. All of these things are twisted together. As a result, I got serious insomnia and was in a blue funk in making this decision. I would rather choose to hide under the rock.

Then, I would like to say that my uncle and my advisor gave me the Occam’s razor. My uncle suggested that I shouldn’t consider too much about others’ idea; and my advisor just told me that I could choose Beijing and Google in future. I’ve noticed that, unlike me, someone takes a different decision 4. I would like to say, there is no standard Occam’s razor. I absolutely admire him if he didn’t get insomnia in making this decision :). I am saying that the more is not the better is not because I’ve hold such three good offers and am trying to show off, I just want to say that keeping the life simple and stupid is indeed very necessary. For more details about why more is less, I recommend a Google Video 5 for you guys.

**

Thought #2: Murphy’s Law, or how to use greedy algorithm.**

This is about making decision between the current worse choice and the future better choice. Some people will take a risk of 80% probability to get another opportunity in the near future that is 20% superior over the current one. It sounds perfect, right? Since you can get a better one at a relative high probability, why bother with the current one. Now let’s do a simply mathematics. The expectation of the outcome of the future opportunity is 80% * (1+20%) = 96%. Boo, it’s worst than 1, so why not holding the current opportunity?

Most people, if not all, are very optimistic towards the future opportunities, and this 80-20 principle is universal acknowledged. But simple mathematics reveals the truth that one should never be too optimistic to put a bid on the future, unless it’s 25% or more better than the current one. In fact, in my opinion, 25% is not enough. If we take into account the time wasted in waiting for the future, I won’t bid for it unless it’s 30% or more better than the current one.

I am not trying to persuade others to be conservative. In fact, I encourage taking a risk on high-rewarded opportunity. But the Murphy’s Law states “things will go wrong in any given situation if you give them a chance.” 1 The future event will always have a larger probability to go wrong than your expectation. Therefore, if you want to be greedy, the best algorithm is not choosing the best choice in terms of result, but the best choice in terms of expectation. That’s the usage of probability. :)

**

Thought #3: No bargain choice, or don’t catch the deal if you don’t want it.**

Some people make decisions to do something not based on their need, but because doing those are easy. In other word, they want to catch the deal. For instance, a friend of mine had two choices: one was going to a big company as an intern; the other was going to US for graduate study. The previous offer would delay his admission for half a year. Actually, the previous offer, even accepted, wouldn’t help much about the graduate study here. However, he would like to choose the first one because he thought that the later one was “difficult” for him at that moment. Therefore, the previous choice is like a bargain — you can live without it, but if it comes, just get it.

I am going to say “no” to bargain choice. First, bargain choice will misdirect one from the main road. Second, as Paul Graham pointed out, bargain choice will consume your energy 6, and you will be controlled by all these bargain choices. If you don’t really want it, why get it? Remember that more is less, and too much bargain choice will degrade your vision in making choices.

I’ve put all four thoughts here. If you have other principles or idea that is worth while sharing, why not leave your comments? ;)

PS: I am not an expert in making choice per se. Here I just summarize my thoughts in making choice. I will be very glad if someone can help translate this article back to Chinese, as I really have no time to do this.

References:

Jul 25, 2007 - 10个我使用的社会化网络站点

Comments

看到了AW的文章, 仿照写一下, 10个我使用的社会化网络站点:


第一类, 不用就活不下去的四个:

Last.fm: 只要听歌, 一定开着 Last.fm. 既能让算法学习出我喜欢的音乐风格, 又能推荐一些好的音乐. 属于和计算机同开同关的类型.

Digg/Slashdot: 只要开着浏览器, 就可以在个性化主页上看到不断推送的新闻. 看了这些, 新闻基本上也就全了. 属于和浏览器同开同关的类型

Facebook: 每天检查三次, 看看朋友们在干啥. 还常常改改自己的状态(如Twitter). 每天饭前饭后六次, 属于和一日三餐同步的类型.

Wikipedia: 细心看了一下我的访问历史, 发现每天访问10次以上. 属于随工作节奏实时访问的类型.


第二类: 不用也行, 但是用了生活更美好的. [这三个我一般很少在浏览器里面输URL 直接访问]

del.icio.us: 别人都拿他做书签. 我反的, 我用它搜标签. 因为我有一个”流氓”的 emailtome 程序, 把网页抓回来用Gmail 全文搜索了. 所以很少用书签. del.icio.us 上东西都很全, 直接搜教程和技术文档比 Google 质量要好. 这个毕竟是人肉搜索. 一般通过浏览器的按钮和搜索框, 不直接上.

YouTube: 看到Last.fm 推荐的不熟悉的乐队或者新歌, 一般第一件事情就是上去看 MV. (上次Linkin Park 的 What I’ve Done 就是推荐给我的, 看了MV 以后超级喜欢.) 有时候点其他网站也能走到YouTube. 不过不会主动上 YouTube 首页看.

Craigslist: 找二手车, 电脑家具什么的. 一般通过 Yahoo! Pipe 去访问, 不怎么直接上首页.


第三类: 偶尔用用的, 按需使用的.

Linkedin: 每周上一次, 找找同学, 朋友. 看看猎头找啥人, 一些公司的用人需求是啥, 投资人在找什么项目等等 这些.

Amazon: 买书的时候上. 英文的技术书评比豆瓣要好一些, 个人觉得.

Flickr/Picasa Web: 没啥说的, 贴照片的.

和AW对比一下发现, 可能是因为在国外的原因, 国内的豆瓣, 抓虾使用得都不算太多, 至少不会想着天天上去看看. 若邻也曾用过, 不过圈子不在那里. 校内我没有南京大学的邮箱, 所以连帐号都没有. 南大的小百合倒是天天上, 不过只是潜水, 偶尔回答数学问题, 算不得使用了. 不太习惯使用 Twitter, 觉得很烦, 所以饭否这些从来没用过. 其他有名的BBS 论坛如猫扑天涯从来没有用过. 到现在都不会用 Cterm 上水木清华. MITBBS(海外华人最大集散地) 今天也才第二次上. V2EX 和豆瓣小组确实很2.0, 可惜没认识的人在上面灌水, 所以也几乎没怎么上过了. 全球火爆的 MySpaces 只有一个注册了从没用过的帐号, 因为身边没人用. 百度倒是有我的一个同名贴吧, 不过不会因此就泡在贴吧灌水, 因为周围没人陪着灌. 可见, 一个人的圈子决定了使用怎样的网站(反过来也决定了未来的圈子).

Jul 24, 2007 - 我号召中国互联网全体抵制中国缘

Comments

中国缘是我见到的流氓的下确界--没比他更流氓的了. 这样的公司在中国互联网, 只能是让中国的互联网更加黑暗, 让中国的网民更加受害. 中国缘网站盗取 MSN 隐私, 建立互不信任的社区以及涉嫌欺骗倒卖虚假原始股的行为已经极大的伤害了整个互联网的运行环境.

A. 中国缘网站盗取MSN 客户资料. 各位只要搜 中国缘 + 流氓, 就知道此恶劣行径已经人人喊打. 见过利用联系人传播的病毒, 见过赖着不走的流氓软件, 还真没见过盗取联系人拉用户的. 请问这样的SNS网站对整个SNS行业究竟该有有多大的破坏? 还有人敢用MSN么, 还有人敢注册SNS帐号么? 用户从一开始就是被硬生生的拉进一个貌似好多朋友都在其实每个人都是被骗的网站, 这样的SNS又何谈凝聚力? 中国缘这样的流氓拉客手段, 已经是近乎抢劫了. 抢劫还不够, 还要连着九族抢. 如此SNS, 值得国内所有做SNS的公司狠批.

B. 所谓的倒卖原始股到NASDAQ上市, 完全就是非法集资和坑蒙拐骗. 有点经济常识的人都知道, IPO前的原始股是不可能流通认购的. 如果这样搞, 投资人靠什么受益呢. 我可以 99% 确定, 中国缘钱要烧光了, 投资人(有没有投资人我都怀疑)也不投钱了, 就想出非法集资的主意了. 国家才抓了一个带头大哥,应该不缺下一个.至于NASDAQ,少吹牛了,中国要上NASDAQ的每年数来数去就几个, 轮到你中国缘的时候, Windows 都要变 3000了. 你们还是在家好好准备着流氓插件升级吧. 流氓恒久远,注意永流传!

C.中国缘网站号称由海归创业. 打着华人白领高端交友的旗号, 用做互联网2.0为名, 行极其下作流氓之能事. 如果不把这个网站揭出来狠批, 就会真的如王小峰DV中所说, 很多人就会相信搞互联网和 IT 的都是坑蒙拐骗. 这些坏的印象, 对于正在互联网进行创业的创业者, 对于整个中国互联网, 未来都是非常严重的摧残. 我们的互联网不需要这种流氓公司, 也应该主动踢出这种害群之马.

如果您是学生, 需要社区感, 可以去校内或者 Facebook, 如果您是白领, 可以使用Linkedin 或者若邻. 如果您需要交友, 特别是国际交友, 可以使用 AsianFriendFinder 和未名交友. 如果您需要一个新的IM, 可以使用 Google Talk. 如果您需要冒泡, 可以选择 Twitter 和饭否这些. 如果您想投资一夜暴富, 郑重的告诉你, 投中国缘的原始股没前途. (况且, 金融上说, 那个原始股本来就是违规操作, 所以, 连投资都算不上).

我相信, 流氓网站总会歇菜, 耍流氓坑蒙拐骗的某些傻叉海归总会被法律制裁. 但在此之前, 不妨号召广大网友, 一起抵制中国缘. 我们需要一个良性的互联网,我们创业者需要正面的形象.这种老鼠屎和落水狗,不踢出打死是不行的. 虽然我一己之力绵薄, 但为了若干年后我们这一般有理想的人从事的事业不被人认为是坑蒙拐骗, 只能大声疾呼. 我们信仰互联网, 就不能允许这些流氓强奸互联网.

附: 各位网友随意转载, 修改本文和张贴. 本篇连CC都不要了, 商业媒体自由免费转载. 本人从来不惧怕和流氓战斗.

Jul 24, 2007 - 点名凑10条

Comments

Solrex 的点名.

1, 你的专业是什么,她是你最喜欢的么?如果不是,那么你最喜欢的专业是什么?

我原来的专业是信息与计算科学(计算数学). 现在的专业是 Computer Science, 具体说是 Artificial Intelligence and Nonlinear Programming. 可能最喜欢的就是计算机科学吧. 对于数学, 理论物理这些都有兴趣, 但是谈不上热爱, 只是作为一个基础罢了. 其他文学艺术谈不上喜欢的专业了. 只能算业余爱好.

2, 在你最喜欢的专业里请你列举出至少五位大牛, 其中一位是这个专业奠基者, 另外一位是你最最崇拜的.

CS 图灵奖每个都是大牛了.

计算机和人工智能最早的奠基者应该最早应该算莱布尼兹 (Leibniz), 他提出能否任意表达一个命题, 能否有一个过程任意证明一个命题. 这个问题包含了知识表示, 本体论和通用计算机器的思想.

现代计算机科学奠基人是图灵(Alan Turing) 和 Alonzo Church, 一个提出图灵机, 一个提出递归函数演算. 图灵还定义了什么是智能, 也就是著名的图灵测试.

D. E. Knuth 显然是算法分析的奠基人, 也是我最崇拜的.

John von Neumann 是第一台计算机 ENIAC 的制造者, 而且及其聪明. 很崇拜他, 属于可远观不可企及的.

3, 你觉得在这个专业里你能做出让自己满意的贡献么?说说你如此回答的理由?

目前尚不清楚吧, 只有踏踏实实慢慢做了. 老板要求我能解决点大问题, 有用的问题, 而不是琐碎的问题. 所以, 路漫漫, 很难说. 我想只要努力了就不后悔,

4,你满意的贡献能够具体描述给我看看么?

希望某天, 有一个公共的网站, 大家可以上去, 提交自己的NLP 问题, 后台的超级计算机能够很好很快的解出来.

在 AI 那边, 希望能让传统的组合爆炸的算法快一点, 能在合理的时间中解出来吧.

5,就你的这个专业,全球那一所学校是最强的?

NLP 主要就集中在芝加哥地区, 西北大学, 普渡大学, 芝加哥大学, UIUC, 威斯康星和阿岗国家实验室都是顶极的.

英国的剑桥也很强. Top 1 我还的确不知道.

AI 我们这个分支, 这几年好像我们老板做得最好吧. 不过这个分支我读的论文太少了, 主要是一个师兄在做.

6,你认为从事此学科的研究需要哪些方面的能力?你自己具备几条?

数学第一条吧. 还行

计算机编程第二条, 尚可

独立的读Paper 想点子的能力: 比较差

灵感: 目前没有.

耐性: 还行, 能坐下来.

7,希望自己的老婆或者丈夫也是从事这方面研究的么?

我自己其实也不是以研究为终身事业的吧. 至于未来的另一半, 随便她做什么了, 没什么希望不希望的.

8,人类 历史 长河中哪位智者是你最最高山仰止的?

好难回答呀. 比较喜欢庄子.

9,你的人生哲学

大拙就是大巧, 所以用点笨方法, 往往就是意想不到的捷径.

10.每天你花多长时间在你的学科上,分别是做什么用的?我最最感兴趣的是你花在读论文上的时间

10个小时以上. 主要是编程, 得要4-5个小时, 因为目前系统比较大. 其次读论文看书吧, 每天2-3小时. 不过我这个人比较闲不住, 编程会间杂着会上上网看看Blog. 我读论文比较杂, 而且交叉阅读, 跳来跳去, 所以其实有效阅读时间也不算多. 还有2-3个小时我会学点计算机技术, 比如一门新语言, 一个新框架这些. 这些算是学科上的时间, 但实际上是和研究不怎么相关的了.

接着, 恩, 学术圈的, 就点 Logpie 和 Gookbaby.