一位爬虫工程师写的爬虫!把估值175亿的马蜂窝给捅了!
碰巧这个团队在美国学的都是数据分析,一怒之下决定训练一个模型,用于筛选餐饮评论的水军,恰巧马蜂窝成了他们的练手对象,没想到不爬则矣,一爬把马蜂窝给捅了!
这篇文章中表示:“在马蜂窝网站上,发现了7454个抄袭账号,平均每个人从携程、艺龙、美团、Agoda、Yelp上抄袭搬运了数千条点评,合计抄袭572万条餐饮点评,1221万条酒店点评,占到官网声称总点评数的85%。”
文章中还列举了几项抄袭石锤:
还有的抄袭账号自相矛盾,性别忽男忽女,甚至有些直接调用Google翻译接口
马蜂窝回应
22日早上,马蜂窝也随即发布了声明,表示会对涉嫌虚假的信息,进行查处。
另据最新消息,针对自媒体报道的马蜂窝数据造假一事,马蜂窝已向北京市朝阳区人民法院提起诉讼,称乎睿数据侵犯名誉权,目前已获立案。
23日,马蜂窝CEO陈罡也针对此事作出回应:马蜂窝在餐饮等点评数据方面存在部分问题,但远没有外界所表述的那么夸大。目前已经重新梳理工作流程,堵住漏洞。
网友怎么说?
目前,这件事已经在各大论坛都议论纷纷:
私信小编007即可获取惊喜大礼包一份哦!
这件事总算让我见识到程序员的厉害之处了:
‘水军’和‘爬虫’一直都存在于互联网行业,因为流量和数据对于一个互联网企业不可或缺,关于数据纠纷问题在互联网更是屡见不鲜,目前此事真相还未明了,我们暂时不予置评。
但通过这件事情告诉我们, 惹谁也别惹程序员 !尤其是有正义感又闲的技术宅。
{"weixin":{"label":"微信","name":"weixin","selected":true,"value":true,"sortid":"1","shareid":"weixin","sharetitle":"分享到微信","event":"shareToWeiXin","lang":"shareWeb_WeiXin"},"copy":{"label":"复制网址","name":"copy","selected":true,"value":true,"sortid":"2","shareid":"copy","sharetitle":"复制网址","event":"copy_url","lang":"shareWeb_Copy"},"qq":{"label":"QQ好友","name":"qq","selected":true,"value":false,"sortid":"1","shareid":"qq","sharetitle":"分享到QQ","event":"shareToQQ","lang":"shareWeb_QQ"},"sina_weibo":{"label":"新浪微博","name":"sina_weibo","selected":true,"value":true,"sortid":"4","shareid":"sina_weibo","sharetitle":"分享到新浪微博","event":"shareToSinaWB","lang":"shareWeb_SinaWeiBo"},"qq_zone":{"label":"QQ空间","name":"qq_zone","selected":true,"value":true,"sortid":"5","shareid":"qq_zone","sharetitle":"分享到QQ空间","event":"shareToQzone","lang":"shareWeb_QQZone"},"renren":{"label":"人人网","name":"renren","selected":true,"value":true,"sortid":"7","shareid":"renren","sharetitle":"分享到人人网","event":"shareToRenren","lang":"shareWeb_RenRen"},"douban":{"label":"豆瓣网","name":"douban","selected":true,"value":true,"sortid":"8","shareid":"douban","sharetitle":"分享到豆瓣网","event":"shareToDouban","lang":"shareWeb_DouBan"},"baidu_tieba":{"label":"百度贴吧","name":"baidu_tieba","selected":true,"value":true,"sortid":"10","shareid":"baidu_tieba","sharetitle":"分享到百度贴吧","event":"shareToTieba","lang":"shareWeb_TieBa"},"Facebook":{"label":"Facebook","name":"Facebook","selected":true,"value":true,"sortid":"11","shareid":"Facebook","sharetitle":"分享到FaceBook","event":"shareToFacebook","lang":"shareWeb_Facebook"},"Twitter":{"label":"Twitter","name":"Twitter","selected":true,"value":true,"sortid":"12","shareid":"Twitter","sharetitle":"分享到Twitter","event":"shareToTwitter","lang":"shareWeb_Twitter"},"LinkedIn":{"label":"LinkedIn","name":"LinkedIn","selected":true,"value":true,"sortid":"13","shareid":"LinkedIn","sharetitle":"分享到linkedIn","event":"shareToLinkedin","lang":"shareWeb_Linkedin"},"whatsapp":{"label":"whatsapp","name":"whatsapp","selected":true,"value":true,"sortid":"15","shareid":"whatsapp","sharetitle":"分享到whatsapp","event":"shareToWhatsapp","lang":"shareWeb_whatsapp"},"line":{"label":"line","name":"line","selected":true,"value":true,"sortid":"15","shareid":"line","sharetitle":"分享到line","event":"shareToLine","lang":"shareWeb_line"},"qq_weibo":{"label":"腾讯微博","name":"qq_weibo","selected":true,"value":true,"sortid":"3","shareid":"qq_weibo","sharetitle":"分享到腾讯微博","event":"shareToQQwb","lang":"shareWeb_QQWeiBo"},"peopleBlog":{"label":"人民微博","name":"propleBlog","selected":true,"value":true,"sortid":"14","shareid":"propleBlog","sharetitle":"分享到人民微博","event":"shareToPeopleBlog","lang":"shareWeb_peopleBlog"}}