法学家 ›› 2026, Vol. 0 ›› Issue (3): 77-90.

• 主题研讨二:数字法学研究的多维视角 • 上一篇    下一篇

论公开个人数据爬取行为的侵权责任

沈健州   

  • 出版日期:2026-05-15 发布日期:2026-05-25
  • 作者简介:*沈健州,法学博士,上海交通大学凯原法学院副教授。
  • 基金资助:
    本文系国家社科基金青年项目“数据要素交易规则体系研究”(22CFX027)的阶段性成果。

On the Tort Liability of Scraping Publicly Available Personal Data

SHEN Jianzhou   

  • Online:2026-05-15 Published:2026-05-25
  • About author:Shen Jianzhou, Ph.D. in Law, Associate Professor of Koguan School of Law of Shanghai Jiao Tong University.

摘要: 数据爬取的侵权责任,本质上是权益保护与行为自由之平衡这一侵权法元问题在数字经济下的时代缩影。公开个人数据同时承载数据财产权益与个人信息权益,数据爬取行为可能造成多重损害后果。于数据财产权益,数据爬取对平台的数据利用造成竞争损害或干扰均可构成侵权,但两者在损害界定、过错判断和责任方式上均存在差异;在可携带权范围内,个人授权可使数据爬取造成的竞争损害正当化,但并不影响干扰型侵权的构成。于个人信息权益,爬取方对超出公开个人信息合理处理范围的数据爬取损害承担过错推定责任,且不能以遵守机器人协议证明其无过错,其虽可以“履行费用过高”为由不予删除数据,但仍需进行损害赔偿;若平台未尽到个人信息保护义务,应在其过错范围内承担补充责任,并可向爬取方追偿。

关键词: 数据爬取, 公开数据, 个人信息, 数据财产权, 数据流通

Abstract: The tort liability associated with data scraping essentially epitomizes the fundamental tort law dilemma of balancing rights protection with behavioral freedom within the digital economy.As publicly available personal data simultaneously embodies data property interests and personal information rights, scraping activities can precipitate multiple forms of damage.In terms of data property interests, scraping that inflicts either competitive harm or operational interference upon a platform's data utilization constitutes a tort; nevertheless, these two types of infringement differ significantly in damage assessment, fault determination, and methods of liability.Within the ambit of the right to data portability, individual authorization can legitimize the competitive damage induced by scraping, yet it does not preclude the constitution of an interference-based tort.Concerning personal information rights, the scraping party bears presumed fault liability for damages arising from scraping that exceeds the reasonable scope of processing publicly available personal data, and compliance with the Robots Exclusion Protocol cannot serve as an exculpatory defense.Although the scraper may invoke “excessive performance costs” as grounds to refuse data deletion, it remains liable for compensatory damages.Furthermore, if the source platform breaches its personal information protection obligations, it shall assume supplementary liability commensurate with its fault, whilst retaining the right to seek recourse against the scraping party.

Key words: Data Scraping, Public Available Data, Personal Information, Data Property Interests, Data Circulation