Federal Court makes clear: Website scraping is illegal

As a general rule, we all know it is not a good idea to scrape content from a website, yet some companies persist in this behaviour contrary to law and best practice.

Lisa R. Lifshitz

On April 15, Justice Richard Southcott of the Federal Court of Canada issued a permanent injunction against Mongohouse.com aka MongoHouse.ca, Sheng Lan Mai aka Maxim Mai, Kun Xu, 2565707 Ontario Inc. and Jing Liu (collectively, the Mongohouse defendants) in a stinging rebuke against web/data scrapers, upholding the copyright of the Toronto Real Estate Board in its internal multiple-listings service systems and database.

 

The Toronto Real Estate Board, a not-for-profit corporation representing more than 50,000 realtors across the Greater Toronto Area, is the creator, author and custodian of the TREB Multiple Listing Service®. This co-operative service provides more than 100 online services, including access to active real estate listings, detailed property descriptions, archival information, photography, neighbourhood descriptions and other curated information related to real property (including purchase prices), and is available for a fee to its members and to the members of its partner real estate boards. TREB has reciprocal agreements with other real estate boards across Canada and is affiliated with the Canadian Real Estate Association, the registered owner of the multiple listing service registered trademark and the MLS design.

 

The Toronto Real Estate Board’s statement of claim, filed on Sept. 12, 2018, alleged that Mongohouse.com’s entire existence (and business) was based upon its unauthorized access of the TREB MLS® system and infringing use and distribution of TREB MLS® information for a commercial purpose, namely monetizing TREB’s content for Mongohouse’s and its owners’ financial benefit. 

 

The Mongohouse defendants were accused of deactivating, bypassing and circumventing the various technological protection measures actively deployed by TREB to limit and restrict access to the TREB MLS® system and the MLS® information, in violation of various confidentiality and copyright protection obligations of TREB’s listing agreements, its authorized user agreements, statutory obligations and third-party licence agreements with information-supply partners including Teranet Inc. and the Municipal Property Assessment Corporation.   

 

The claim also named the software engineer who was the alleged author of the software used to crack TREB’s technological protection measures to gain access to the TREB MLS® information and display it on Mongohouse. 

 

Various U.S. and Canadian internet service providers were also originally named in the claim for the purposes of obtaining injunctive relief requiring them to comply with take-down notices, to cease hosting Mongohouse and to provide all information regarding the identity of the current and past site owners and operators.

 

Currently, TREB members can access the TREB MLS® system only by providing two levels of credential authorization, authenticating their user names and passwords and entering a PIN. Members are also required to abide by the TREB MLS® rules and policies, which require them to agree to the TREB authorized user agreement (AUA) terms and conditions.

 

Section four of the AUA explicitly prohibits authorized users from using, copying, reproducing or exploiting “the MLS Database contrary to various By-Laws, the MLS Rules and MLS Policies” or Ontario’s Real Estate and Business Brokers Act. 

 

Authorized users are also expressly forbidden, under s. 7(c) of the AUA, to “decompile, reverse-engineer, disable, modify, analyze or create derivative works of the software, MLS Database or BRS Database.” TREB presently uses, as described in the claim, a variety of software applications and protection measures to actively prevent third parties from gaining unauthorized access to, downloading or streaming the TREB MLS® information, including antivirus software, third-party anti-scraping services (ongoing monitoring and check/validation processing), hosting-service firewalls and intrusion-detection systems, anti-malware software and detection systems and encrypted token authentication protocols.

 

Mongohouse stood accused of subverting the technological protection measures (TPMs) put in place by TREB and populating its website, on a daily basis and at no charge, with content that it had copied from the TREB MLS® system, including new property listings, prices, photography and detailed property descriptions.

 

Mongohouse used maps with indicators to show new property listings and recently sold properties to its 50,000 registered users, in a form, content and layout similar to those provided by TREB, and also offered advertising space to real estate-related businesses in competition with TREB.

 

The claim alleged that the information contained on the Mongohouse site could only have been available from TREB’s MLS® system, and TREB asserted that it had actually verified this fact by placing certain unique information in the TREB MLS® system for access by members (and restricting how this information could be displayed). TREB subsequently found that the information appeared on the Mongohouse website within 24 to 48 hours of its initial placement on the TREB site, which TREB presented as proof that the content was actually being scraped from the TREB MLS® system.

 

TREB argued that the TREB MLS® system, including its design, layout, presentation, manner of access and form/selection of information as well as the information contained therein, is proprietary to TREB (even though not all of the information is exclusive to it) and that it spent millions of dollars annually on the upkeep, maintenance and support of the online service for its members. Moreover, TREB claimed that the unique collection of information compiled, organized and maintained by TREB in the TREB MLS® system is a copyrightable work (namely a compilation that is original, independently created and organized and that requires a great degree of skill, judgment and labour in its overall selection and arrangement) and that it also contains confidential and proprietary information. Accordingly, as the author and content creator, TREB holds the copyright interest associated with the TREB MLS® information (and associated copyrights as defined in the Copyright Act) and, therefore, only TREB has the right to authorize its use, copying, streaming, distribution or dissemination. Mongohouse, TREB alleged, was using the illegally obtained TREB MLS® information to pass itself off as offering the same services as TREB (without users having to pay the associated fees), infringing TREB’s copyrights and exclusive rights in order to profit from advertising revenue, among other things.

 

In addition to an interlocutory and permanent injunction against Mongohouse and the defendants, TREB sought: damages for each breach and infringement of TREB’s proprietary information and copyrights in the amount of $100,000; an accounting as to the receipts by each defendant arising from such infringement of TREB’s copyrights and breaches of confidential information; damages in the amount of $2,000,000 under the Trade-Marks Act for infringement, passing off, confusion and loss of reputation; pre- and post-judgment interest as provided by law; and TREB’s costs on a solicitor and client basis.

 

By Oct. 1, 2018, not long after the filing of the claim, Mongohouse had taken down its site and it remained offline. However, on Oct. 30, 2018, the Mongohouse defendants responded with their own spirited, 73-page statement of defence and counterclaim denying virtually every allegation in the claim and counterclaiming for lost revenue. The Federal Court was not convinced and, with the consent of the Mongohouse defendants, definitively found and declared in its order that: as the owner of the TREB MLS® listing services and TREB MLS® database, TREB is the owner of the associated copyrights pursuant to the act; the unauthorized copying, data scraping, downloading, display, distribution, access to make available for distribution and streaming for public display of any TREB MLS® data is a breach of TREB’s proprietary rights and copyrights associated with the TREB MLS® service; and any access to the TREB MLS® system other than as authorized by TREB using any means to avoid, bypass, deactivate, impair or to circumvent in any manner a TPM is a breach of s. 41 of the act and is an infringement of TREB’s rights.

 

The court further granted a permanent injunction against the Mongohouse defendants, restraining each of them (including their officers, directors, employees, agents, assigns or any person acting under their instructions) from: accessing, copying, data scraping, downloading, displaying, distributing, accessing to make available for distribution and streaming for public display any TREB MLS® data or information, unless expressly authorized in writing by TREB; using any method to avoid, bypass, remove, deactivate, impair or circumvent any technological protection measures put in place to protect or limit access to the TREB MLS® system and data; operating, conducting or having any involvement in or providing or offering means to access the TREB MLS® system or assisting in the collection or display of the TREB MLS® data, unless expressly authorized in writing by TREB; and maintaining, operating, implementing, marketing or having any involvement with any business or enterprise used in any manner or form for the purpose of providing or offering a means to access the TREB MLS® system via any means or method, including any internet-based technology, without the express written permission of TREB.

 

The action was otherwise dismissed without costs, and Mongohouse’s counterclaim was also dismissed without costs.

 

What are the takeaways of this decision? 

 

The Federal Court has clearly laid to rest any question regarding the legality of web scraping. The bottom line for prospective digital companies is: engaging in unauthorized copying, data scraping, downloading and distributing third-party content without the consent of the original rights holders is illegal under the act; and web scraping is not the basis of a business or revenue model that is likely to be profitable or to have staying power in the longer term.