- I tried to copy the text. Couldn’t.
- I tried to use Reader Mode. Couldn’t.
- I tried to use Firefox’s webpage screenshot feature. Couldn’t.
- I tried to scrape it with a home-made script. Couldn’t.
- I tried to scrape it with an online LLM. Couldn’t.
- I tried to find the text in Archive.org. Couldn’t.
They want you to see that they ticked the boxes as a responsible company (“Ah, yes. A formal privacy policy. Ooh. Such a responsible company.”), but they don’t want you to hold them accountable for their words, because they want no registry of what they’ve promised!
Because I had a bit of time. It’s not perfect but ehh. Here is the markdown: itsdart-privacy-policy
An “AI” company that doesn’t want their content scraped? Say it ain’t so! /s
I just used Shottr to take a scrolling screenshot and captured the whole policy. I could OCR it, but I have no idea what to do with it from there…
Post it here
It’s too long for a comment but I made a Markdown copy in a post here: https://feddit.org/post/10009622
It’s pretty nasty: the policy loads from a third-party domain (termly.io) that uBO blocks, so I had to disable uBO for it to load at all. After that, it loads into an iframe with a src of https://app.termly.io/policy-viewer/iframe-content.html?policyUUID=97db19c6-7afc-444b-bd38-9a2ac329fcac which you can load directly and print. It still has all the `user-select: none` CSS settings applied, so you can’t highlight / copy / paste, but that’s easy enough to remove in the inspector.
Crazy. Though I suspect the copy protection is done by the third party, Termly, which hosts the policy.
To select the text (in Firefox), first right-click the policy and choose This Frame > Show Only This Frame. Press F12, expand <head>, find the second <style> block, right-click it and choose Delete Node.
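If you would rather work on a saved copy of the frame than in the inspector, here is a minimal sketch of the same idea. It assumes you have saved the iframe page to a local file (the name `policy.html` in the usage note is hypothetical), and it strips `user-select: none` declarations with a plain regex rather than a real CSS parser. I have not checked it against Termly’s actual stylesheet, so treat it as a starting point.

```python
import re

# Matches `user-select: none` declarations, with or without a vendor prefix
# (e.g. -webkit-user-select, -moz-user-select), an optional `!important`,
# and an optional trailing semicolon.
USER_SELECT_RE = re.compile(
    r"(?:-[a-z]+-)?user-select\s*:\s*none\s*(?:!important\s*)?;?",
    re.IGNORECASE,
)


def strip_user_select(html: str) -> str:
    """Return the markup with every `user-select: none` declaration removed."""
    return USER_SELECT_RE.sub("", html)
```

To use it, read the saved file, pass the contents through `strip_user_select`, and write the result to a new file (for example `policy-selectable.html`, also a made-up name) that you open in the browser. Other rules in the stylesheet are left alone.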
There are several ways around that: (1) install a browser extension that disables whatever block the page uses, (2) open the developer tools in a desktop browser and delete whatever JavaScript is preventing it, or (3) possibly print to PDF. Someone else suggested screenshot + OCR, etc.
Thanks for the suggestions. Had time to try to print it. Didn’t work. I’ll try the other options later.
What if they obfuscate the JavaScript?
The selection block is probably done in JavaScript, so unless that script is also required to load the words, disabling JavaScript makes the text selectable. If JavaScript is what loads the words, then blocking it means the page won’t load. But the loading script and the blocking script are typically separate, so add-ons can selectively block only the scripts detected to interfere with things like selection or right-click. If they obfuscate the JavaScript to the point where the text loading and the blocking are combined, then one of the other options I noted above will probably be easiest, or going to the developer tools and copying the text from the inspector.
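For the case where the words are already in the page’s HTML (an assumption; it won’t help if a script fetches the text after load), the “copy it from the markup” route can be sketched with Python’s standard `html.parser`. A parser never sees CSS or selection-blocking scripts as anything but text to skip, so those tricks have no effect on it. The class and function names here are my own, not from any tool mentioned above.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects text nodes, ignoring anything inside <script> or <style>."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0  # > 0 while inside a <script> or <style> element
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if self._skip_depth == 0 and text:
            self.chunks.append(text)


def extract_text(html: str) -> str:
    """Return the visible text of an HTML document, one text node per line."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return "\n".join(parser.chunks)
```

Feed it the contents of a saved copy of the page and you get the text back with one node per line. It makes no attempt to preserve headings or lists, so the output needs tidying by hand.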