+Larry Posted October 26, 2019 Share Posted October 26, 2019 There are several threads here that have some really valuable technical information that I'd like to have handy. I currently (sort of) have them organized by copying the links, and the links are organized by broad topic. Is there a way to copy the entire contents of a thread (and paste into a WP)? I know that I can copy/paste each individual post, but that is a real drag. Any thoughts? -Larry 1 Quote Link to comment Share on other sites More sharing options...
flashjazzcat Posted October 26, 2019 Share Posted October 26, 2019 33 minutes ago, Larry said: There are several threads here that have some really valuable technical information that I'd like to have handy. I currently (sort of) have them organized by copying the links, and the links are organized by broad topic. Is there a way to copy the entire contents of a thread (and paste into a WP)? I know that I can copy/paste each individual post, but that is a real drag. Any thoughts? There used to be a method of archiving PM threads as PDFs; I'm not sure if this functionality still exists, but I found it useful. Not sure either if this was ever possible with public threads, but such a facility would be handy. Might be worth posting some suggestions in the forum feedback sub-forum. 1 Quote Link to comment Share on other sites More sharing options...
Rybags Posted October 26, 2019 Share Posted October 26, 2019 First thought I had was "Reader View" from Firefox but it seems to be for narration and doesn't even look to work properly. Possibly other browsers might have something to offer, or maybe there's a browser extension that can do it. But one thing that could work is to just quote everything on a page then block copy it to your Word document. A bit dirty but might be sufficient. 1 Quote Link to comment Share on other sites More sharing options...
777ismyname Posted October 26, 2019 Share Posted October 26, 2019 In the past I’ve used HTTrack Website Copier for something similar. 1 Quote Link to comment Share on other sites More sharing options...
+Larry Posted October 26, 2019 Author Share Posted October 26, 2019 "But one thing that could work is to just quote everything on a page then block copy it to your Word document. A bit dirty but might be sufficient." That's a good idea! And that way, I can easily edit out OT replies. I didn't know you could do multiple quotes. Had not heard of HTTrack. I'll check that out. Quote Link to comment Share on other sites More sharing options...
dmsc Posted October 26, 2019 Share Posted October 26, 2019 Hi! 1 hour ago, Larry said: "But one thing that could work is to just quote everything on a page then block copy it to your Word document. A bit dirty but might be sufficient." That's a good idea! And that way, I can easily edit out OT replies. I didn't know you could do multiple quotes. Had not heard of HTTrack. I'll check that out. You could cleanup the HTML before copy or print the content, for example, if you type this in the JavaScript console in Chrome (CONTROL-SHIFT-J) or Firefox (CONTROL-SHIFT-K), you only get the posts and author names: h=''; document.querySelectorAll('article.cPost').forEach((n) => h=h+n.outerHTML); document.body.innerHTML = h; document.querySelectorAll('.ipsItemControls, ul.cAuthorPane_info').forEach((n)=>n.outerHTML=''); The above select from the page all "articles.cPost" elements, then makes a new page with only that content and then selects the ".ipsItemControls" (the "+" and "QUOTE" at the end of each post) and the "ul.cAuthorPane_info" (the photo and information of the authors) and removes that. The result is a much cleaner page to save or print. Note than in Firefox you have to first type "allow pasting" before you can actually past any code. Have Fun! Quote Link to comment Share on other sites More sharing options...
Mclaneinc Posted October 26, 2019 Share Posted October 26, 2019 HHTrack is a web site copier and very good but if you use it make sure you turn off external links otherwise it will copy any linked sites too... Quote Link to comment Share on other sites More sharing options...
777ismyname Posted October 26, 2019 Share Posted October 26, 2019 6 hours ago, Mclaneinc said: HHTrack is a web site copier and very good but if you use it make sure you turn off external links otherwise it will copy any linked sites too... ^^^ or limit the depth that it will crawl. I'm not sure that I've ever tried on the entire file, but it may be possible to download the thread with HHTrack and print to a PDF file? Quote Link to comment Share on other sites More sharing options...
Mclaneinc Posted October 27, 2019 Share Posted October 27, 2019 (edited) Yup, that is the phrase I could not remember, not sure the software will make a PDF but obviously its possible manually.. Tried to do Atarimania about 10yrs ago, damn robot protection on there One day Atari Frog will do an all in one zip file for the software...I'll keep moaning at him for it (and he will keep saying no) But I'll wear him down Edited October 27, 2019 by Mclaneinc Quote Link to comment Share on other sites More sharing options...
+CyranoJ Posted October 27, 2019 Share Posted October 27, 2019 Fireshot Quote Link to comment Share on other sites More sharing options...
Rybags Posted October 27, 2019 Share Posted October 27, 2019 The problem with Fireshot is it seems to be image based - I saved as PDF and not being able to copy bits of text hugely limits the usefullness. Quote Link to comment Share on other sites More sharing options...
Mclaneinc Posted October 27, 2019 Share Posted October 27, 2019 I use a thing on Chrome called Webscrapbook, pretty sure its on Firefox as well called Scrapbook.. Saves the page or portions thereof as html.. Quote Link to comment Share on other sites More sharing options...
Rybags Posted October 27, 2019 Share Posted October 27, 2019 (edited) Tried a couple of alternatives... I've got Foxit PDF Creator, it does a reasonable job of converting a saved page from Firefox (using page 2 of the C64 Keystone Kapers thread as a test case) - some rendering errors. Firefox extension "Save PDF" - white background and not great rendering but readble. There's online converters also, this one did a good job though since it's accessing the page without being logged in you're getting the default theme and possibly missing other things https://www.web2pdfconvert.com/ The other disadvantage is that it's really slow though saving then locally converting a page at a time would probably take just as long. Warning about online PDF converters - some dodgy ones around that have popups and nasties. Edited October 27, 2019 by Rybags Quote Link to comment Share on other sites More sharing options...
baktra Posted October 28, 2019 Share Posted October 28, 2019 Perhaps you can put together something using Selenium and HTMLUnit driver to harvest a topic. The trouble is writing it in a way that is resistant enough to changes. Of course, functionality integrated in the forum code would be great. But that costs development efforts and thus ... currency too. Quote Link to comment Share on other sites More sharing options...
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.