SearchVoat Downloads

submitted by ruck_feddit to whatever, May 26, 2022 15:34:49 (+10/-1)

It would be beneficial for us to have a basic tutorial on how to use the data from SVF ourselves. Not a thousand-step tutorial. Not a vague "build yourself a database and query it" tutorial. Give us the basic steps necessary to get the job done. Someone has to have an easier way for a normal non-IT guy than what I'm doing.

I haven't seen a post covering this yet. Sorry if I missed it, but all the conversations have fallen into one of the two examples I listed above.


15 comments


[ - ] Rotteuxx 7 points, May 26, 2022 17:13:10 (+7/-0)

Where the fuck is Virge with his paper copy of the internet when you need him.

[ - ] RMGoetbbels 2 points, May 26, 2022 17:35:15 (+2/-0)

This is why I miss you.

[ - ] system 5 points, May 26, 2022 15:48:22 (+5/-0)

I tried to download it. It was going at like 250kbs so I gave up. Has anyone actually downloaded it?

[ - ] chrimony 3 points, May 26, 2022 19:28:33 (+3/-0)

[ - ] ruck_feddit [op] 0 points, May 26, 2022 21:42:12 (+0/-0)

Thanks.

[ - ] deleted 2 points, May 26, 2022 18:40:29 (+2/-0)

deleted

[ - ] ruck_feddit [op] 1 point, May 26, 2022 16:09:58 (+1/-0)

As far as I know, all the files are still downloading on my end.

[ - ] Wahaha 1 point, May 26, 2022 17:22:43 (+1/-0)

Halfway through, the download just died. Tried a couple of times, but eventually gave up.

[ - ] ParnellsUprising 0 points, May 26, 2022 22:36:49 (+0/-0)

Downloading it now.

The last 2 items left are comments & submissions. The 14 GB thumbnails file does not exist, and I probably would not have wasted my time downloading it anyway, unless it was a torrent.

From the code that I have looked at, it is a bit of a mess, so I'm not even sure I will spend any time on that (understand, I am not knocking searchvoat, just stating the obvious: this has been developed over years as a fun project which, looking at the code, is probably not that fun anymore).

I can most likely do something with the SQL, and perhaps even give it some relational integrity to make it a bit more functional. We shall see, though, as I really have not delved into the submissions and comments yet.

As for OP's wish, there is no way that will happen.

I'll see if the DB can be cleaned up in a more relational manner, but at this point it is a bit of a mess.

[ - ] GloryBeckons 0 points, May 26, 2022 23:01:00 (+0/-0)

You can get the thumbnails here:

https://searchvoat.co/gz/thumbnails.tgz
https://archive.org/download/searchvoat.co/thumbnails.tgz

The link in the OP over there just used the wrong extension.
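
Since a few downloads in this thread died partway, a resumable fetch might save some pain. This is only a rough sketch: it assumes the server honours HTTP Range requests (not confirmed here), and the function names are made up for illustration. The URL is the archive.org one above.

```python
import os
import shutil
import urllib.request

# URL copied from the links above.
THUMBNAILS_URL = "https://archive.org/download/searchvoat.co/thumbnails.tgz"


def build_resume_request(url, dest):
    """Return (request, offset), asking only for bytes not yet saved in dest."""
    offset = os.path.getsize(dest) if os.path.exists(dest) else 0
    request = urllib.request.Request(url)
    if offset:
        # Standard HTTP Range header: "give me everything from this byte on".
        request.add_header("Range", f"bytes={offset}-")
    return request, offset


def download(url, dest):
    """Download url to dest, continuing a partial file when the server allows it."""
    request, offset = build_resume_request(url, dest)
    with urllib.request.urlopen(request) as response:
        # 206 means the Range header was honoured. On a plain 200 the server is
        # sending the whole file again, so overwrite instead of appending.
        mode = "ab" if offset and response.status == 206 else "wb"
        with open(dest, mode) as out:
            shutil.copyfileobj(response, out)


# Usage (large download, so left commented out):
# download(THUMBNAILS_URL, "thumbnails.tgz")
```

If it dies again, just run `download` again with the same file name and it should pick up where it stopped. If the file is already complete, the server will most likely answer with an error (416), which `urlopen` raises as an `HTTPError`.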

[ - ] heygeorge 1 point, May 26, 2022 23:22:02 (+1/-0)

Learn to code!

[ - ] qwop 1 point, May 26, 2022 21:44:37 (+1/-0)*

There is no easy way. It's a raw dump of the website. It includes the PHP code and the SQL database files.

The simplest way to query the data is to install MariaDB as the back end and, if you are on Windows, HeidiSQL as the front end. Then load submissions.sql and comments.sql into the DB and you can run queries.

If you have a normal IQ and are able to use search engines to figure out basic computer problems, you should be able to figure it out. It's not entry-level difficulty, but not impossibly complicated either.

But getting a fully working website (or even simple point-and-click) requires IT knowledge (PHP, SQL, Apache, and the Unix/Linux command line).
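
The MariaDB route could be scripted roughly like this. It is a sketch under assumptions, not a tested recipe for this dump: it assumes the `mariadb` command-line client is on your PATH (older installs call it `mysql`), that you log in as `root`, and that the dumps do not create a database of their own. The database name `searchvoat` and the helper names are made up for illustration.

```python
import subprocess

DATABASE = "searchvoat"  # hypothetical name; pick whatever you like
DUMPS = ["submissions.sql", "comments.sql"]  # file names mentioned above


def client_argv(user="root", client="mariadb", database=None, execute=None):
    """Build the argument list for the MariaDB command-line client."""
    # A bare --password makes the client prompt instead of taking it inline.
    argv = [client, f"--user={user}", "--password"]
    if execute is not None:
        argv.append(f"--execute={execute}")
    if database is not None:
        argv.append(database)
    return argv


def load_dumps(dumps=DUMPS, database=DATABASE):
    """Create the database, feed each dump to the client, then list the tables."""
    create = f"CREATE DATABASE IF NOT EXISTS {database}"
    subprocess.run(client_argv(execute=create), check=True)
    for path in dumps:
        # Same effect as typing: mariadb --user=root --password searchvoat < path
        with open(path, "rb") as dump:
            subprocess.run(client_argv(database=database), stdin=dump, check=True)
    # The table and column names inside the dumps are not known here,
    # so list them first before writing any real queries.
    subprocess.run(client_argv(database=database, execute="SHOW TABLES"), check=True)
```

Call `load_dumps()` from the folder holding the two files; it will ask for the password at each step, and large dumps can take a long while. Once the tables show up, HeidiSQL can connect to the same database for the actual querying.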

[ - ] lord_nougat 1 point, May 26, 2022 16:32:32 (+1/-0)

If I ran a full recursive wget on the site would I get all the things? Perhaps I should try.

[ - ] chrimony 1 point, May 26, 2022 19:29:32 (+1/-0)

You're just going to make the site slower. See my reply to system: the data is up on archive.org now.

[ - ] lord_nougat 0 points, May 26, 2022 20:04:23 (+0/-0)

Oh, neat! I figured as much, so I didn't even try.