GuildWiki talk:Database dumps

Size
How about specifying the size of the dumps as well? All articles = 37.44MB, build articles = 5.62MB. -- Ab.Er.Rant (msg Aberrant80) 22:50, 4 May 2007 (CDT)
 * Shoot, I knew I forgot to copy something over! &mdash;Tanaric 00:06, 5 May 2007 (CDT)

Images
What about images, like maps? They don't seem to be in the dump. Do I use a modified Wikix program? That seems pretty server-hostile. 85.214.63.253 14:00, 6 May 2007 (CDT)


 * That would be moderately server-hostile, yes. Do it at night, if you need to. &mdash;Tanaric 19:10, 6 May 2007 (CDT)


 * OK, I'd like to get the images, also, but I don't want to abuse the server. I'd also like to get the images within my lifetime (there are well over 100000 images).  So, before I do anything, how about this?: between 12AM and 5:59AM, images get sucked down at an average rate of 1 per second.  Between 6AM and 11:59PM, images get sucked down at an average rate of 1 every 5 seconds.  With this, I'm guessing that it'll take 3-4 days to get all the images.  Note that Wikix (which I would not be using) is a lot more hostile than this, as it just sucks the images down as fast as it can (and, since it's designed to suck from wikipedia's massive farm, there's an option to do 16 parallel suckers ;-).  &mdash;MooMishka 02:53, 21 May 2007 (CDT)


 * Midnight to 6AM is best if you mean GMT-5 (CDT). Adjust them if you meant a different time zone. --Fyren 03:15, 21 May 2007 (CDT)
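
The throttling schedule proposed above can be sketched roughly as follows. This is only an illustration of the arithmetic, not the actual downloader: the function names are hypothetical, and the hours are assumed to be server-local (CDT, UTC-5), as suggested in the reply.

```python
from datetime import datetime, timedelta, timezone

# Assumption: the schedule is expressed in CDT (UTC-5), per the thread.
CDT = timezone(timedelta(hours=-5))


def delay_seconds(hour: int) -> float:
    """Average pause between image requests for a server-local hour (0-23)."""
    if not 0 <= hour <= 23:
        raise ValueError("hour must be in the range 0-23")
    # Midnight to 5:59AM: one image per second; otherwise one every 5 seconds.
    return 1.0 if hour < 6 else 5.0


def current_delay() -> float:
    """Pause to use right now, based on the current hour in CDT."""
    return delay_seconds(datetime.now(CDT).hour)


def images_per_day() -> int:
    """Number of images fetched in 24 hours under this schedule."""
    return sum(int(3600 / delay_seconds(h)) for h in range(24))


def days_needed(total_images: int) -> float:
    """Estimated days to fetch total_images at the scheduled rates."""
    return total_images / images_per_day()
```

Under these assumptions the schedule fetches 34560 images per day, so 100000 images would take a little under 3 days if nothing fails, which is in line with the 3-4 day guess above.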

Problem reading dump
Hmm, I can't seem to import the dump into my MediaWiki installation. I get the message 'Unknown import source type'. Has anyone succeeded in importing these dumps? Of course I extracted the XML files. They look OK at a glance. --Toxaris 06:05, 12 May 2007 (CDT)
 * That error doesn't mean the dump is bad; it means either that something is messed up with your MW install or that you're doing something wrong. The "import source" for Special:Import is supposed to be either a file upload or an interwiki transfer.  For example, if you view Special:Import on your wiki and look at the HTML for the page, you can see a hidden field in the form with name "source" and value "upload".  You can look at the code in includes/SpecialImport.php for where it checks the value against "upload" and "transwiki" and shows that error if it's neither.  --Fyren 06:28, 12 May 2007 (CDT)
 * I had no problem reading the XML data into MW 1.9.3. However, the standard method is slow (which is documented and known).  Other problems include (1) needing additional MW extensions (undocumented), and (2) images aren't included in the XML dumps.  I hate to say this, but you really do need advanced MW knowledge to deal with the XML.  192.26.10.2 05:24, 20 May 2007 (CDT)


 * We have no problem with this -- we are under no obligation to provide a tutorial on how to use these. That said, if anybody would like to write such a tutorial, that would be very helpful. Make a subpage with your proposed change and it will be merged upon admin approval. &mdash;Tanaric 19:25, 20 May 2007 (CDT)


 * No one said that you're under an obligation. He's just saying that, to use the XML right now, you need fairly advanced knowledge.  This is because there is no documentation or tutorial, and there are no programs (which neither you nor anyone else is obligated to provide, although, perhaps, someone will).  This is not a complaint -- merely an observation.  &mdash;MooMishka 03:07, 21 May 2007 (CDT)


 * meta:Data dumps: there's your documentation. The importDump.php method works fine with the GuildWiki dumps; the other methods listed on that page probably do too.
 * 192.26.10.2, 1) Special:Version (ParserFunctions is the only extension you need for the dump to work properly). 2) it's a database dump... --Dirigible 03:42, 21 May 2007 (CDT)
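
For anyone unsure what the importDump.php method looks like in practice, here is a minimal sketch that feeds an extracted XML dump to the script on standard input. It assumes `php` is on the PATH and that the script lives in the wiki's `maintenance` directory; the function names and any paths passed in are hypothetical.

```python
import subprocess
from pathlib import Path


def import_dump_command(mediawiki_dir: str) -> list:
    """Build the command line for MediaWiki's importDump.php maintenance script."""
    script = Path(mediawiki_dir) / "maintenance" / "importDump.php"
    return ["php", str(script)]


def run_import(mediawiki_dir: str, dump_path: str) -> None:
    """Feed an extracted XML dump to importDump.php on standard input."""
    with open(dump_path, "rb") as dump:
        # check=True raises CalledProcessError if the import exits non-zero.
        subprocess.run(import_dump_command(mediawiki_dir), stdin=dump, check=True)
```

A call such as `run_import("/path/to/wiki", "dump.xml")` would start the import, with both arguments replaced by your own locations. ParserFunctions still has to be installed separately, as noted above.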


 * While that certainly is "documentation", you're kinda proving my point that advanced knowledge is needed. Most users would not know what to do with that.  On the other hand, we might not want to make it easy, because, invariably, some novice user will hammer the servers.  Of course, it might make more sense to create a DVD containing xampp and the fully-processed XML w/images.  That would probably be much more useful to people, but the bandwidth issues are problematic.  Bittorrent, perhaps? &mdash;MooMishka 16:23, 21 May 2007 (CDT)


 * And, yes, I know that it's a database dump. However, to be truly useful, it needs images, and my handy-dandy gentle image sucker will soon be getting them for me.  :-)  &mdash;MooMishka 16:45, 21 May 2007 (CDT)


 * If you provide me an archive of the images and some meaningful documentation on how to integrate that archive with an imported DB dump, I'd be happy to add it here. &mdash;Tanaric 18:39, 21 May 2007 (CDT)


 * Oh, blarg. Some image filenames can't be handled by windows, such as those with double quotes (e.g., for the '"You Will Die!"' skill, etc.).  So much for a nice windows DVD (this doesn't bother me too much, as my main mediawiki server is running under FreeBSD ;-).  &mdash;MooMishka 00:02, 22 May 2007 (CDT)


 * I'm running Ubuntu, so I'd still like such an archive. :) &mdash;Tanaric 19:43, 22 May 2007 (CDT)


 * Well, I think I've downloaded most of the images. I thought there were ~100000, but there only seem to be a bit more than 15000 (I'm still trying to figure out how I got that 100000 number).  I wasn't able to get around 800 images, and many/most of those appear to be just plain missing (example: see the missing images on this page: http://gw.gamewikis.org/wiki?title=Dervish_Ancient_Armor/Male&oldid=520525 ).  Others are user images, which I don't really need.  :-) Anyway, I've got the XML w/images loaded up, and everything seems to be working well.  I just had to run rebuildImages.php twice, as documented in the mediawiki docs, and revert the main page (because the wiki was installed after the XML was created, the wiki's default main page, which is dated to when the wiki was installed, took precedence over the GW one). The images occupy around 600MB (without thumbnails), and a zip file is around 113MB (a .tar.bz2 file is about the same size). Later this week, I'm going to try creating a windows xampp distribution (non-DVD), after renaming the problematic images.  I'm guessing that this will use up around 1.2GB of space.


 * If you're willing, send me an email with instructions on how I should get the non-Windows zip from you. I can provide an FTP account if needed. &mdash;Tanaric 02:04, 23 May 2007 (CDT)


 * Mail sent. &mdash;MooMishka 14:02, 23 May 2007 (CDT)


 * Boo. Some of the image names are computed (e.g., from skill and boss names, etc.).  I don't have these.  I'll have to figure out a way of extracting and downloading these.  &mdash;MooMishka 01:31, 25 May 2007 (CDT)
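
The Windows filename problem mentioned earlier in this thread (double quotes in names such as the '"You Will Die!"' skill image) could be handled along these lines. This is only a sketch of one possible renaming rule, not the scheme actually used for any archive; the function name and the underscore replacement are assumptions.

```python
import re

# Characters that Windows does not allow in file names.
_WINDOWS_FORBIDDEN = re.compile(r'[<>:"/\\|?*]')


def windows_safe_name(filename: str, replacement: str = "_") -> str:
    """Replace each character Windows forbids in file names with `replacement`."""
    return _WINDOWS_FORBIDDEN.sub(replacement, filename)


print(windows_safe_name('"You Will Die!".jpg'))  # prints: _You Will Die!_.jpg
```

Note that this does not cover reserved device names or trailing dots and spaces, two distinct names can collapse into the same result, and any renamed file would no longer match the image name the wiki pages refer to.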

Broken image link
The URL for the image files is broken. I think you meant "guildwiki-images-unix-2007-05-01.zip", and not "guildwiki-images-unix-2007-05-1" ;-)  &mdash;MooMishka 22:22, 29 May 2007 (CDT)
 * Try it now :p -Auron [[Image:Elit Druin.jpg|19px||My Talk]] 22:28, 29 May 2007 (CDT)


 * Oops. &mdash;Tanaric 23:17, 29 May 2007 (CDT)

I no longer have sysop rights...
..so I can't update this article. If any sysops are willing to link in the new dumps, I'd appreciate it. &mdash;Tanaric 18:50, 29 September 2007 (UTC)
 * Done, let me know if you want me to add/remove any or change the formatting. --Xasxas256 07:30, 30 September 2007 (UTC)
 * Please move the images dump to the images subheading, and move the all revisions dump to the Complete dumps subheading. Thanks! &mdash;Tanaric 11:06, 30 September 2007 (UTC)
 * Fixed (I think). --Xasxas256 11:21, 30 September 2007 (UTC)

before I forget...
From what I've heard, database dumps will also be available from Wikia at http://wikistats.wikia.com/dbdumps/dbdumps.html once GuildWiki's stuff has been fully integrated with Wikia's system. Bringing this up now instead of later in case I lose track of the link or forget about it when it does become available... -User:PanSola (talk to the ) 11:46, 30 September 2007 (UTC)
 * Yeah, the wikia page that explains this is http://www.wikia.com/wiki/Database_dump. However, wikistats.wikia.com is always busy. :-( Maybe some admin should add this info to the article (hint, hint). --mendel 07:52, 11 June 2008 (UTC)
 * Should we completely get rid of the old dump info? Those dumps are 9 months old, so they probably aren't very useful anymore.  Except maybe the image archive - do we know if Wikia makes images available with their db dumps?  &mdash;Dr Ishmael [[Image:Diablo_the_chicken.gif]] 15:10, 11 June 2008 (UTC)
 * These are the most current dumps we have. For historical reasons, I want to leave a note that historic dumps from before the builds wipe and the wikia move respectively are available from Tanaric's website. --mendel 15:25, 11 June 2008 (UTC)

Wikia DB dumps
Wikistats is working again; our dumps are here:

List of all database dumps
 * pages_full.xml.gz ~1GB
 * pages_current.xml.gz ~60MB

AFAIK, these do not include images - the pages_current DB doesn't, although the text for the images is there. --mendel 22:25, 19 June 2008 (UTC)


 * Interesting that we're the third or fourth largest wiki there, depending on whether one counts by pages_full or pages_current, trailing only Star Wars, Uncyclopedia, and possibly WoWwiki. Quizzical 23:11, 19 June 2008 (UTC)


 * Looking at the tables on http://wikistats.wikia.com/EN/Sitemap.htm, if you go by word count and database size, the wikis INCICLOPEDIA, NONCICLOPEDIA, and FFXI definitely have us beat as well (there may be others), but maybe they compress better, possibly because they duplicate information? We have about 0.8% of all words and database space on Wikia, and approximately 0.4% of the pageviews. See my userpage for statistics links. --mendel 23:46, 19 June 2008 (UTC)