GuildWiki talk:Database dumps

Size
How about specifying the size of the dumps as well? All articles = 37.44MB, build articles = 5.62MB. -- Ab.Er.Rant (msg Aberrant80) 22:50, 4 May 2007 (CDT)
 * Shoot, I knew I forgot to copy something over! &mdash;Tanaric 00:06, 5 May 2007 (CDT)

Images
What about images, like maps? They don't seem to be in the dump. Do I use a modified Wikix program? That seems pretty server-hostile. 85.214.63.253 14:00, 6 May 2007 (CDT)


 * That would be moderately server-hostile, yes. Do it at night, if you need to. &mdash;Tanaric 19:10, 6 May 2007 (CDT)


 * OK, I'd like to get the images, also, but I don't want to abuse the server. I'd also like to get the images within my lifetime (there are well over 100000 images).  So, before I do anything, how about this?: between 12M and 5:59AM, images get sucked down at an average rate of 1 per second.  Between 6AM and 11:59PM, images get sucked down at an average rate of 1 every 5 seconds.  With this, I'm guessing that it'll take 3-4 days to get all the images.  Note that Wikix (which I would not be using) is a lot more hostile than this, as it just sucks the images down as fast as it can (and, since it's designed to suck from wikipedia's massive farm, there's an option to do 16 parallel suckers ;-).  &mdash;MooMishka 02:53, 21 May 2007 (CDT)


 * Midnight to 6AM is best if you mean GMT-5 (CDT). Adjust them if you meant a different time zone. --Fyren 03:15, 21 May 2007 (CDT)

Problem reading dump
Hmm, I can't seem to import the dump in my Mediawiki installation. I get the message 'Unknown import source type'. Anyone succeeded in importing these dumps? Of course I extracted the xml files. They look ok at a glance. --Toxaris 06:05, 12 May 2007 (CDT)
 * That error doesn't mean the dump is bad but either that something is messed up with your MW install or you're just doing something wrong. The "import source" for Special:Import is supposed to either be a file upload or an interwiki transfer.  For example, if you view Special:Import on your wiki and look at the HTML for the page, you can see a hidden field in the form with name "source" and value "upload".  You can look at the code in includes/SpecialImport.php for where it checks the value against "upload" and "transwiki" but shows that error if it's not either.  --Fyren 06:28, 12 May 2007 (CDT)
 * I had no problem reading the XML data into MW 1.9.3. However, the standard method is slow (which is documented and known).  Other problems include (1) needing additional MW extensions (undocumented), and (2) images aren't included in the XML dumps.  I hate to say this, but you really do need advanced MW knowledge to deal with the XML.  192.26.10.2 05:24, 20 May 2007 (CDT)


 * We have no problem with this -- we are under no obligation to provide a tutorial on how to use these. That said, if anybody would like to write such a tutorial, that would be very helpful. Make a subpage with your proposed change and it will be merged upon admin approval. &mdash;Tanaric 19:25, 20 May 2007 (CDT)


 * No one said that you're under an obligation. He's just saying that, to use the XML right now, you need fairly advanced knowledge.  This is because there are no documentation/tutorial and programs (which neither you nor anyone else are obligated to provide, although, perhaps, someone will).  This is not a complaint -- merely an observation.  &mdash;MooMishka 03:07, 21 May 2007 (CDT)


 * meta:Data dumps, there's your documentation. The importDump.php method works fine with the GuildWiki dumps, other methods listed in that page probably do too.
 * 192.26.10.2, 1) Special:Version (ParserFunctions is the only extension you need for the dump to work properly). 2) it's a database dump... --Dirigible 03:42, 21 May 2007 (CDT)


 * While that certainly is "documentation", you're kinda proving my point that advanced knowledge is needed. Most users would not know what to do with that.  On the other hand, we might not want to make it easy, because, invariably, some novice user will hammer the servers.  Of course, it might make more sense to create a DVD containing xampp and the fully-processed XML w/images.  That would probably be much more useful to people, but the bandwidth issues are problematic.  Bittorrent, perhaps? &mdash;MooMishka 16:23, 21 May 2007 (CDT)


 * And, yes, I know that it's a database dump. However, to be truly useful, it needs images, and my handy-dandy gentle image sucker will soon be getting them for me.  :-)  &mdash;MooMishka 16:45, 21 May 2007 (CDT)


 * If you provide me an archive of the images and some meaningful documentation on how to integrate that archive with an imported DB dump, I'd be happy to add it here. &mdash;Tanaric 18:39, 21 May 2007 (CDT)


 * Oh, blarg. Some image filenames can't be handled by windows, such as those with double quotes (e.g., for the '"You Will Die!"' skill, etc.).  So much for a nice windows DVD (this doesn't bother me too much, as my main mediawiki server is running under FreeBSD ;-).  &mdash;MooMishka 00:02, 22 May 2007 (CDT)


 * I'm running Ubuntu, so I'd still like such an archive. :) &mdash;Tanaric 19:43, 22 May 2007 (CDT)


 * Well, I think I've downloaded most of the images. I thought there were ~100000, but there only seems to be a bit more than 15000 (I'm still trying to figure out how I got that 100000 number).  I wasn't able to get around 800 images, and many/most of those appear to be just plain missing (example: see the missing images on this page: http://gw.gamewikis.org/wiki?title=Dervish_Ancient_Armor/Male&oldid=520525 ).  Others are user images, which I don't really need.  :-) Anyway, I've got the XML w/images loaded up, and everything seems to be working well.  Just had to run rebuildImages.php twice, as documented in the mediawiki docs, and revert the main page (because the wiki was installed after the XML was created, the wiki's default main page, which gets an update date of when the wiki is installed, took precedence over the GW one). The images occupy around 600MB (without thumbnails), and a zip file is around 113MB (a .tar.bz2 file is about the same size). Later this week, I'm going to try creating a windows xampp distribution (non-DVD), after renaming the problematic images.  I'm guessing that this will use up around 1.2GB of space.