XML Split of a Large file

QXMLEdit has a dedicated function for that: I used it successfully with a Wikipedia dump. The ~2.7Gio file became a bunch of ~1 400 000 files (one per page). It even allows you to dispatch them in subfolders.


XmlSplit - A Command-line Tool That Splits Large XML Files

  • http://xponentsoftware.com/xmlSplit.aspx

xml_split - split huge XML documents into smaller chunks

  • http://www.perlmonks.org/index.pl?node_id=429707
  • http://metacpan.org/pod/XML::Twig

Split that XML by bhayanakmaut (No source code and I could not get this one working)

  • http://sourceforge.net/projects/splitthatxml/

A similar question: How do I split a large xml file?


I think you'll have to split manually unless you are interested in doing it programmatically. Here's a sample that does that, though it doesn't mention the max size of handled XML files. When doing it manually, the first problem that arises is how to open the file itself.

I would recommend a very simple text editor - something like Vim. When handling such large files, it is always useful to turn off all forms of syntax highlighting and/or folding.

Other options worth considering:

  1. EditPadPro - I've never tried it with anything this size, but if it's anything like other JGSoft products, it should work like a breeze. Remember to turn off syntax highlighting.

  2. VEdit - I've used this with files of 1GB in size, works as if it were nothing at all.

  3. EmEditor

Tags:

Xml