Is there some *easy* way to batch convert wordpress blog entries into plain HTML that keeps the same URLs?
-
Is there some *easy* way to batch convert wordpress blog entries into plain HTML that keeps the same URLs? I don’t want to break people’s links.
I can handle plain HTML. I have BBEdit and I know how to use it mostly (my grep is pretty minimal). (My tags and categories are already messed up, so they’d need editing anyway.)
Failing that, is there some way to import them into some other site management system?
2/2
-
@gannet do any of these helps?
6 WordPress to HTML Converter Plugins and Tools - White Canvas
Turn your WordPress site into a static, HTML, JavaScript, and CSS-based website today with these WordPress to HTML converter tools.
White Canvas (wcanvas.com)
I've personally been looking to convert to markdown. But I'm not quite there yet energy wise.
-
@gannet I do something like this occasionally for people with old/defunct WP sites where they want to keep the content alive, but don't want to have to deal with keeping WP and all its requirements.
The tool I use is HTTrack. You can point it at any live web site, and it will create a completely static mirror of that site on your hard drive. It preserves things like directory structures, so if you use "pretty permalinks" for your WP post URLs, those URLs should stay intact. Once you have the static mirror, you can throw it up on any old static file hosting, or modify the HTML files by hand if you want to.
HTTrack Website Copier - Free Software Offline Browser (GNU GPL)
HTTrack is a free (GPL, libre/free software) and easy-to-use offline browser utility. It allows you to download a World Wide Web site from the Internet to a local directory, building recursively all directories, getting HTML, images, and other files from the server to your computer. HTTrack arranges the original site's relative link-structure. Simply open a page of the 'mirrored' website in your browser, and you can browse the site from link to link, as if you were viewing it online. HTTrack can also update an existing mirrored site, and resume interrupted downloads. HTTrack is fully configurable, and has an integrated help system. WinHTTrack is the Windows 2000/XP/Vista/Seven/8 release of HTTrack, and WebHTTrack the Linux/Unix/BSD release.
(www.httrack.com)
-
@jalefkowit @gannet omg this looks so useful
-
Jason Lefkowitzreplied to Jason Lefkowitz last edited by
@gannet If you want to migrate the content into another CMS, WP provides a way to do that, kinda. There's a function in the "Tools" menu called "Export" that will dump all your site content into a file using an XML-based format WordPress calls WXR. So when you have that, the question to ask the other CMS is if it has a way to handle WXR imports.
WXR is poorly documented, which sucks. But it's at least human-readable, and if a CMS these days is going to support content import from anything, it should be from WordPress.
Let's talk about WXR! · WordPress data-liberation · Discussion #56
Let's talk about WXR!
GitHub (github.com)
-
Jason Lefkowitzreplied to metasilk last edited by [email protected]
@metasilk @gannet It's great. You can take a whole WP site and turn it into a static archive you can host from an S3 bucket for pennies. Terrific for older sites that aren't getting active updates anymore.
The only caveat is that features that rely on server-side functionality (like search, for instance) won't work. But you can solve that by running a regex against the static files to remove the search box