OSDev.org

The Place to Start for Operating System Developers

All times are UTC - 6 hours




 Post subject: How to get the wiki for offline browsing?
PostPosted: Tue Apr 24, 2012 8:47 am 
Joined: Tue Apr 24, 2012 8:38 am
Posts: 4
Hi all,

My first post here (and maybe my only one, who knows :mrgreen: ). By the way, funny captcha question (the one about the source register for movsb).

I was just wondering whether there is an archive of the wiki for offline browsing. I can't cleanly get it with wget, because I would also get all the "garbage" pages: edit, history, and the like.

I know it may be updated frequently, but I don't mind, as I like it very much as it already is :D

Have a nice time all =D>


 Post subject: Re: How to get the wiki for offline browsing?
PostPosted: Tue Apr 24, 2012 10:08 am 
Joined: Tue Apr 24, 2012 8:38 am
Posts: 4
Perfect! There is a ZIP archive of the Wiki, without histories and the like, just articles and talk pages (the latter being important too, after all).

Thank you.


 Post subject: Re: How to get the wiki for offline browsing?
PostPosted: Wed May 02, 2012 12:30 pm 
Joined: Wed May 02, 2012 12:22 pm
Posts: 1
But the ZIP archive at http://files.osdev.org/osdev_wiki.zip seems to be outdated:
its date is 30-Dec-2011. To be sure, I checked some wiki changes newer than this date.
None of the ones I checked is included in the ZIP archive.

Is something wrong with the archiving of the wiki?

Dirk


 Post subject: Re: How to get the wiki for offline browsing?
PostPosted: Wed May 02, 2012 2:42 pm 
Joined: Wed Jul 13, 2011 7:38 pm
Posts: 558
I'd assume it's manually updated.

Can you curl excluding pages with "&action=edit" and "&action=history" in the address? If so, there's your solution.
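
A minimal sketch of that idea, assuming you already have a list of candidate URLs (curl does not crawl links by itself, so the list has to come from somewhere else). The function name `filter_wiki_urls` and the file `urls.txt` are made up for illustration:

```shell
# Hypothetical helper: read candidate URLs on stdin and drop the
# MediaWiki edit and history views, keeping everything else.
filter_wiki_urls() {
    grep -v -e '[?&]action=edit' -e '[?&]action=history'
}

# Possible usage (urls.txt is an assumed file with one URL per line):
#   filter_wiki_urls < urls.txt | xargs -n 1 curl --remote-name
```

The filter only looks at the query string, so any other unwanted views (printable versions, old revisions) would need their own patterns.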


 Post subject: Re: How to get the wiki for offline browsing?
PostPosted: Thu May 03, 2012 9:54 am 
Joined: Fri Mar 07, 2008 5:36 pm
Posts: 2111
Location: Bucharest, Romania
To my knowledge, that archive is *much* older. The date you mentioned is the one when chase changed his hosting provider.

_________________
"Computers in the future may weigh no more than 1.5 tons.", Popular Mechanics (1949)
[ Project UDI ]


 Post subject: Re: How to get the wiki for offline browsing?
PostPosted: Mon May 07, 2012 2:44 am 
Joined: Thu Nov 16, 2006 12:01 pm
Posts: 7612
Location: Germany
There's Special:AllPages and Special:Export. Between them, you should be able to grab the Wiki contents, and either import them into an offline Wiki or transform the XML into something more readable. No automated "grab it all" function exists, to my knowledge. (I really don't like MediaWiki, and this gets another notch in the "dislike" list.)
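
As a rough sketch of the Special:Export half, assuming the wiki exposes the stock MediaWiki export form at http://wiki.osdev.org/Special:Export and accepts the usual `pages` and `curonly` form fields. The file `titles.txt` (one page title per line, e.g. copied from Special:AllPages), the variables, and the function name are all hypothetical:

```shell
# Assumed endpoint; adjust if the wiki uses a different script path.
EXPORT_URL=${EXPORT_URL:-http://wiki.osdev.org/Special:Export}
CURL=${CURL:-curl}

# export_pages TITLES_FILE
# POST the newline-separated page titles in TITLES_FILE to Special:Export
# and write the returned XML to stdout. curonly=1 asks for the current
# revision only, without page history.
export_pages() {
    "$CURL" --silent --show-error \
        --data-urlencode "pages@$1" \
        --data "curonly=1" \
        --data "action=submit" \
        "$EXPORT_URL"
}

# Possible usage:
#   export_pages titles.txt > osdev_wiki.xml
```

The resulting XML is wiki markup wrapped in an export envelope, so it still has to be imported into a local wiki or converted before it is pleasant to read.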

_________________
Every good solution is obvious once you've found it.


 Post subject: Re: How to get the wiki for offline browsing?
PostPosted: Mon May 07, 2012 8:16 am 
Joined: Fri Mar 07, 2008 5:36 pm
Posts: 2111
Location: Bucharest, Romania
Can you imagine what would happen if enough people tried to download Wikipedia at the same time? At any rate, I think wikis are not meant for offline reading: the whole point of them is that they can be in a constant state of change.

 Post subject: Re: How to get the wiki for offline browsing?
PostPosted: Mon May 07, 2012 8:25 am 
Joined: Thu Nov 16, 2006 12:01 pm
Posts: 7612
Location: Germany
Are you seriously comparing our OSDev Wiki with Wikipedia? :shock: :wink:

(Before there was MediaWiki, we were using PhpWiki, which provided a nice download option, which could be disabled / enabled by the admin.)

 Post subject: Re: How to get the wiki for offline browsing?
PostPosted: Mon May 07, 2012 8:33 am 
Joined: Fri Mar 07, 2008 5:36 pm
Posts: 2111
Location: Bucharest, Romania
Ah, no. Those were just my $0.02 regarding your comment on MediaWiki:

Solar wrote:
(I really don't like MediaWiki, and this gets another notch in the "dislike" list.)


I do have some criticisms of my own, but those are off-topic.

 Post subject: Re: How to get the wiki for offline browsing?
PostPosted: Mon Jun 11, 2012 9:52 pm 
Site Admin

Joined: Wed Oct 20, 2004 10:46 pm
Posts: 684
Location: Texas
The DumpHTML extension is broken, hence no more backups.

I've switched over to using wget in a weekly cron job.

Code:
wget --mirror -k -p --reject '*=*,User:*,Special:*,User_talk:*' --exclude-directories='User:*,User:*/*,User:*/*/*,User_talk:*,User_talk:*/*,User_talk:*/*/*,Special:*,Special:*/*,Special:*/*/*' --user-agent="osdev-mirror" http://wiki.osdev.org/Main_Page


It ends up getting a couple of duplicate pages (if you are on Windows) because some of the pages are archived twice with different case, but overall I think it should be a functional offline copy.
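
To see which pages collide, one option is to compare the file names case-insensitively. A small sketch; the function name and the directory `osdev_wiki` are placeholders:

```shell
# Hypothetical helper: read file paths on stdin and print, in lowercase,
# every path that occurs more than once when case is ignored.
list_case_collisions() {
    tr '[:upper:]' '[:lower:]' | sort | uniq -d
}

# Possible usage on an unpacked mirror (directory name is assumed):
#   find osdev_wiki -type f | list_case_collisions
```

This has to run on a case-sensitive filesystem, since on Windows the colliding files will already have overwritten each other.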

If you guys can improve the wget command or have a better Linux alternative (maybe curl), let me know. Any additional shell scripting/sed/awk/etc. to fix up any broken stuff would also be welcome.

The result is: http://files.osdev.org/osdev_wiki.zip
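
One tweak that might be worth trying, offered as an untested sketch: wget's `--adjust-extension` option saves HTML pages with an `.html` suffix, and `--convert-links` then rewrites links to point at the saved names. The command below is the one above with long option names and that single flag added; the `WGET` variable and the function name are only there for illustration:

```shell
WGET=${WGET:-wget}

# mirror_wiki [START_URL]
# Mirror the wiki into the current directory. Same filters as the
# original command; --adjust-extension is the only addition.
mirror_wiki() {
    "$WGET" --mirror --convert-links --page-requisites \
        --adjust-extension \
        --reject '*=*,User:*,Special:*,User_talk:*' \
        --exclude-directories='User:*,User:*/*,User:*/*/*,User_talk:*,User_talk:*/*,User_talk:*/*/*,Special:*,Special:*/*,Special:*/*/*' \
        --user-agent="osdev-mirror" \
        "${1:-http://wiki.osdev.org/Main_Page}"
}

# Possible usage:
#   mirror_wiki
```

Whether this produces a browsable tree for this particular wiki is an assumption; it would need checking against a real mirror run.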


 Post subject: Re: How to get the wiki for offline browsing?
PostPosted: Sat Jul 21, 2012 12:35 am 
Joined: Tue Sep 07, 2010 11:00 pm
Posts: 55
Is there any possible fix for DumpHTML? Does anyone have a dump from before it broke? I can't use the new ones in their current form. ;_; *should never have deleted his copy, didn't know*

_________________
“...No rest, no peace...” ― Odin Vex


 Post subject: Re: How to get the wiki for offline browsing?
PostPosted: Sat Jul 21, 2012 4:03 pm 
Site Admin

Joined: Wed Oct 20, 2004 10:46 pm
Posts: 684
Location: Texas
What is broken in the current dump? (Or what don't you like about it?)


 Post subject: Re: How to get the wiki for offline browsing?
PostPosted: Sat Jul 21, 2012 4:37 pm 
Joined: Wed Oct 18, 2006 3:45 am
Posts: 9301
Location: On the balcony, where I can actually keep 1½m distance
You can't click category links (i.e. pretty much everything on the categorized main page) in at least Firefox and Epiphany, which is rather annoying, as it breaks pretty much all of the indexing. Other than that, things work mostly fine here.

_________________
"Certainly avoid yourself. He is a newbie and might not realize it. You'll hate his code deeply a few years down the road." - Sortie
[ My OS ] [ VDisk/SFS ]


 Post subject: Re: How to get the wiki for offline browsing?
PostPosted: Sat Jul 21, 2012 7:37 pm 
Joined: Tue Sep 07, 2010 11:00 pm
Posts: 55
The filesystem structure is entirely different: it uses page titles instead of HTML pages that can properly link. :( Practically no link will work.

 Post subject: Re: How to get the wiki for offline browsing?
PostPosted: Sat Aug 04, 2012 5:14 pm 
Joined: Sat Aug 04, 2012 5:05 pm
Posts: 4
Hi,
I read the entire thread and much more, but I still don't know how to READ the offline archive!
The uncompressed files have no extension, and there is no index.php or .html file.
Sorry to be a fool; I feel I'm missing something obvious, but I'm completely lost :cry:

I'm going on vacation in two days, and I really want to bring this marvellous Wiki with me. Please help me!

PS: Forgive my poor English; I don't speak it very well, and right now I'm very tired.

Thanks folks.

