Similar Threads for Man Pages - In Development


Login or Register for Dates, Times and to Reply

 
Thread Tools Search this Thread
The Lounge What is on Your Mind? Similar Threads for Man Pages - In Development
# 8  
Here is where we stand, looking only at the man pages under 4000 bytes for similarthreads:
Code:
mysql> connect unixmanpages;select count(1) as count from neo_man_page_entry where similarthread = "nopagetextmatch" or similarthread = "notagsmatch" and strlen < 4000; select count(1) as count from neo_man_page_entry where strlen < 4000;
Connection id:    26801183
Current database: unixmanpages

+-------+
| count |
+-------+
| 94967 |
+-------+
1 row in set (1.09 sec)

+--------+
| count  |
+--------+
| 204819 |
+--------+
1 row in set (0.03 sec)

This means that 46% of all man pages under 4000 bytes are similar thread orphans.

Look at all man pages:

Code:
mysql> connect unixmanpages;select count(1) as count from neo_man_page_entry where similarthread = "nopagetextmatch" or similarthread = "notagsmatch" ; select count(1) as count from neo_man_page_entry;
Connection id:    26806490
Current database: unixmanpages

+--------+
| count  |
+--------+
| 166050 |
+--------+
1 row in set (1.09 sec)

+--------+
| count  |
+--------+
| 347938 |
+--------+
1 row in set (0.00 sec)

mysql>

This means that ~48% of all man pages are similar thread orphans.

Next, I will work on how to reduce the under 4000 byte man page similar thread orphans even further. I think I will work some kind of os (operating system) match, just to reduce the orphans so they have some similar thread friends.
# 9  
Done for the under 4000 byte orphans:

Code:
mysql> connect unixmanpages;select count(1) as count from neo_man_page_entry where similarthread = "nopagetextmatch"  and strlen < 4000; select count(1) as count from neo_man_page_entry;
Connection id:    27647119
Current database: unixmanpages

+-------+
| count |
+-------+
| 87617 |
+-------+
1 row in set (0.48 sec)

+--------+
| count  |
+--------+
| 347938 |
+--------+
1 row in set (0.00 sec)

25%...
# 10  
Update:

With the remaining batch of under 4000 byte man page orphans without similar threads, I matched the os against the text in the posts and ordered the similar thread results by the number of "thread thanks" and now there are zero orphans.

So, I am calling this project "done" for now.
# 11  
Setting data point for future reference:

Code:
Total Linux Man Pages in DB: 145,728
Total Indexed by Google (GSC): 19.287

Total Unix Man Pages in DB: 133,279
Total Indexed by Google (GSC): 12,235

Linux Man Page Index Coverage: 13%
Unix  Man Page Index Coverage: 9%

Let's see if this improves and by how much in 2020, based on all the work I did on this in 2019.
Login or Register for Dates, Times and to Reply

Previous Thread | Next Thread
Thread Tools Search this Thread
Search this Thread:
Advanced Search

Test Your Knowledge in Computers #389
Difficulty: Medium
Rocky Mountain BASIC (RMB) is a dialect of the BASIC programming language created by AIX.
True or False?

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Commands for man pages

what command should i use for displaying the manual pages for the socket, read and connect system calls? (1 Reply)
Discussion started by: Nabeel Nazir
1 Replies

2. HP-UX

Looking for some man pages.

Can anyone supply me with the man pages for: omnidatalist omnibarlist omnisap.exe I prefer the source man pages in nroff format. A clue about the software bundles which supply these man pages is fine as well. OS: HP-UX TIA (11 Replies)
Discussion started by: sb008
11 Replies

3. Solaris

MAN PAGES

Hi everyone, I have a small query, in solaris the man pages get displayed on half of the terminal , can i get a full terminal or full screen display ?:) (2 Replies)
Discussion started by: M.Choudhury
2 Replies

4. Fedora

why do we have .1 extension in MAN PAGES?

Hello sir, I am using FEDORA 9. I wanted to know why do we have ".1" extension in the archives of man pages. I know we are giving format. I want to know the importance or purpose of this format. Can you please tell me :confused: (2 Replies)
Discussion started by: nsharath
2 Replies

5. UNIX for Dummies Questions & Answers

Man pages on Solaris 10

Hi, I want to install man pages package from solaris 10. Solaris 10 has already been installed on my servor but I have to add the man pages packages. I search for a long time on internet this package but I didn't find a compatible one... So I downloaded Solaris 10 from Sun site to get this... (1 Reply)
Discussion started by: MasterapocA
1 Replies

6. UNIX for Dummies Questions & Answers

how to read man pages

can anybody explain me how to read unix man pages? for example when i want to get information about ps command man ps gives me this output: *********************************** Reformatting page. Please wait... completed ps(1) ... (2 Replies)
Discussion started by: gfhgfnhhn
2 Replies

7. UNIX for Dummies Questions & Answers

man pages

When reading man pages, I notice that sometimes commands are follwed by a number enclosed in parenthesis. such as: mkdir calls the mkdir(2) system call. What exactly does this mean? (4 Replies)
Discussion started by: dangral
4 Replies

8. UNIX for Dummies Questions & Answers

man pages

Hi folks, I want to know all the commands for which man pages are available. How do i get it? Cheers, Nisha (4 Replies)
Discussion started by: Nisha
4 Replies

9. UNIX for Dummies Questions & Answers

man pages

Hi, I've written now a man pages, but I don't knwo how to get 'man' to view them. Where have I to put this files, which directories are allowed?? THX Bensky (3 Replies)
Discussion started by: bensky
3 Replies

10. UNIX for Dummies Questions & Answers

Man pages

Hello , I just installed openssh in my system . I actually tried to man sshd but it says no entry , though there is a man directory in the installation which have the man pages for sshd . Can anyone tell me how should i install these man pages . DP (2 Replies)
Discussion started by: DPAI
2 Replies

Featured Tech Videos