Search Results

Search: Posts Made By: stargazerr
25,231
Posted By stargazerr
Actually, I need to be able to count all...
Actually,

I need to be able to count all characters on each line (before a line break), in a file and also to be able to remove all spaces in a line but not the line breaks, so

this is a line...
25,231
Posted By stargazerr
Thanks a lot guys :) .. One more favor,...
Thanks a lot guys :) ..

One more favor, this method of yours counts the number of words on each line .. Can we count the number of characters too ?

Thanks
25,231
Posted By stargazerr
Help with wc and line breaks
Hi everyone,

I have gone through the forum trying to find an answer to this question but was unsuccessful. I am hoping that someone can help me with this please.

I am trying to get my script to...
8,329
Posted By stargazerr
sed to extract HTML content
Hiya,

I am trying to extract a news article from a web page. The sed I have written brings back a lot of Javascript code and sometimes advertisments too. Can anyone please help with this one ??? I...
8,691
Posted By stargazerr
cfajohnson Not to worry .. :) .. I have...
cfajohnson

Not to worry .. :) .. I have fixed it, I think. Doing a few more tests to see if my toggle works.

Thank you again :) .. This was awesome.

SG

Kevin

hmmm .. you have a point...
8,691
Posted By stargazerr
That is all one file .. But I toggled the text...
That is all one file .. But I toggled the text (deleted a few of the, this is a test lines) so that the longest common phrase was yahooo lala lala .. but it was not picked up. :(

I will see if I...
8,691
Posted By stargazerr
cfajohnson .. :) .. That was AWESOME .. :D .....
cfajohnson .. :) .. That was AWESOME .. :D .. Thank you a million :)

One question though. It is only searching for phrases that are made up of 4 words and above. I don't see the loop init being...
8,691
Posted By stargazerr
lol .. okk danmero .. Here is one of the text...
lol .. okk danmero .. Here is one of the text files I am working from, as an example. :) .. After removal of HTML tags and extraction of only the content i require, i.e, title, author, date and...
8,691
Posted By stargazerr
Umm .. where would that be ?? I just posted under...
Umm .. where would that be ?? I just posted under shell scripts .. Didnt know there was an advanced section.
8,691
Posted By stargazerr
Yes. I agree. which is why i am asking here if...
Yes. I agree. which is why i am asking here if anyone can think of a more feasible way .. :)

The search is limited to news articles though .. So this script that i am designing will run only on...
8,691
Posted By stargazerr
hahaha .. I know exactly what you mean Kevin .....
hahaha .. I know exactly what you mean Kevin .. :) .. I wish I had a choice too :P .. But I need this as added functionality to the search engine I am designing ..

Read my post above, the for...
8,691
Posted By stargazerr
A phrase is a collection of n number of words...
A phrase is a collection of n number of words that are the same. Could be anything from 2 to n. So for example in a sample text:

The quick brown fox jumped over the ugly dog
The quick brown fox...
8,691
Posted By stargazerr
As an idea .. I was thinking of maybe counting...
As an idea .. I was thinking of maybe counting the total number of words in the text file and then running a for loop to that number to check for n-grams.

I haven't yet tried this idea. Right now...
8,691
Posted By stargazerr
The text file is nothing but a news article after...
The text file is nothing but a news article after parsing HTML tags and extracting the content using XML. I use the following to extract and print the most bigrams for now.

tr -sc 'a-zA-z0-9.'...
8,691
Posted By stargazerr
Shell script to find longest phrase
Hi Everyone,

I am trying to write a shell script that can find the longest phrase that appears at least twice in an online news article. The HTML has been parsed through an HTML parser, converted...
Showing results 1 to 15 of 15

 
All times are GMT -4. The time now is 05:56 PM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy