12-22-2010
parsing file based on characters/bytes
I have a datafile that is formatted as fixed.
I know that each line should contain 880 characters.
I want to separate the file into 2 files, one that has lines with 880 characters and the other file with everything else.
Is this possible ?
10 More Discussions You Might Find Interesting
1. Shell Programming and Scripting
Hi all,
Does anybody know or guide me on how to remove the first N bytes and the last N bytes from a binary file? Is there any AWK or SED or any command that I can use to achieve this?
Your help is greatly appreciated!!
Best Regards,
Naveen. (1 Reply)
Discussion started by: naveendronavall
1 Replies
2. UNIX for Dummies Questions & Answers
I'm looking to determine if I can use a grep command to read file and rename the file based on the first 3 characters of the data in the file.
An example is:
Read FileA
If the first 3 positions of the data in the file are "ITP", then rename the file as FileA_ITP, else if the first 3... (3 Replies)
Discussion started by: jchappel
3 Replies
3. Shell Programming and Scripting
I'm still up trying to figure this out and it is driving me nuts.
I have a log file which has a basic format of this...
2010-10-10 22:25:42
Init block 'UA Deployment Date': Dynamic refresh of repository scope variables has failed.
The ODBC function has returned an error. The database... (4 Replies)
Discussion started by: k1ko
4 Replies
4. Shell Programming and Scripting
I have a big file with many brackets () in it from which I need to parse number characters and numbers. Below is an example of my file
14 (((A__0:0.02,B__1:0.3)0:0.04,C__0:0.025)2:0.01),(D__0:0.00978,E__2:0.01031)1:0.00362;
15... (1 Reply)
Discussion started by: Lucky Ali
1 Replies
5. UNIX for Dummies Questions & Answers
Hello guys. I really hope someone will help me with this one..
So, I have to write this script who:
- creates a file home/student/vmdisk of 10 mb
- formats that file to ext3
- mounts that partition to /mnt/partition
- creates a file /mnt/partition/data. In this file, there will... (1 Reply)
Discussion started by: razolo13
1 Replies
6. Shell Programming and Scripting
Hello All,
I have two VERY large .csv files that I want to compare values based on substrings. If the lines are unique, then print the line.
For example, if I run a
diff file1.csv and file2.csv
I get results similar to
+_id34,brown,car,2006
+_id1,blue,train,1985... (5 Replies)
Discussion started by: jl487
5 Replies
7. Shell Programming and Scripting
I have a file1 like
ID E2AK1_HUMAN Reviewed; 630 AA.
CC -!- SUBCELLULAR LOCATION: Host nucleus {ECO:0000305}.
ID E1A_ADEM1 Reviewed; 200 AA.
ID E1A_ADES7 Reviewed; 266 AA.
CC -!- SUBCELLULAR LOCATION: Host nucleus... (8 Replies)
Discussion started by: sammy777
8 Replies
8. Shell Programming and Scripting
I have a list file1 like
dog
cow
fox
cat
fish
duck
crowI want to classify the elements of file1 based on constrains applied on file2. Additionally the number of elements (words) in the each line of file2 is not fixed. This is my file2
cow cat fox dog
cow fox dog
fish crow fox dog cat ... (5 Replies)
Discussion started by: sammy777
5 Replies
9. Shell Programming and Scripting
Hi All,
I'm having a hard time finding a starting point for my issue. I have a 30k line file (fspsec.txt) that I would like to parse into smaller files based on any character existing in field 1.
ACCOUNTANT LEVEL 1 (ACCT.ACCOUNTANT)
OPERATORS: DOEJO (418)
TOOLS: Branch Maintenance
... (2 Replies)
Discussion started by: aahlrich
2 Replies
10. Shell Programming and Scripting
I want to parse a file containing special characters, below is a sample content of file
content of file :
Serial_no:1$$@#first_name:Rahane$$@last_name:Ajiyenke@@#profession:cricketer!@#*&^
Serial_no:1$$@#first_name:Rahane$$@last_name:Ajiyenke@@#profession:cricketer!@#*&^... (3 Replies)
Discussion started by: rajMjar
3 Replies
LEARN ABOUT OPENDARWIN
wc
WC(1) BSD General Commands Manual WC(1)
NAME
wc -- word, line, character, and byte count
SYNOPSIS
wc [-clmw] [file ...]
DESCRIPTION
The wc utility displays the number of lines, words, and bytes contained in each input file (or standard input, by default) to the standard
output. A line is defined as a string of characters delimited by a <newline> character, and a word is defined as a string of characters
delimited by white space characters. White space characters are the set of characters for which the iswspace(3) function returns true. If
more than one input file is specified, a line of cumulative counts for all the files is displayed on a separate line after the output for the
last file.
The following options are available:
-c The number of bytes in each input file is written to the standard output.
-l The number of lines in each input file is written to the standard output.
-m The number of characters in each input file is written to the standard output. If the current locale does not support multibyte
characters, this is equivalent to the -c option.
-w The number of words in each input file is written to the standard output.
When an option is specified, wc only reports the information requested by that option. The default action is equivalent to specifying the
-c, -l and -w options.
If no files are specified, the standard input is used and no file name is displayed.
ENVIRONMENT
The LANG, LC_ALL and LC_CTYPE environment variables affect the execution of wc as described in environ(7).
EXAMPLES
Count the number of characters, words and lines in each of the files report1 and report2 as well as the totals for both:
wc -mlw report1 report2
DIAGNOSTICS
The wc utility exits 0 on success, and >0 if an error occurs.
SEE ALSO
iswspace(3)
COMPATIBILITY
Historically, the wc utility was documented to define a word as a ``maximal string of characters delimited by <space>, <tab> or <newline>
characters''. The implementation, however, didn't handle non-printing characters correctly so that `` ^D^E '' counted as 6 spaces, while
``foo^D^Ebar'' counted as 8 characters. 4BSD systems after 4.3BSD modified the implementation to be consistent with the documentation. This
implementation defines a ``word'' in terms of the iswspace(3) function, as required by IEEE Std 1003.2 (``POSIX.2'').
STANDARDS
The wc utility conforms to IEEE Std 1003.1-2001 (``POSIX.1'').
HISTORY
A wc command appeared in Version 1 AT&T UNIX.
BSD
June 13, 2002 BSD