This is the Test_Data.snp file:
MEGAUPLOAD - The leading online storage and file delivery service
1. The problem statement, all variables and given/known data:
Problem Set:
Before you get started working with these challenges, be aware that the first challenge is reformatting the test data file so that you get rid of the ‘header' and get all of the columns
delimited for working with in unix. (I'll give you another clue in addition to getting rid of the header, learn ‘grep', ‘cat', ‘cut', ‘awk', ‘sed' )
write a script to change the extension of your file : Test_Data.snp to Test_Data.txt
print all lines that have an ‘A' base call either in the reference (column 2) or query (column 3) strain
print only column titled ‘LEN R' to a new file called Reference_length.txt
sort the file by column 4 ( titled [P2])
print only the lines that have a basecall in columns 2 and 3 (under [SUB] headings) and sort by [LEN R] , output to new file called snp_report.txt
2. Relevant commands, code, scripts, algorithms:
I'm not sure what this means?
3. The attempts at a solution (include all code and scripts):
The only thing I know how to do is actually show the data set in the terminal window
4. Complete Name of School (University), City (State), Country, Name of Professor, and Course Number (Link to Course):
This is part of a learning scholarship over the summer. I am working with Dr. Mia Champion of TGEN North in Flagstaff. She recommended that I come here for help.
Thanks for any help you can provide. I literally just started learning this a day ago, so please bear with me.