Sponsored Content
Top Forums UNIX for Beginners Questions & Answers awk to convert dict file to table format Post 303045533 by biofreek on Monday 30th of March 2020 10:10:32 PM
Old 03-30-2020
awk to convert dict file to table format

HI
I have a file that looks like below

Code:
abc
{Seq('GATAGC', SingleLetterAlphabet()): 1, Seq('ATAGCG', SingleLetterAlphabet()): 1, Seq('TAGCGG', SingleLetterAlphabet()): 1}
BBC
{Seq('AGGATA', SingleLetterAlphabet()): 1, Seq('GGATAG', SingleLetterAlphabet()): 1, Seq('GATAGC', SingleLetterAlphabet()): 1, Seq('ATAGCG', SingleLetterAlphabet()): 1, Seq('TAGCGG', SingleLetterAlphabet()): 1}
CCB
{Seq('GACGGA', SingleLetterAlphabet()): 1, Seq('ACGGAT', SingleLetterAlphabet()): 1, Seq('CGGATA', SingleLetterAlphabet()): 1, Seq('GGATAG', SingleLetterAlphabet()): 1, Seq('GATAGC', SingleLetterAlphabet()): 1, Seq('ATAGCG', SingleLetterAlphabet()): 1, Seq('TAGCGG', SingleLetterAlphabet()): 1}

I wanted to get something like:
Code:
abc GATAGC
abc ATAGCG
abc TAGCGG
BBC AGGATA
---------------

I have been trying to get the solution in python but wth less success. Is there any easy way to do it in awk?
 

8 More Discussions You Might Find Interesting

1. UNIX for Advanced & Expert Users

Convert UTF8 Format file to ANSI format

:) Hi i am trying to convert a file which is in UTF8 format to ANSI format i tried to use the function ICONV but it is throwing error Function i used it as $ iconv -f UTF8 -t ANSI filename Error iam getting is NOT Supported UTF8 to ANSI please some help me out on this.........Let me... (1 Reply)
Discussion started by: rajreddy
1 Replies

2. UNIX for Dummies Questions & Answers

Convert UTF8 Format file to ANSI format

:confused: Hi i am trying to convert a file which is in UTF8 format to ANSI format i tried to use the function ICONV but it is throwing error Function i used it as $ iconv -f UTF8 -t ANSI filename Error iam getting is NOT Supported UTF8 to ANSI please some help me out on... (9 Replies)
Discussion started by: rajreddy
9 Replies

3. UNIX for Dummies Questions & Answers

To convert multi format file to a readable ascii format

Hi I have a file which has ascii , binary, binary decimal coded,decimal & hexadecimal data with lot of special characters (like öƒ.ƒ.„İİ¡Š·œƒ.„İİ¡Š· ) in it. I want to standardize the file into ASCII format & later use that as source . Can any one suggest a way a logic to convert such... (5 Replies)
Discussion started by: gaur.deepti
5 Replies

4. UNIX for Dummies Questions & Answers

Convert UNIX file format to PC format

Hi All, Is there any way to convert a file which is in UNIX format to a PC format.... Flip command can be used , apart form this command can we have any other way.... like usinf "awk" etc ..... main purpose of not using flip is that my Kshell doesnot support this comamnd.... (1 Reply)
Discussion started by: Samtel
1 Replies

5. Programming

awk script to convert a text file into csv format

hi...... thanks for allowing me to start a discussion i am collecting usb usage details of all users and convert it into csv files so that i can export it into some database.. the input text file is as follows:- USB History Dump by nabiy (c)2008 (1) --- Kingston DataTraveler 130 USB... (2 Replies)
Discussion started by: certteam
2 Replies

6. Shell Programming and Scripting

convert the output in table format

Hi All, I have a output like below values val1=test.com val2=10.26.208.11 val3=en1 val4=test-priv1.com val5=192.168.3.4 val6=en2 val7=test-priv2.com val8=192.168.4.4 val9=en3 val10=test-vip.com val11=10.26.208.9 val12=$val3 I want to convet this output values into below... (1 Reply)
Discussion started by: kamauv234
1 Replies

7. Shell Programming and Scripting

awk to convert table-by-row to matrix table

Hello, I need some help to reformat this table-by-row to matrix? infile: site1 A:o,p,q,r,s,t site1 C:y,u site1 T:v,w site1 -:x,z site2 A:p,r,t,v,w,z site2 C:u,y site2 G:q,s site2 -:o,x site3 A:o,q,s,t,u,z site3 C:y site3 T:v,w,x site3 -:p,routfile: SITE o p q r s t v u w x y... (7 Replies)
Discussion started by: yifangt
7 Replies

8. Shell Programming and Scripting

Convert rows into columns and create table with awk

Hello I've four fields . They are First Name, Last Name, Age, Country. So when I run a Unix command, I get below output with these fields comes every time in different order as you can see. Some times first name is the first row and other time last name is first row in the output and etc etc..... (9 Replies)
Discussion started by: rprpr
9 Replies
IDFETCH(1)						     NCBI Tools User's Manual							IDFETCH(1)

NAME
idfetch - retrieve biological data from the NCBI ID1 server SYNOPSIS
idfetch [-] [-F str] [-G filename] [-Q filename] [-c N] [-d str] [-e N] [-f str] [-g N] [-i N] [-l filename] [-n] [-o filename] [-q str] [-s str] [-t N] DESCRIPTION
idfetch is a client for NCBI's ID1 server, which contains a large database of annotated biological sequences. OPTIONS
A summary of options is included below. - Print usage message -F str Add the specified feature types (comma-delimited); allowed values are CDD, SNP, SNP_graph, MGC, HPRD, STS, tRNA, and microRNA. -G filename File with list of GIs, (versioned) accessions, FASTA SeqIDs to dump -Q filename Generate GI list by Entrez query in filename; requires -dn or -dp. -c N Max complexity: 0 get the whole blob (default) 1 get the bioseq of interest 2 get the minimal bioseq-set containing the bioseq of interest 3 get the minimal nuc-prot containing the bioseq of interest 4 get the minimal pub-set containing the bioseq of interest -d str Database to use (with -q, can be either n for nucleotides or p for proteins). -e N Entity number (retrieval number) to dump -f str Flattened SeqId. Possible formats: type([name][,[accession][,[release][,version]]]) as '5(HUMHBB)' type=accession type:number (type is a number indicating the ASN.1 Seq-id subtype.) -g N GI id for single Entity to dump -i N Type of lookup: 0 get Seq-entry (default) 1 get GI state (output to stderr) 2 get SeqIds 3 get GI history (sequence change only) 4 get revision history (any change to ASN.1) -l filename Log file -n Output only the list of GIs (with -q and -Q). -o filename Filename for output (default = stdout) -q str Generate gi list by Entrez query. Requires -dn or -dp. -s str FASTA style SeqId ENCLOSED IN QUOTES. Formats: lcl|int or str bbs|int bbm|int gb|acc|loc emb|acc|loc pir|acc|name sp|acc|name pat|country|patent|seq gi|int dbj|acc|loc prf|acc|name pdb|entry|chain -t N Output type: 1 text ASN.1 (default) 2 binary ASN.1 3 GenBank (Seq-entry only) 4 GenPept (Seq-entry only) 5 FASTA (table for history) 6 quality scores (Seq-entry only) 7 Entrez DocSums 8 FASTA reverse complement AUTHOR
The National Center for Biotechnology Information. NCBI
2011-09-02 IDFETCH(1)
All times are GMT -4. The time now is 12:35 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy