Bash selection of files with similar name


Login or Register for Dates, Times and to Reply

 
Thread Tools Search this Thread
Top Forums UNIX for Beginners Questions & Answers Bash selection of files with similar name
# 1  
Bash selection of files with similar name

Hi all,
This is my first day on Linux shell!!!

So, I am trying to write a script that that will pick up pairs of files with the same name (not the same content) but that are different in one character (one is *R1 the other is *R2)...

Something like: look ate the files, whenever they are the same except in the R1/R2, than run ...

The if that I tried is I think is going into the the file content rather than the name....


Did this make any sense?



Thanks a lot.



This is what I was able to do so far....
Code:
#!/bin/bash

file1_path="~/All_files/"
var1=S*R1.fastq.gz
echo $var1 

file2_path="~/All_files/"
var2=S*R2.fastq.gz  
echo $var2


if ["${VAR1:0:18}" = "${VAR2:0:18}"] ; then


Moderator's Comments:
Mod Comment
Please wrap all code, files, input & output/errors in CODE tags.
It makes it far easier to read and preserves spaces for indenting and/or fixed-width data.

Last edited by rbatte1; 1 Week Ago at 11:53 AM..
# 2  
Welcome to the forum, firstly some housekeeping
...
To keep the forums high quality for all users, please take the time to format your posts correctly.

First of all, use Code Tags when you post any code or data samples so others can easily read your code. You can easily do this by highlighting your code and then clicking on the # in the editing menu. (You can also type code tags [code] and [/code] by hand.)



Second, avoid adding color or different fonts and font size to your posts. Selective use of color to highlight a single word or phrase can be useful at times, but using color, in general, makes the forums harder to read, especially bright colors like red.

Third, be careful when you cut-and-paste, edit any odd characters and make sure all links are working property.

Thank You.

The UNIX and Linux Forums
...

The variables var1 and var2 actually contain the literal strings you have assigned to them (i.e S*R1.fastq.gz and S*R2.fastq.gz) these are expanded by the shell when they are build into the echo command. To demonstrate:
Code:
$ var1=S*R1.fastq.gz
$ touch SecondR1.fastq.gz
$ echo "$var1"
S*R1.fastq.gz
$ echo $var1
SecondR1.fastq.gz
$ echo "${var1:1:1}"
*

See how using double quotes protects the string from being expanded by the shell. You could do something like this to have the shell expand the wildcard:
Code:
$ var1=S*R1.fastq.gz
$ touch SecondR1.fastq.gz
$ var1=$(echo $var1)
$ echo "$var1"
SecondR1.fastq.gz
$ echo ${var1:6:2}
R1

--- Post updated at 08:13 AM ---

One approach to this problem is to populate an array with the expanded filenames and then step thru it comparing with the next entry like this:
Code:
REST=.fastq.gz
file=( S*R[12]$REST )

for((i=0;i<${#file[@]};i++))
do
    if [ "${file[i]/%R1$REST/R2$REST}" = "${file[i+1]}" ]
    then
        echo ${file[i]} and ${file[i+1]} are similar
        ((i++))
    fi
done


Last edited by Chubler_XL; 2 Weeks Ago at 06:27 PM.. Reason: Updated search/replace to avoid matching on R1 in early part of filename
# 3  
Thank you very much for the help.
I am sorry about missing out on the rules of posting. I just focused on the "no smoking"one!!! Smilie
I am missing out on a lot!!!!! I have found a few but do you have any recommendation on a website that can take me step by step on programming on unix?



Thanks once again.
# 4  
You may find this on-line version of "The Are of Unix Programming" useful:

The Art of Unix Programming

A quick google search for "Bash tips and tricks" turned up this fairly good article on bash:

101 Bash Commands and Tips for Beginners to Experts - DEV Community 👩*💻👨*💻

I pretty much learnt awk programming from this site . Initially I was amazed how easily awk could be used to solve problems posted here. I worked thru the solutions proposed by others and used google and other on-line resources to nut out what was going on. I did start with a lot of programming experience in other languages, which gave me a head start. Start by understanding the supplied solutions and work up to figuring out your own answers without looking at how others have done it first. Compare what you have come up with to the supplied answer. It's fairly important that you have access to a system you can trial and debug solutions on.
Login or Register for Dates, Times and to Reply

Previous Thread | Next Thread
Thread Tools Search this Thread
Search this Thread:
Advanced Search

Test Your Knowledge in Computers #165
Difficulty: Easy
Tanenbaum originally developed MINIX for compatibility with Apple Computers.
True or False?

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Vlookup using awk non similar files

I need to vlookup and check the server not found. Source file 1 server1 server2 server3 server4 server5_root server6_silver server7 server7-test server7-temp Source file 2 server1_bronze (6 Replies)
Discussion started by: ranjancom2000
6 Replies

2. Solaris

Getting similar lines in two files

Hi, I need to compare the /etc/passwd files from 2 servers, and extract the users that are similar in these two files. I sorted the 2 files based on the user IDs (UID) (3rd column). I first sorted the files using the username (1st column), however when I use comm to compare the files there is no... (1 Reply)
Discussion started by: anaigini45
1 Replies

3. Shell Programming and Scripting

Bash/shell merge similar lines

Hello, I've been working on a bash script to parse through firewall logs (cisco). I'm nearing the end and have a dilemma. My data looks as such (actual data is several gigs worth of logs - without the headers): sourceIP destinationIP destinationProtocol destinationPort 1.1.1.1 2.2.2.2 ... (2 Replies)
Discussion started by: umang2382
2 Replies

4. Shell Programming and Scripting

Yes or No selection within bash function

I need to add a selection within the bash function below and am having some trouble doing so. phox2b() { printf "\n\n" printf "What is the id of the patient getting Phox2B analysis : "; read id printf "Is this an intronic variant? Y/N "; read match_choice case... (5 Replies)
Discussion started by: cmccabe
5 Replies

5. Shell Programming and Scripting

Looking to find files that are similar.

Hello all, I have a server that is running AIX, running a tool that converts various printstreams (AFP/Metadata) to PDF. This is done using a rexx script and an off the shelf utility. Each report (there's around 125) uses a certain script file, it's basically a text file. I am trying... (5 Replies)
Discussion started by: jeffs42885
5 Replies

6. Shell Programming and Scripting

compare the similar files

I got many pair files, which only have small difference, such as more space, or more empty line, and some unreadable characters. If list by commend "diff", I can see many many difference. So I'd like to write a script to compare the pair files, if 95% contents are same, I will think they are... (2 Replies)
Discussion started by: rdcwayx
2 Replies

7. Shell Programming and Scripting

Paging/Scrolling a List of files for selection

Hi All, I have the following script and would like to add page up and down functionality to be able to stroll through the list. Is this possible? If so, Any pointers to examples, info, tips, suggestions would be appreciated. BTW, Any suggestion for better practices on the following code... (5 Replies)
Discussion started by: Festus Hagen
5 Replies

8. Shell Programming and Scripting

selection of context in files

I have a file like this QUEUE: <ITEM(69)> "/NLA///ACHO_EQU_IDX" Q_KEY: <ITEM(69)> "/NLA///ACHO_EQU_IDX" Q_TYPE: <VSTR(32)> "GEN_VSTR_INDEX" ... (1 Reply)
Discussion started by: manas_ranjan
1 Replies

9. Shell Programming and Scripting

Comparing similar columns in two different files

Hi, I have two text files.The first and the 2nd file have data in the same format For e.g. The first file has table_name1 column1 sum(column1) max(column1) min(column1) table_name1 column2 sum(column2) max(column2) min(column2) table_name1 coulmn3 sum(column3) max(column3) min(column3) ... (13 Replies)
Discussion started by: ragavhere
13 Replies

10. UNIX for Advanced & Expert Users

selection of files based on its types

Daily we are getting some datafiles to our unix server location FTPIN. Incoming File names will be present in the location "/xyz/test/" as below: "infile_A1_YYYYMMDD", "infile_A2_YYYYMMDD", "infile_B1_YYYYMMDD", "infile_C1_YYYYMMDD" "infile_C2_YYYYMMDD" Where A, B and C are the... (3 Replies)
Discussion started by: ganapati
3 Replies

Featured Tech Videos