Sponsored Content
Full Discussion: Remove lines with n columns
Top Forums Shell Programming and Scripting Remove lines with n columns Post 302095790 by Krispy on Friday 10th of November 2006 04:22:50 AM
Old 11-10-2006
Remove lines with n columns

Hi folks - hope you are all well.

I am trying to perform some pre-processing on a data file, to make sure it is in a valid format before performing a data upload.

Each row of data in the file should consist of 10 comma delimited fields.

Can anyone advise me of a sed/awk command that might check the file and remove any lines that are not equal to 10 fields in length?

Thanks in advance.
 

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Remove lines, Sorted with Time based columns using AWK & SORT

Hi having a file as follows MediaErr.log 84 Server1 Policy1 Schedule1 master1 05/08/2008 02:12:16 84 Server1 Policy1 Schedule1 master1 05/08/2008 02:22:47 84 Server1 Policy1 Schedule1 master1 05/08/2008 03:41:26 84 Server1 Policy1 ... (1 Reply)
Discussion started by: karthikn7974
1 Replies

2. Shell Programming and Scripting

Single command for add 2 columns and remove 2 columns in unix/performance tuning

Hi all, I have created a script which adding two columns and removing two columns for all files. Filename: Cust_information_1200_201010.txt Source Data: "1","Cust information","123","106001","street","1-203 high street" "1","Cust information","124","105001","street","1-203 high street" ... (0 Replies)
Discussion started by: onesuri
0 Replies

3. Shell Programming and Scripting

remove blank lines and merge lines in shell

Hi, I'm not a expert in shell programming, so i've come here to take help from u gurus. I'm trying to tailor a csv file that i got to make it work for the LOAD FROM command. I've a datatable csv of the below format - --in file format xx,xx,xx ,xx , , , , ,,xx, xxxx,, ,, xxx,... (11 Replies)
Discussion started by: dvah
11 Replies

4. UNIX for Dummies Questions & Answers

remove duplicate lines based on two columns and judging from a third one

hello all, I have an input file with four columns like this with a lot of lines and for example, line 1 and line 5 match because the first 4 characters match and the fourth column matches too. I want to keep the line that has the lowest number in the third column. So I discard line 5.... (5 Replies)
Discussion started by: TheTransporter
5 Replies

5. Shell Programming and Scripting

Remove nullable columns in lines

Hi Every one, my requirement is to remove the null columns in line, comma delimiter used For example, A,11,20,30,,,,,,,,,,,,,,,,,,,,,,,,,,,,,, B1,,,,,, gem,plum,kite,,,,gud,bad,,,,,,,,,,,,, B2,kiing,kong,height,,,,,,,,,,,,,,,,,,,,,,,,,rak,,,,,,,,,,,,, B1,,,,,,... (9 Replies)
Discussion started by: skpshell
9 Replies

6. Shell Programming and Scripting

Two files, remove lines from second based on lines in first

I have two files, a keepout.txt and a database.csv. They're unsorted, but could be sorted. keepout: user1 buser3 anuser19 notheruser27 database: user1,2343,"information about",field,blah,34 user2,4231,"mo info",etc,stuff,43 notheruser27,4344,"hiya",thing,more thing,423... (4 Replies)
Discussion started by: esoffron
4 Replies

7. Shell Programming and Scripting

Remove lines with unique information in indicated columns

Hi, I have the 3-column, tab-separated following data: dot is-big 2 dot is-round 3 dot is-gray 4 cat is-big 3 hot in-summer 5 I want to remove all of those lines in which the values of Columns 1 and 2 are identical. In this way, the results would be as follows: dot is-big 2 cat... (4 Replies)
Discussion started by: owwow14
4 Replies

8. Shell Programming and Scripting

Remove lines that are subsets of other lines in File

Hello everyone, Although it seems easy, I've been stuck with this problem for a moment now and I can't figure out a way to get it done. My problem is the following: I have a file where each line is a sequence of IP addresses, example : 10.0.0.1 10.0.0.2 10.0.0.5 10.0.0.1 10.0.0.2... (5 Replies)
Discussion started by: MisterJellyBean
5 Replies

9. Shell Programming and Scripting

Merging multiple lines to columns with awk, while inserting commas for missing lines

Hello all, I have a large csv file where there are four types of rows I need to merge into one row per person, where there is a column for each possible code / type of row, even if that code/row isn't there for that person. In the csv, a person may be listed from one to four times... (9 Replies)
Discussion started by: RalphNY
9 Replies

10. Shell Programming and Scripting

awk to remove lines that do not start with digit and combine line or lines

I have been searching and trying to come up with an awk that will perform the following on a converted text file (original is a pdf). 1. Since the first two lines are (begin with) text they are removed 2. if $1 is a number then all text is merged (combined) into one line until the next... (3 Replies)
Discussion started by: cmccabe
3 Replies
deb-changes(5)							    dpkg suite							    deb-changes(5)

NAME
deb-changes - Debian changes file format SYNOPSIS
filename.changes DESCRIPTION
Each Debian upload is composed of a .changes control file, which contains a number of fields. Each field begins with a tag, such as Source or Binary (case insensitive), followed by a colon, and the body of the field. Fields are delimited only by field tags. In other words, field text may be multiple lines in length, but the installation tools will generally join lines when processing the body of the field (except in case of the multiline fields Description, Changes, Files, Checksums-Sha1 and Checksums-Sha256, see below). The control data might be enclosed in an OpenPGP ASCII Armored signature, as specified in RFC4880. FIELDS
Format: format-version (required) The value of this field declares the format version of the file. The syntax of the field value is a version number with a major and minor component. Backward incompatible changes to the format will bump the major version, and backward compatible changes (such as field additions) will bump the minor version. The current format version is 1.8. Date: release-date (required) The date the package was built or last edited. It must be in the same format as the date in a deb-changelog(5) entry. The value of this field is usually extracted from the debian/changelog file. Source: source-name [(source-version)] (required) The name of the source package. If the source version differs from the binary version, then the source-name will be followed by a source-version in parenthesis. This can happen when the upload is a binary-only non-maintainer upload. Binary: binary-package-list (required) This folded field is a space-separated list of binary packages to upload. Architecture: arch-list Lists the architectures of the files currently being uploaded. Common architectures are amd64, armel, i386, etc. Note that the all value is meant for packages that are architecture independent. If the source for the package is also being uploaded, the special entry source is also present. Architecture wildcards must never be present in the list. Version: version-string (required) Typically, this is the original package's version number in whatever form the program's author uses. It may also include a Debian revision number (for non-native packages). The exact format and sorting algorithm are described in deb-version(7). Distribution: distributions (required) Lists one or more space-separated distributions where this version should be installed when it is uploaded to the archive. Urgency: urgency (recommended) The urgency of the upload. The currently known values, in increasing order of urgency, are: low, medium, high, critical and emergency. Maintainer: fullname-email (required) Should be in the format "Joe Bloggs <jbloggs@example.org>", and is typically the person who created the package, as opposed to the author of the software that was packaged. Changed-By: fullname-email Should be in the format "Joe Bloggs <jbloggs@example.org>", and is typically the person who prepared the package changes for this release. Description: (recommended) binary-package-name - binary-package-summary This multiline field contains a list of binary package names followed by a space, a dash ('-') and their possibly truncated short descriptions. Closes: bug-number-list A space-separated list of bug report numbers that have been resolved with this upload. The distribution archive software might use this field to automatically close the referred bug numbers in the distribution bug tracking system. Binary-Only: yes This field denotes that the upload is a binary-only non-maintainer build. It originates from the binary-only=yes key/value from the changelog matadata entry. Built-For-Profiles: profile-list This field specifies a whitespace separated list of build profiles that this upload was built with. Changes: (required) changelog-entries This multiline field contains the concatenated text of all changelog entries that are part of the upload. To make this a valid multiline field empty lines are replaced with a single full stop ('.') and all lines are indented by one space character. The exact content depends on the changelog format. Files: (required) md5sum size section priority filename This multiline field contains a list of files with an md5sum, size, section and priority for each one. The first line of the field value (the part on the same line as the field name followed by a colon) is always empty. The content of the field is expressed as continuation lines, one line per file. Each line consists of space-separated entries describing the file: the md5sum, the file size, the file section, the file priority, and the file name. This field lists all files that make up the upload. The list of files in this field must match the list of files in the other related Checksums fields. Checksums-Sha1: (required) Checksums-Sha256: (required) checksum size filename These multiline fields contain a list of files with a checksum and size for each one. These fields have the same syntax and differ only in the checksum algorithm used: SHA-1 for Checksums-Sha1 and SHA-256 for Checksums-Sha256. The first line of the field value (the part on the same line as the field name followed by a colon) is always empty. The content of the field is expressed as continuation lines, one line per file. Each line consists of space-separated entries describing the file: the checksum, the file size, and the file name. These fields list all files that make up the upload. The list of files in these fields must match the list of files in the Files field and the other related Checksums fields. BUGS
The Files field is inconsistent with the other Checksums fields. The Change-By and Maintainer fields have confusing names. The Distribution field contains information about what is commonly referred to as a suite. SEE ALSO
deb-src-control(5), deb-version(7). 1.19.0.5 2018-04-16 deb-changes(5)
All times are GMT -4. The time now is 05:32 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy