Sponsored Content
Top Forums UNIX for Dummies Questions & Answers Find common numbers from two very large files using awk or the like Post 302799569 by Corona688 on Friday 26th of April 2013 05:19:00 PM
Old 04-26-2013
How long are the lines in these input files?
 

9 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

Get un common numbers from two files

Hi, I have two files: abc : 50040 123123 31703 cde: 104 97 50040 123123 31703 36609 50534 (3 Replies)
Discussion started by: jingi1234
3 Replies

2. Shell Programming and Scripting

To find all common lines from 'n' no. of files

Hi, I have one situation. I have some 6-7 no. of files in one directory & I have to extract all the lines which exist in all these files. means I need to extract all common lines from all these files & put them in a separate file. Please help. I know it could be done with the help of... (11 Replies)
Discussion started by: The Observer
11 Replies

3. UNIX for Dummies Questions & Answers

Grep alternative to handle large numbers of files

I am looking for a file with 'MCR0000000716214' in it. I tried the following command: grep MCR0000000716214 * The problem is that the folder I am searching in has over 87000 files and I am getting the following: bash: /bin/grep: Arg list too long Is there any command I can use that can... (6 Replies)
Discussion started by: runnerpaul
6 Replies

4. Shell Programming and Scripting

Drop common lines at head/tail of a large set of files

Hi! I have a large set of pairs of text files (each pair in their own subdirectory) and each pair shares head/tail (a couple of first and last lines) but differs in the middle part. I need to delete the heads/tails and keep only the middle portions in which they differ. The lengths of heads/tails... (1 Reply)
Discussion started by: dobryden
1 Replies

5. UNIX for Advanced & Expert Users

Find common Strings in two large files

Hi , I have a text file in the format DB2: DB2: WB: WB: WB: WB: and a second text file of the format Time=00:00:00.473 Time=00:00:00.436 Time=00:00:00.016 Time=00:00:00.027 Time=00:00:00.471 Time=00:00:00.436 the last string in both the text files is of the... (4 Replies)
Discussion started by: kanthrajgowda
4 Replies

6. Shell Programming and Scripting

finding common numbers (contents) across 2 or 3 files

I have 3 files which are tab delimited and have numbers in it. file 1 1 2 3 4 5 6 7 File 2 3 5 7 8 File 3 1 (4 Replies)
Discussion started by: Lucky Ali
4 Replies

7. Shell Programming and Scripting

Find common numbers and print yes or no

Hi I have 2 files with following data First file, sp|Q676U5|A16L1_HUMAN, Autophagy-related protein 16-1 OS=Homo sapiens GN=ATG16L1 PE=1 SV=2, Maximum coiled-coil residue probability: 0.657 in position 163. Maximum dimeric residue probability: 0.288 in position 163. ... (1 Reply)
Discussion started by: manigrover
1 Replies

8. Shell Programming and Scripting

Find Common Values Across Two Files

Hi All, I have two files like below: File1 MYFILE_28012012_1112.txt|4 MYFILE_28012012_1113.txt|51 MYFILE_28012012_1114.txt|57 MYFILE_28012012_1115.txt|57 MYFILE_28012012_1116.txt|57 MYFILE_28012012_1117.txt|57 File2 MYFILE_28012012_1110.txt|57 MYFILE_28012012_1111.txt|57... (2 Replies)
Discussion started by: angshuman
2 Replies

9. Shell Programming and Scripting

Find common files between two directories

I have two directories Dir 1 /home/sid/release1 Dir 2 /home/sid/release2 I want to find the common files between the two directories Dir 1 files /home/sid/release1>ls -lrt total 16 -rw-r--r-- 1 sid cool 0 Jun 19 12:53 File123 -rw-r--r-- 1 sid cool 0 Jun 19 12:53... (5 Replies)
Discussion started by: sidnow
5 Replies
MTBL(7) 																   MTBL(7)

NAME
mtbl - immutable sorted string library SYNOPSIS
#include <mtbl.h> gcc [flags] files -lmtbl [libraries] DESCRIPTION
The mtbl library provides interfaces for creating, searching, and merging Sorted String Table (SSTable) files in the MTBL format, which provide an immutable mapping of keys to values. Sorted String Tables are compact and provide fast random access to keys and key ranges. Keys and values are arbitrary byte arrays, and MTBL SSTables may not contain duplicate keys. The six main interfaces provided by the mtbl library are: mtbl_iter(3) Iterator objects provide a consistent interface for iterating over the key-value entries returned by other interfaces. mtbl_source(3) Source objects provide functions for obtaining iterators from an underlying data source. The mtbl_reader and mtbl_merger interfaces provide functions for obtaining references to a source object. The source methods return an mtbl_iter object. mtbl_reader(3) Reader objects provide read-only access to MTBL files. mtbl_writer(3) Writer objects initialize a new MTBL file from a sequence of key-value entries provided by the caller. Keys must be in sorted order based on lexicographical byte value, and keys may not be duplicated. mtbl_merger(3) Merger objects receive multiple sequences of key-value entries from one or more mtbl_source objects and combine them into a single, sorted sequence. The combined, merged output sequence is provided via the mtbl_source interface. mtbl_sorter(3) Sorter objects receive a sequence of key-value entries provided by the caller and return them in sorted order. The caller must provide a callback function to merge values in the case of entries with duplicate keys. The sorted output sequence may be retrieved via the mtbl_iter interface or be dumped to an mtbl_writer object. mtbl_fileset(3) Fileset objects automatically maintain an mtbl_source built on top of the mtbl_merger and mtbl_reader interfaces. The set of underlying mtbl_reader objects is kept synchronized with a "setfile" on disk listing MTBL files. Additionally, several utility interfaces are provided: mtbl_crc32c(3) Calculates the CRC32C checksum of a byte array. mtbl_fixed(3) Functions for fixed-width encoding and decoding of 32 and 64 bit integers. mtbl_varint(3) Functions for varint encoding and decoding of 32 and 64 bit integers. 05/29/2012 MTBL(7)
All times are GMT -4. The time now is 01:59 PM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy