06-04-2012
Linux in Big Data projects
Hey guys, we will be interested in learning from your experience in using Linux in Big Data projects. Has anyone used Hadoop, or MapR or Horton Works on Linux and any experiences you may have had on these. I am more interested in knowing if a certain distribution of Linux is better supported for Hadoop and why? Also would like to know if anyone is using Gluster, and if so, are there any other alternatives similar to Gluster?
7 More Discussions You Might Find Interesting
1. UNIX for Dummies Questions & Answers
i have some problem in linux booting
will u please help me
the problem is
i was using federo core 1 on my system
everything was fine
i made one entry in /etc/fstab file for accessing E
drive of WINDOWS XP
in that i had given file system as VFAT after
rebooting system it
was not... (1 Reply)
Discussion started by: great_indian
1 Replies
2. Shell Programming and Scripting
Morning guys. Another day another question. :rolleyes:
I am knocking up a script to pull some data from a file. The problem is the file is very big (up to 1 gig in size), so this solution:
for results in `grep "^\
... works, but takes ages (we're talking minutes) to run. The data is held... (8 Replies)
Discussion started by: dlam
8 Replies
3. Shell Programming and Scripting
How to cut data from big file
my file around 30 gb
I tried "head -50022172 filename > newfile.txt ,and tail -5454283 newfile.txt. It's slowy.
afer that I tried sed -n '46467831,50022172p' filename > newfile.txt ,also slow
Please recommend me , faster command to cut some data from... (4 Replies)
Discussion started by: almanto
4 Replies
4. Shell Programming and Scripting
Hi,
I did read a few posts on the subjects, tried out a few solutions, but did not solve my problem.
https://www.unix.com/302121568-post11.html
https://www.unix.com/shell-programming-scripting/137953-large-file-columns-into-rows-etc-4.html
Please help. Problem very similar to the second link... (15 Replies)
Discussion started by: genehunter
15 Replies
5. Shell Programming and Scripting
Hello,
I have a big data file (160 MB) full of records with pipe(|) delimited those fields. I`m sorting the file on the first field.
I'm trying to sort with "sort" command and it brings me 6 minutes.
I have tried with some transformation methods in perl but it results "Out of memory". I was... (2 Replies)
Discussion started by: rubber08
2 Replies
6. Shell Programming and Scripting
The dataset I'm working on is about 450G, with about 7000 colums and 30,000,000 rows.
I want to extract about 2000 columns from the original file to form a new file.
I have the list of number of the columns I need, but don't know how to extract them.
Thanks! (14 Replies)
Discussion started by: happypoker
14 Replies
7. What is on Your Mind?
Hello,
I have been working as Solaris/Linux Admin since past 8 years. I am looking options for my profile change, but there is some limitation. I worked as 24x7 support for admin, server support, high availability, etc. But been worked on developing side and scripting part.
When I search for Big... (2 Replies)
Discussion started by: nightup2222
2 Replies
LEARN ABOUT CENTOS
mount.glusterfs
GlusterFS(8) Gluster Inc. GlusterFS(8)
NAME
mount.glusterfs - script to mount native GlusterFS volume
SYNOPSIS
mount -t glusterfs [-o <options>] <volumeserver>:<volumeid> <mountpoint>
mount -t glusterfs
[-o <options>] <path/to/volumefile> <mountpoint>
DESCRIPTION
This tool is part of glusterfs(8) package, which is used to mount using GlusterFS native binary.
mount.glusterfs is meant to be used by the mount(8) command for mounting native GlusterFS client. This subcommand, however, can also be
used as a standalone command with limited functionality.
OPTIONS
Basic options
log-file=LOG-FILE
File to use for logging [default:/var/log/glusterfs/glusterfs.log]
log-level=LOG-LEVEL
Logging severity. Valid options are TRACE, DEBUG, WARNING, ERROR, CRITICAL INFO and NONE [default: INFO]
ro Mount the filesystem read-only
Advanced options
volfile-id=KEY
Volume key or name of the volume file to be fetched from server
transport=TRANSPORT-TYPE
Transport type to get volume file from server [default: tcp]
volume-name=VOLUME-NAME
Volume name to be used for MOUNT-POINT [default: top most volume in VOLUME-FILE]
direct-io-mode=disable
Disable direct I/O mode in fuse kernel module
FILES
/etc/fstab
A typical GlusterFS entry in /etc/fstab looks like below
server1.gluster.com:mirror /mnt/mirror glusterfs log-file=/var/log/mirror.vol,ro,defaults 0 0
/etc/mtab
An example entry of a GlusterFS mountpoint in /etc/mtab looks like below
mirror.vol /mnt/glusterfs fuse.glusterfs rw,allow_other,default_permissions,max_read=131072 0 0
SEE ALSO
glusterfs(8), mount(8), gluster(8)
COPYRIGHT
Copyright(c) 2006-2011 Gluster, Inc. <http://www.gluster.com>
18 March 2010 Cluster Filesystem GlusterFS(8)