
Linux - Only keep certain country data

< FileToExtract.csv awk 'BEGIN {FS=","} tolower($19)~/usa|united states/' > USAList.txt

This keeps only the rows whose country column contains USA or United States. The match is not case sensitive, because the column is converted to lowercase before it is compared.

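$19 refers to column 19, which is the column that holds the country in this example. Change the number to match the country column in your own file.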
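
If you are not sure which column number holds the country, one quick way to check is to list the header row with one field per line. This is only a minimal sketch: it assumes the first line of the file is a header row and that no field contains a quoted comma. The file sample.csv and its column names are made up for illustration.

```shell
# Hypothetical sample file; replace sample.csv with your own .csv
printf 'name,email,country\nAnn,ann@example.com,USA\n' > sample.csv

# Print each header field on its own line, prefixed with its column number
head -n 1 sample.csv | awk -F',' '{ for (i = 1; i <= NF; i++) print i ": " $i }'
```

In this made-up file the country is column 3, so the filter would use $3 instead of $19.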

You can keep multiple countries by adding them to the pattern, separated by |, for example:

/usa|united states|spain|germany/
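
As a rough illustration of the multi-country pattern, the sketch below builds a small made-up file and filters it. The file names sample.csv and KeptCountries.txt are hypothetical, and the country sits in column 3 here rather than column 19. It also assumes that no field contains a quoted comma.

```shell
# Hypothetical sample data; the country is in column 3 in this sketch
printf '%s\n' \
  'Ann,ann@example.com,USA' \
  'Bob,bob@example.com,France' \
  'Eva,eva@example.com,Spain' \
  'Max,max@example.com,GERMANY' \
  'Joe,joe@example.com,United States' > sample.csv

# Keep rows whose third column matches any listed country, ignoring case
awk -F',' 'tolower($3) ~ /usa|united states|spain|germany/' sample.csv > KeptCountries.txt

# Show the rows that were kept (every row except the France one)
cat KeptCountries.txt
```

Note that the pattern matches anywhere inside the column, so /usa/ would also keep a value such as "Busan". If that matters for your data, anchoring the pattern, for example /^(usa|united states|spain|germany)$/, limits it to exact matches.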

