База знаний

Linux - Delete lines with duplicate data

If for example you dont want the same email address to be in your .csv files you would run the following command to
awk '!x[$2]++' FS="," FILENAME.csv > newname.csv

FILENAME.csv needs to be replaced with your .csv file and [$2] needs to be edited to represent the line you want to check for duplicate content.

Этот ответ был полезен?

 Распечатать статью

Также читают

Linux - How to combine multiple .csv files

To combine multiple files with the letters "2014" and "2015" in their name run the following...

Linux - Only keep certain country data

< FileToExtract.csv awk 'BEGIN {FS=","} tolower($19)~/usa|united states/' > USAList.txtThis...

Linux - Delete .csv column

Before you start you need to install Ubuntu - sudo apt-get install libtext-csv-perl CentOs -...

Linux - How to split large file

For this example we will split a large .csv into multiple files each "X" megabytes in size....

Linux - How to extract .gz files (gunzip)

For this guide we will be using gunzip. You may need to installed it.CentOS - yum install gunzip...