A Collection of Text Processing Tools in Linux Shell

A Collection of Text Processing Tools in Linux Shell

(Click the public account above to quickly follow) From: cnblogs Link: http://www.cnblogs.com/me115/p/3427319.html This article will introduce the most commonly used tools for text processing in Shell under Linux: find, grep, xargs, sort, uniq, tr, cut, paste, wc, sed, awk; The examples and parameters provided are the most commonly used and practical; My principle for using … Read more

The Power of sed Command in Linux: Replacement Techniques

The Power of sed Command in Linux: Replacement Techniques

The replacement function of sed is very powerful and is also the most common use of sed. By default, sed does not actually modify the file. If you want to write the modified content back to the file, you need to add the -i option. Command syntax: sed ‘[address range|pattern range] s#[keyword to be replaced]#[replacement … Read more

The Three Musketeers of Linux: Mastering Regular Expressions, AWK, and Sed

The Three Musketeers of Linux: Mastering Regular Expressions, AWK, and Sed

Regular Expressions Basic regular expression ( ^, s, ., [], *) metacharacters Linux regular expressions are tools used for processing large amounts of strings, defining a set of rules and methods to match specific text patterns. Regular expressions consist of ordinary characters (such as letters a to z) and special characters (also known as metacharacters) … Read more

C++ Competition Daily Problem – Day 704

C++ Competition Daily Problem - Day 704

Today is the 704th day of learning programming with the cool rain! Hello, everyone! This is the problem from the GESP Level 3 Examination. Day 704 GESP Level 3 Examination in March 2025 Problem 2: Word Frequency Count Problem Description In text processing, counting the frequency of words is a common task. Now, given n … Read more

The Three Musketeers of Linux Commands – Using sed for Search and Append

The Three Musketeers of Linux Commands - Using sed for Search and Append

Introduction to sed sed is short for Stream Editor, also known as a stream editor. sed works like an assembly line in a workshop, where the characters to be processed are the raw materials on the assembly line, and after processing by sed, they become the finished goods. sed is mainly used in the following … Read more

A 10-Year Operations Veteran’s Linux Command Arsenal

A 10-Year Operations Veteran's Linux Command Arsenal

Click the blue “Most Programmer” to follow me! Add a “star“, every day at 18:03 to learn technology together Redirection Standard input stdin: code is 0, use < or <<Standard output stdout: code is 1, use > or >>Standard error output stderr: code is 2, use 2> or 2>>Special syntax: write both stdout and stderr … Read more

Understanding Grep Command in Linux

Understanding Grep Command in Linux

In Linux, the grep command is used for text searching. Whether processing logs, filtering files, or finding specific strings in a code repository, grep can perform remarkably well. 1. Basic Syntax The basic format of the grep command is:<span>grep [options] 'search pattern' [file]</span>. For example, to search for the word “linux” in the <span>run.log</span> file, … Read more

SpaCy – The Expert in Language Processing, An Industrial-Grade NLP Library

SpaCy - The Expert in Language Processing, An Industrial-Grade NLP Library

Introduction In the industrial applications of natural language processing (NLP), SpaCy serves as an experienced language processing expert. It provides Python developers with efficient and precise NLP solutions capable of handling large-scale and high-demand text processing tasks. Whether processing vast amounts of documents, building intelligent search engines, or developing complex language-related applications, SpaCy demonstrates exceptional … Read more

Using Linux Text Processing Tools for Subdomain Search

Using Linux Text Processing Tools for Subdomain Search

Application Background In practical scenarios, we often need to collect subdomains of target sites, usually using tools like subdomain excavators, oneforall, etc., or space search engines like fofa, shodan, and eagle map, or Google syntax. Besides the conventional methods, we can also utilize the Linux text processing tools we learned earlier to collect subdomains. Steps … Read more