Fast regex bulk string comparison script in linux (10m records vs blacklist with 100k records )

Closed Posted 5 years ago Paid on delivery
Closed Paid on delivery

Need a script I can call from PHP on ubuntu server, with 2 inputs.

$[login to view URL] = cleaningFunction([login to view URL],[login to view URL]);

Each line is a separate record. Blacklist will contain strings in the formats:

badword

badword*followedbythisword

prefixedwiththisword*badword

* representing any number of characters in between. Will function like an ad blocker blacklist.

Speed is important, show tests before you bid on how fast you can generate $[login to view URL] with a 10m record list against 100k blacklist. Results should include an array of the records triggered, with corresponding line numbers and blacklist terms.

Linux PHP Python Regular Expressions Software Architecture

Project ID: #17726585

About the project

4 proposals Remote project Active 5 years ago

4 freelancers are bidding on average $65 for this job

leandrozhuzhi

Hello, my name is Leandro and i can create your script. I already have a couple similar scripts in place. Since you are using Linux, any particular reason for wanting PHP? This can be best made through a shell script. More

$50 USD in 2 days
(4 Reviews)
2.3
$25 USD in 1 day
(2 Reviews)
2.0