Address Data Parser with AI and Nominatim

$750-1500 USD

Closed
Posted almost 10 years ago

Paid on delivery
In this project, we are looking for a developer to create a pure Java, artificial-intelligence-based console application that uses a machine learning library such as Weka ([login to view URL]) in concert with OSM’s Nominatim search library ([login to view URL]) to parse dirty address data from financial transactions.

The purpose of the application is to parse a single line of address data, which may belong to any country, into an output that contains these basic elements:

Name: The name of an entity or individual
Address_Line_1: The first line of a business address, not including a suite or floor number
Address_Line_2: The second line of a business address, which should only include a suite or floor number
City: The city of the address
Subdivision: The state, province, or other country subdivision of the address
Postal_Code: The postal or ZIP code of the address
Country: The country of the address (in 2-character ISO format)

Some addresses in the data set should not be considered parsable. The only addresses considered unparsable are those that do not contain at least a street address (Address_Line_1) and one other address feature.

Success on this project means that the final application properly parses 90% of all parsable addresses in a test data set of 10,000 records.

We will provide sample data in two phases:
1. Initial phase: 1,000 addresses (provided only to the selected developer)
2. Developer testing phase: 10,000 addresses

After developer testing, we will perform our own test on a separate set of 10,000 addresses before accepting the work. We will release the final milestone only once the benchmark performance established above is met.
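The output elements, the parsability rule, and the 90% benchmark described above can be sketched in Java roughly as follows. This is only an illustration under stated assumptions: the class name `AddressRecord` and its methods are hypothetical, Name is assumed not to count as an "address feature", and a record is assumed to be "properly parsed" only when all seven fields match exactly. The posting does not define either of those points.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical container for the seven output elements named in the posting.
class AddressRecord {
    // Order: Name, Address_Line_1, Address_Line_2, City, Subdivision, Postal_Code, Country.
    private final String[] fields;

    AddressRecord(String name, String line1, String line2, String city,
                  String subdivision, String postalCode, String country) {
        fields = new String[] {name, line1, line2, city, subdivision, postalCode, country};
    }

    private static boolean present(String s) {
        return s != null && !s.trim().isEmpty();
    }

    // Parsable: a street address plus at least one other address feature.
    // Assumption: Name (index 0) does not count as an address feature.
    boolean isParsable() {
        if (!present(fields[1])) {
            return false;
        }
        for (int i = 2; i < fields.length; i++) {
            if (present(fields[i])) {
                return true;
            }
        }
        return false;
    }

    // Assumption: "properly parsed" means all seven fields match exactly.
    boolean sameAs(AddressRecord other) {
        return Arrays.equals(fields, other.fields);
    }

    // Fraction of parsable expected records that the parser reproduced.
    // Both lists are assumed to be aligned by index.
    static double accuracy(List<AddressRecord> expected, List<AddressRecord> predicted) {
        int parsable = 0;
        int correct = 0;
        for (int i = 0; i < expected.size(); i++) {
            if (!expected.get(i).isParsable()) {
                continue; // unparsable records are excluded from the benchmark
            }
            parsable++;
            if (expected.get(i).sameAs(predicted.get(i))) {
                correct++;
            }
        }
        return parsable == 0 ? 0.0 : (double) correct / parsable;
    }
}
```

Under these assumptions, the acceptance check would amount to `AddressRecord.accuracy(expected, predicted) >= 0.90` on the 10,000-record test set.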
Examples of this parsing might be as follows:

Example 1: FAKE COMPANYNAME LLC 6421 LAKE WASHINGTON BLVD NE SUITE 101 KIRKLAND WA 98033
Name: Fake Companyname LLC
Address_Line_1: 6421 Lake Washington BLVD NE
Address_Line_2: Suite 101
City: Kirkland
Subdivision: Washington
Postal_Code: 98033
Country: US
In this case, all data was easily parsed; however, the country code of US would need to be appended automatically by the process. The country should be recognized based on the format of the ZIP code and the state code of Washington.

Example 2: SOMEADDRESS, Inc. 51 MOORGATE LONDON EC2R 6BH
Name: SomeAddress, Inc.
Address_Line_1: 51 Moorgate
Address_Line_2: Null
City: London
Subdivision: England
Postal_Code: EC2R 6BH
Country: UK
In this case, most data was easily parsed. The Subdivision and Country were appended, and should have been recognized by the AI process based on the Postal Code and City.

Example 3: RENTEN SERVICE PAYER DEUTSCHE POSTBANK AG KENNEDYALLEE 62 - 70 BONN DE
Name: Renten Service Payer Deutsche Postbank AG
Address_Line_1: Kennedyallee 62-70
Address_Line_2: Null
City: Bonn
Subdivision: North Rhine-Westphalia
Postal_Code: 53175
Country: DE
In this case, the German address required appending the postal code and subdivision, which were obtained from Nominatim after the address was parsed. All other data was present.

Example 4: FRANCOIS HOLLANDE 35 WOODSFORD SQUARE LONDON
Name: Francois Hollande
Address_Line_1: 35 Woodsford Square
Address_Line_2: Null
City: London
Subdivision: England
Postal_Code: W11 4PY
Country: UK
Searching Nominatim for the address string provided, 35 Woodsford Square London, finds that this address exists only in London, UK, which enables population of the Subdivision, Postal Code, and Country.
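The two enrichment steps in these examples (guessing the country from the postal code format and state code, and looking up missing fields in Nominatim) might look something like the Java sketch below. Everything named here is an assumption for illustration: the class `AddressEnrichment`, both method names, the simplified postcode patterns, and the use of the public `nominatim.openstreetmap.org` endpoint rather than a self-hosted instance. It only builds the search URL and does not perform the request. It returns "UK" to match the examples, although ISO 3166-1 alpha-2 assigns GB to the United Kingdom.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLEncoder;
import java.util.Arrays;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;
import java.util.regex.Pattern;

// Hypothetical helper; rule-based fallbacks, not the AI process itself.
class AddressEnrichment {
    // US ZIP: five digits with an optional +4 suffix.
    private static final Pattern US_ZIP = Pattern.compile("\\d{5}(-\\d{4})?");
    // Simplified UK postcode shape (outward code, optional space, inward code).
    private static final Pattern UK_POSTCODE =
            Pattern.compile("[A-Z]{1,2}\\d[A-Z\\d]? ?\\d[A-Z]{2}");
    // The 50 US state codes plus DC.
    private static final Set<String> US_STATES = new HashSet<>(Arrays.asList(
            ("AL AK AZ AR CA CO CT DE FL GA HI ID IL IN IA KS KY LA ME MD "
            + "MA MI MN MS MO MT NE NV NH NJ NM NY NC ND OH OK OR PA RI SC "
            + "SD TN TX UT VT VA WA WV WI WY DC").split(" ")));

    // Returns "US", "UK", or null when the country cannot be guessed.
    static String guessCountry(String postalCode, String stateCode) {
        if (postalCode == null) {
            return null;
        }
        String pc = postalCode.trim().toUpperCase(Locale.ROOT);
        String st = stateCode == null ? "" : stateCode.trim().toUpperCase(Locale.ROOT);
        // Five digits alone are ambiguous (German codes look the same),
        // so a US state code is required as well.
        if (US_ZIP.matcher(pc).matches() && US_STATES.contains(st)) {
            return "US";
        }
        if (UK_POSTCODE.matcher(pc).matches()) {
            return "UK"; // as written in the posting; the ISO 3166-1 alpha-2 code is GB
        }
        return null;
    }

    // Builds a Nominatim free-text search URL that asks for one JSON result
    // with an address breakdown (postcode, state, country_code).
    static String nominatimSearchUrl(String freeText) {
        try {
            return "https://nominatim.openstreetmap.org/search"
                    + "?format=jsonv2&addressdetails=1&limit=1&q="
                    + URLEncoder.encode(freeText, "UTF-8");
        } catch (UnsupportedEncodingException e) {
            throw new IllegalStateException("UTF-8 is always supported", e);
        }
    }
}
```

For Example 1, `guessCountry("98033", "WA")` yields `"US"`, and for Example 4 the lookup string would be passed as `nominatimSearchUrl("35 Woodsford Square London")`. Note that a trailing token such as DE is both the Delaware state code and Germany's country code, so rules like these can only be a fallback next to the trained model.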
Project ID: 5938621

About the project

10 proposals
Remote project
Active 10 yrs ago

10 freelancers are bidding on average $1,321 USD for this job
A proposal has not yet been provided
$1,052 USD in 15 days
4.9 (150 reviews)
7.1
Ready to start!
$750 USD in 5 days
4.7 (53 reviews)
6.0
Hi, I am a McGill student in computer science. I am a very strong coder and I work mainly in Java. I have finished a course called Artificial Intelligence, so I know what you are doing, and the project is very interesting. My goal for this project is to gain experience, and I hope we can work together. Regards, Paul Husek
$750 USD in 7 days
5.0 (2 reviews)
0.3
I am a B. Tech. student from a reputed college and have done my semester projects in AI. You can be assured of good results, and the price is a bit negotiable.
$1,022 USD in 20 days
0.0 (0 reviews)
0.0
A proposal has not yet been provided
$1,250 USD in 20 days
0.0 (0 reviews)
0.0
DO NOT PAY TILL PROJECT IS COMPLETE! Hi, I am Brian from Cyboticx, a leading digital agency specializing in product and brand development and lead generation optimization. Before getting into more details, let's put the following out there: we do not charge before work is complete. We are able to provide references from some of the largest companies in America that we have worked with. We have several branch offices (LA, San Jose, Chicago, and NJ), so you can reach us anytime. We have worked with both large enterprises and small mom-and-pop shops, and we value each client equally. Let me know if you would like us to start working on a proper technical document and proposal for your project! You can view our portfolio at https://www.freelancer.com/u/cyboticx.html
$2,061 USD in 30 days
0.0 (0 reviews)
0.0
Hi, I have worked on a similar project that detects and parses bibliographical records in every web page loaded by the user, providing URLs to locate them in a college library. I can use this experience along with my experience in Weka to tackle the problem. Looking forward to the datasets. Thanks, /S/S/s
$1,500 USD in 20 days
0.0 (0 reviews)
0.0

About the client

SCOTTSDALE, United States
5.0
108
Payment method verified
Member since Jul 10, 2006

Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)