Find Jobs
Hire Freelancers

Build Database to Store & Extract Data from Text Files (Easy $$)

$10-300 USD

Completato
Pubblicato più di 5 anni fa

$10-300 USD

Pagato al completamento
**I started this project with another developer who fell ill and had to stop. Some of the notes below are mine and some of the execution methods are his.** My goal… load several email list into a database and later, extract all of the email addresses that have the same email domain as a URL in my domain list. Example domain [login to view URL] [login to view URL] [login to view URL] Example text string chris,jackson,chris@[login to view URL],1234567897 billy,bob,bbob@[login to view URL],84881451 john,doe@[login to view URL],8814 Example saved results chris,jackson,chris@[login to view URL],1234567897 john,doe@[login to view URL],8814 I am looking for any line that has an email that matches a domain within my domain list. So if I have [login to view URL] it will extract every line where there is a @[login to view URL] email address. Domains are one per line. In my list I will have [login to view URL] but when it is being searched, you will code it to search for @[login to view URL] to ensure it pulls a valid email format and not ‘email@[login to view URL]’. IMPORTANT Once imported into the db there will be around 500 million records. I have a list of 400,000 domains that I want to scan against it. To help speed up the process, I will have already purged domains from my email list that I do NOT want to find a match for (mostly free email accounts). NOTES -- The Source files are .csv files that I renamed to .txt for this purpose. -- Some files have 1 column, and some have 3, 4, etc. Instead of trying to build a table to match the columns, we will treat each row of the CSV/text file as a single text string (as a single column) and import the entire row of data. -- Each file is not the same so we cannot assume column 1 will always be email. That is why we will import an entire row of data and store it as a single value within the database. After exporting, I can clean up the data. REQUIREMENTS -- The results need to save frequently instead of waiting until the task is complete. -- It must have a basic UI – no commands. There should be three buttons. 1) Import Data (this allows me to select the file I want to import). 2) Load Domains List (This is the list of URL's I want to find emails for). 3) Match Records (Looks through the DB to find emails strings that have a url that makes a domain in the URL list). -- It is also possible to have just two buttons and when I click Match, it asks me to select the [login to view URL] file. This is the file that has the URL’s I want each email address to include. -- Speed is very important. DEVELOPER NOTES Here are some key points from the previous developer who started it. -- Use C++ -- with some precomputation and indexing we can save a lot of time -- your issue looks mainly like an indexing issue, if you'll index emails with domain name you -- won't have to go through all data every time you search -- I talked with the DB Admin, and we are using nested Queries -- Assuming that we have an index loaded in memory, it should not take longer than a few minutes at most (I expect). -- As for the way we match the domains, we could either use a regular expression filtering out only the domain. If that is too slow, then I will store the address as a subset of the domain which should will definitely work. -- If there are any memory problems, we could use a stream instead of reading in the entire index in one go. This will be slower though, so first I'll try to get the entire index loaded into memory. In your bid, include the following the quoted text in the first line “I reviewed the notes and understand that speed is important to you.” To help me understand that you are the right person for the job, let me know how soon you can start, when you can finish, and how you plan to develop this. The more detail you provide, the more confident that I am that you are the best choice. Make your best bid first as I will not be overpaying for this task. Do not bid the maximum budget amount as my max is lower than that. Thanks!
Rif. progetto: 18341246

Info sul progetto

19 proposte
Progetto a distanza
Attivo 5 anni fa

Hai voglia di guadagnare un po'?

I vantaggi delle offerte su Freelancer

Imposta il tuo budget e le scadenze
Fatti pagare per il lavoro svolto
Delinea la tua proposta
La registrazione e le offerte sui lavori sono gratuite
Assegnato a:
Avatar dell'utente
Hello. I have good skills in .NET, C++ Programming, Database Development, Database Programming, MySQL. I have read your project description carefully and i can do it. I hope to work with you. Contact me please. Thank you.
$50 USD in 3 giorni
4,9 (22 valutazioni)
4,9
4,9
Avatar dell'utente
Hello, I reviewed the details and I understand that speed is important for you. I assume that you will search in millions of rows multiple times. Standard string matching algorithms will be too slow for that purpose. My purposal is using HashTable class in .Net. I had successfully used it for Captcha reading algorithm which scans millions of options to read text. This structure is extremely fast compared to other arrays. Filtering the domain names into a smaller array (hashtable or generic dictionary) first and applying regex secondly will be much faster than using regex or other string comparison methods. Please contact for details, as I will have questions about the use case. Thanks.
$222 USD in 5 giorni
5,0 (9 valutazioni)
4,4
4,4
19 freelance hanno fatto un'offerta media di $180 USD
Avatar dell'utente
Hello? How are you? I have good experiences in "Build Database to Store & Extract Data from Text Files (Easy $$)" as you can see my profile for these (.NET, C++ Programming, Database Development, Database Programming, MySQL). I have been working for 7 yrs in this scope. While we contract and work in our jobs, I will get paid once you have confirmed satisfied result. If I do not deliver satisfied result, I will never get paid from you. We can discuss more details to understand more easily if you have other infos. Hope to work with you. Thank you.
$155 USD in 3 giorni
4,9 (79 valutazioni)
6,6
6,6
Avatar dell'utente
I am expert who understands the value of time. I pride myself in my attention to detail. I am very hard working and aim to deliver in less time than quoted. I want to make you, my employer happy without changing my bid or playing games. If you are looking for a true Freelancer, I am the Right person for you. I am available almost 24-7 and am very responsive. I feel proud that I am a trusted Freelancer who pleases almost every single client. You can rest assure, your work will be delivered well in advance of others, with passion and accuracy. I guarantee you instant communication & responses when you need me. Why choose me? I think every client is the reason for my success. I only take projects which I am sure I can do quickly and at my FIRST BID. I promise to deliver 100% Satisfaction.I have more than 5 year of experience in .NET, C++ Programming, Database Development, Database Programming, MySQL. Please ping me for more discussion.
$300 USD in 10 giorni
4,8 (54 valutazioni)
6,2
6,2
Avatar dell'utente
I reviewed the notes and understand that speed is important to you. Hi, It will be achieved by developing a C# win-forms application with buttons and grid. I have 11 years of experience in professional IT development. I'm a full-stack developer specializing in C#, ASP.NET MVC, SQL Server, SSRS, LINQ, Entity Framework, Web API, WCF, Windows Services, Win-Forms, Bootstrap, Typescript, Angular, Jquery etc. I have used version controls GIT, VSTS, TFS, Bit-bucket and project management tools like JIRA, GIT ZEN HUB, Trello etc. Thanks
$222 USD in 10 giorni
5,0 (26 valutazioni)
6,1
6,1
Avatar dell'utente
I reviewed the notes and understand that speed is important to you. Hi, Nice to meet you. I read your job posting and I am very excited. I think your project is quite simple and I can finish your job in a week. You can check my ability in profile. Please give me a chance to work for you. Let's discuss details in private chat. Thanks.
$300 USD in 10 giorni
4,7 (36 valutazioni)
6,2
6,2
Avatar dell'utente
Hi, I have 14 years of database and software development experience. I can solve the performance problems of the project and complete the rest of the code with the highest quality. Lets talk about the details. Regards Fatih.
$177 USD in 10 giorni
5,0 (14 valutazioni)
5,7
5,7
Avatar dell'utente
“I reviewed the notes and understand that speed is important to you.” Hi, I have developed several such applications in the past for different clients, and can do the same job for you. I can develop this for you using C++ or Pascal, and this will be a Windows application that will work fast! Can deliver in 1 week or less, based on your requirements, with daily updates. Thanks for considering my bid. -Steve
$300 USD in 8 giorni
4,9 (36 valutazioni)
5,7
5,7
Avatar dell'utente
I reviewed the notes and understand that speed is important to you Hello, I'm a web and desktop developer with 8 years of experience in Database Administration & Desktop app, I read your project description. I can do what you want just get in touch with me and we can discuss more about your project. Best Regards Fouad
$10 USD in 15 giorni
4,9 (45 valutazioni)
5,9
5,9
Avatar dell'utente
Hi! My name is Ihor, I will be glad to help you with your project. I specialize in .NET development for 3+ years. Feel free to contact me any time to discuss details.
$172 USD in 10 giorni
5,0 (7 valutazioni)
4,6
4,6
Avatar dell'utente
Hello, I am interested in your project. I can develop you an application which populates your DB with emails as you want . If you are interested please contact me for the details. Here is my relevant skill set: -Undergraduate degree in computer science and engineering -Experience in C++/Java/C# -5 years of experience in web development -3 years of experience as DB admin (Oracle,PostgreSQL,MySQL) cheers.
$111 USD in 7 giorni
5,0 (8 valutazioni)
4,3
4,3
Avatar dell'utente
Hi i am very good at c# and database programming please text me so we can start and will assured quality and in time delivery
$172 USD in 10 giorni
5,0 (11 valutazioni)
3,8
3,8
Avatar dell'utente
Have a good experience and wouldlike to work with you here. High Quality + Fast Speed = Excellent Result + Business Success, this is my working style. I have gone through your Job post and I can understand your job requirement thoroughly. I have a total of 15 years of experience in Web Designing and Development and had completed a number of projects with some great graphics and User Interface so far. I have all the required skills and experience you need for the above Job. I have strong command over: * WordPress, PHP, Wordpress themeing, Plugin Development * Android and IOS all kind of mobile apps development *Responsive theme Design * HTML5, CSS3 , Jquery, Bootsrtap, Git, * Widget Development * Other CMS: Magento, Joomla, Expression Engine, Drupal etc. * I’m honest & trustworthy, dependable & fast learner. * I’ve over 7 years experience in Wordpress Website designing/development. * I am available 40 hours a week for your job. You can be assured of a quality communication and the quality of the work provided from my end. I’m looking forward to hearing from you soon. Thank you for considering my cover letter.
$94 USD in 10 giorni
3,4 (13 valutazioni)
4,5
4,5
Avatar dell'utente
hi i read all requirement and please share more detail i did 2 similar task i will provide 5 star rating work thanks
$300 USD in 10 giorni
4,4 (16 valutazioni)
3,6
3,6
Avatar dell'utente
I am confident I am the right candidate for this project as I have done many similar projects in the past. With years of experience in this field, I believe this project will be very easy for me.
$415 USD in 10 giorni
5,0 (3 valutazioni)
2,8
2,8

Info sul cliente

Bandiera: UNITED STATES
Lexington, United States
4,8
57
Metodo di pagamento verificato
Membro dal apr 6, 2011

Verifica del cliente

Grazie! Ti abbiamo inviato tramite email il link per richiedere il tuo bonus gratuito.
Non è stato possibile inviarti l'email. Riprova per piacere.
di utenti registrati di lavori pubblicati
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Caricamento anteprima
Autorizzazione per la geolocalizzazione concessa.
La tua sessione è scaduta ed è stato effettuato il log out. Accedi nuovamente per piacere.