MailSpreader
π§ Automate LinkedIn-based mailings
The MailSpreader is an open-source Python project that automates the creation of a structured spreadsheet used for targeted email campaigns via Thunderbird. It collects and processes company information from LinkedIn, extracts relevant details, and formats them into a mailing-ready table.
π― Highlights
π‘ Feature | π Description |
---|---|
π― Targeted Lead Scraping | Collect LinkedIn profiles filtered by topic and location |
π Smart Data Extraction | Get company websites and summaries from LinkedIn βAboutβ sections |
π§ Email Retrieval | Use Google to discover emails via domain-based queries |
βοΈ Auto Summary Builder | Convert text into business blurbs: βspecialized inβ¦β |
π Mail-Ready Export | Generate Thunderbird-compatible spreadsheets |
βοΈ Workflow Overview
- Input Criteria: Define target themes and locations.
- Profile Collection: Scrape LinkedIn for matching companies.
- Data Extraction: Retrieve website and βAboutβ content from each profile.
- Email Retrieval: Use Google queries to find contact emails tied to domain names.
- Summary Construction: Parse and rephrase descriptions for clarity and consistency.
- Spreadsheet Generation: Output all data in a tabular format ready for Thunderbird.
π Example Spreadsheet Format
Company Name | Business Summary | |
---|---|---|
Example Corp | specialises in renewable energy solutions | contact@example.com |
TechSoft Ltd | specialises in SaaS for retail management | info@techsoft.io |
π Project Structure
mailspreader/
βββ data/ # Input/output data files
β βββ prospect_list.csv
βββ mailspreader/ # Core package
β βββ linkedin_scraper.py # Functions to scrape LinkedIn profiles
β βββ website_parser.py # Email extraction from websites
β βββ summarizer.py # Summary generation logic
β βββ exporter.py # Spreadsheet generation
βββ assets/ # Images and media for documentation
β βββ img/
βββ run.bat # Windows shell script to run the main program
βββ run.sh # Shell script to run the main program
βββ .gitignore
βββ LICENSE
βββ README.md
βββ requirements.txt
βββ main.py # Entry-point script