jobId | jobKey | companyName | companyId | companyProfileUrl | jobUrl | jobDate | scrapedSalary | jobTitle | descriptionId | scrapedBenefits | scrapedQualifications | scrapedLocation | source | jobDescriptionClean | jobDescriptionRaw | companyWebsite | companyIndustry | parsedAnnualSalaryAvg | parsedAnnualSalaryMin | parsedAnnualSalaryMax | finalZipcode | finalCity | finalState | parsedPhoneNumber | aiBenefits | aiRemote | aiSeniority | aiEmployment | aiCertifications | aiHardSkills | aiSoftSkills | aiQualifications | aiDegreeLevelMin | aiDegreeLevel | aiSocCode | aiSocTitle | aiNormalizedTitle |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx | xxxxxxxxxxxxxxxx | xxxxxxxxxxxxxxxxxxxx | xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx | xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx | xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx | xxxxxxxxxx | xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx | xxxxxxxxxxxxxxxxxxxxxxxx | xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx | xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx | xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx | xxxxxxxxxxxxxxxxxxx | xxxxxxxxx | xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx | xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx | xxxxxxxxxxxxxxxxx | xxxxxxxxxxxxxxxxxxxx | xxxxx | xxxxx | xxxxx | xxxxx | xxxxxxxxx | xx | xxxxxxxxxx | xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx | xxxxxxx | xxxxxxxxxxx | xxxxxxxxx | xxxxxxxxxxxxxxxxxxxxxxxxxxxx | xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx | 
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx | xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx | xxxxxxxxxxx | xxxxxxxxxxxxxxx | xxxxxxx | xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx | xxxxxxxxxxxxxxxxxxxxxxxx |
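The column names in the sample table above can double as a quick sanity check on a delivered file. The sketch below assumes the data arrives as CSV with a header row, which the listing does not confirm; `EXPECTED_COLUMNS` and `check_header` are hypothetical names introduced here for illustration.

```python
import csv
import io

# Column names copied from the sample table header above.
EXPECTED_COLUMNS = [
    "jobId", "jobKey", "companyName", "companyId", "companyProfileUrl",
    "jobUrl", "jobDate", "scrapedSalary", "jobTitle", "descriptionId",
    "scrapedBenefits", "scrapedQualifications", "scrapedLocation", "source",
    "jobDescriptionClean", "jobDescriptionRaw", "companyWebsite",
    "companyIndustry", "parsedAnnualSalaryAvg", "parsedAnnualSalaryMin",
    "parsedAnnualSalaryMax", "finalZipcode", "finalCity", "finalState",
    "parsedPhoneNumber", "aiBenefits", "aiRemote", "aiSeniority",
    "aiEmployment", "aiCertifications", "aiHardSkills", "aiSoftSkills",
    "aiQualifications", "aiDegreeLevelMin", "aiDegreeLevel", "aiSocCode",
    "aiSocTitle", "aiNormalizedTitle",
]


def check_header(csv_text):
    """Return (missing, unexpected) column names for the first CSV row.

    Assumption: the file is comma-separated with a header row.
    """
    reader = csv.reader(io.StringIO(csv_text))
    header = [name.strip() for name in next(reader, [])]
    missing = [c for c in EXPECTED_COLUMNS if c not in header]
    unexpected = [c for c in header if c not in EXPECTED_COLUMNS]
    return missing, unexpected


# A header containing every expected column yields two empty lists.
missing, unexpected = check_header(",".join(EXPECTED_COLUMNS))
print(missing, unexpected)
```

Checking names rather than positions keeps the check valid if the column order differs between deliveries.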
Description
Detailed Data Dictionary: https://docs.google.com/spreadsheets/d/1JKUYZYPNZfcg5Ol9LTk8fwe5hwiu7c5DSn-3Wia7mo8/edit?gid=1071313126#gid=1071313126
Developed by a seasoned team of ML experts from Google, Meta, and Amazon and alumni of Stanford, Caltech, and Columbia, our AI-powered pipeline provides invaluable insights for HR tech, lead generation, market intelligence, and corporate development. With cutting-edge AI and LLMs, we transform raw job postings into actionable data, analyzing job titles, skills, predicted salaries, locations, and more. Each posting undergoes multi-layered processing, with GPU-driven models delivering daily, weekly, and monthly data for a balanced real-time and historical view.
Our processing pipeline integrates advanced AI models:
- Deduplication Model: Filters out exact and near-duplicates, ensuring unique, high-quality job data.
- Title Taxonomy Model: Categorizes over 20 million titles into 50,000 standardized groups, simplifying analysis.
- Skill & Qualification Models: Extracts hard skills, soft skills, certifications, and degrees, mapped to standard education levels and tailored by job context.
- Job Category Model: Predicts work types (remote, onsite, hybrid), seniority, and employment types (full-time, contract).
- Location Prediction Model: Parses or estimates job locations (ZIP code, city, state) using averaged lat/long values from search-based estimates for accuracy.
- Salary Estimation Model: Predicts minimum, average, and maximum salaries, using parsed and AI-predicted values for a robust salary range.
- Government Classification Models: Assigns SOC codes and titles to postings for structured role insights and regulatory compliance.
- Human Feedback Model: An in-house team of annotators reviews job descriptions and AI outputs, refining model accuracy.
Delivered through S3, FTP, and Google Drive, Canaria’s dataset provides flexibility in integration, with APIs available on request.
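The listing does not say how the Deduplication Model works. As a rough illustration of what near-duplicate filtering can look like, the sketch below uses a common stand-in technique, word shingling with Jaccard similarity. The function names, the shingle size `k=3`, the `0.8` threshold, and the sample strings are all assumptions made for this example, not details of the vendor's pipeline.

```python
import re


def shingles(text, k=3):
    """Return the set of k-word shingles from lowercased alphanumeric tokens."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    if len(words) < k:
        # Short texts become a single shingle (or nothing if empty).
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a, b):
    """Jaccard similarity of two sets; two empty sets count as identical."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def is_near_duplicate(text_a, text_b, threshold=0.8):
    """Flag two postings as near-duplicates when shingle overlap is high.

    The threshold is an illustrative assumption, not a documented value.
    """
    return jaccard(shingles(text_a), shingles(text_b)) >= threshold


# Hypothetical posting texts, for illustration only.
first = "Senior Data Engineer needed to build ETL pipelines in Python and SQL."
repost = "senior data engineer needed to build ETL pipelines in Python and SQL!!"
other = "Registered nurse for night shifts at a regional hospital"

print(is_near_duplicate(first, repost))
print(is_near_duplicate(first, other))
```

Pairwise comparison like this scales quadratically, so at the volumes quoted in this listing a production system would likely need an approximate method such as MinHash with locality-sensitive hashing; that is a general observation, not a claim about this product.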
Combining real-time AI with human validation, Canaria’s data delivers business-ready insights to meet evolving HR and market demands.
Core Industry Applications
- HR & Workforce Analytics: Access insights into salary trends, workforce demographics, and skill demands to drive strategic HR decisions.
- Lead Generation: Identify target leads and hiring needs through granular job postings data.
- Investment & Market Intelligence: Gain insights into competitor hiring strategies and industry shifts.
- Education & Skill Development: Support curriculum development and training programs based on skill trends and emerging job requirements.
- Corporate Development: Align growth strategies with real-time job market data.
- Talent Sourcing: Streamline talent sourcing by identifying active job markets and regions with the highest demand for specific skills.
- Job Market Forecasting: Analyze hiring trends and job postings data to forecast demand for specific roles and skills.
- Economic Research: Provide labor market insights for economic studies, helping to assess job growth and employment shifts by industry.
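As one illustration of the talent-sourcing use case above, the sketch below counts postings per state that mention a given hard skill. It assumes each record is a dict keyed by the dataset's column names and that `aiHardSkills` holds a list of strings; the actual field format is not stated in this listing. The function name and the sample records are hypothetical.

```python
from collections import Counter


def skill_demand_by_state(records, skill):
    """Count postings per finalState whose aiHardSkills include `skill`.

    Matching is case-insensitive and exact on the skill name.
    Assumption: aiHardSkills is a list of strings.
    """
    wanted = skill.strip().lower()
    counts = Counter()
    for rec in records:
        skills = rec.get("aiHardSkills") or []
        state = rec.get("finalState")
        if state and any(s.strip().lower() == wanted for s in skills):
            counts[state] += 1
    # Highest-demand states first.
    return counts.most_common()


# Hypothetical records for illustration only; not real data from the dataset.
sample = [
    {"finalState": "CA", "aiHardSkills": ["Python", "SQL"]},
    {"finalState": "CA", "aiHardSkills": ["python"]},
    {"finalState": "NY", "aiHardSkills": ["Python"]},
    {"finalState": "TX", "aiHardSkills": ["Java"]},
]

print(skill_demand_by_state(sample, "Python"))  # [('CA', 2), ('NY', 1)]
```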
Country Coverage
(1 country)
Data Categories
- Job Postings Data
- LinkedIn Data
- Recruiting Data
- Job Market Data
- Indeed Data
Pricing
One-off purchase | Available |
Monthly License | Available |
Yearly License | Available |
Usage-based | Available |
Volumes
- Historical: 700M
- Monthly Volume: 10M
- Sources: 50K