Abstract: A reliable resume-job matching system helps a company find suitable candidates from a pool of resumes, and helps a job seeker find relevant jobs from a list of job posts. However, since job seekers apply only to a few jobs, interaction records in resume-job datasets are sparse. Different from many prior work that use complex modeling techniques, we tackle this sparsity problem using data augmentations and a simple contrastive learning approach. ConFit first creates an augmented resume-job dataset by paraphrasing specific sections in a resume or a job post. Then, ConFit uses contrastive learning to further increase training samples from $B$ pairs per batch to $O(B^2)$ per batch. We evaluate ConFit on two real-world datasets and find it outperforms prior methods (including BM25 and OpenAI text-ada-002) by up to 19% and 31% absolute in nDCG@10 for ranking jobs and ranking resumes, respectively.
Paper Type: long
Research Area: NLP Applications
Contribution Types: NLP engineering experiment
Languages Studied: English, Chinese
Preprint Status: There is a non-anonymous preprint (URL specified in the next question).
A1: yes
A1 Elaboration For Yes Or No: 8
A2: yes
A2 Elaboration For Yes Or No: 8, 9
A3: yes
A3 Elaboration For Yes Or No: 1
B: yes
B1: yes
B1 Elaboration For Yes Or No: 4
B2: yes
B2 Elaboration For Yes Or No: 8, Appendix A
B3: yes
B3 Elaboration For Yes Or No: 4.1, 8
B4: yes
B4 Elaboration For Yes Or No: 8, Appendix A
B5: yes
B5 Elaboration For Yes Or No: 4.1, Appendix A
B6: yes
B6 Elaboration For Yes Or No: 4.1, Appendix A
C: yes
C1: yes
C1 Elaboration For Yes Or No: 4.3, Appendix D
C2: yes
C2 Elaboration For Yes Or No: 4.3, Appendix C, Appendix D
C3: yes
C3 Elaboration For Yes Or No: 4.5
C4: yes
C4 Elaboration For Yes Or No: Appendix A, Appendix E
D: no
D1: n/a
D2: n/a
D3: n/a
D4: n/a
D5: n/a
E: no
E1: n/a
0 Replies
Loading