Facebook AI researchers created code search data sets that utilize information from GitHub and Stack Overflow. The release contains an evaluation data set of 287 Stack Overflow question-and-answer pairs including code snippets, as well as a search corpus of code snippets from nearly 25,000 Android repositories on GitHub. spring
「We intend for this data set to serve as a benchmark for evaluating search quality across a variety of code search models,」 Facebook AI said in a blog post.post
The paper also shares results of two AI models created by Facebook as a test run of the corpus and data set.this
Code search is meant to give developers a way to surface chunks of programming language code using natural language. A number of code search initiatives are underway such as GitHub’s Semantic Code Project and machine topplaythai learning initiative and startups like recent Y Combinator graduate Metacode.lua
In other developments in AI for software developers, this spring Google Brain introduced AI that predicts code based on previous edits.3d