Abstract
Source code retrieval is a task under text retrieval which is performed by software developers regularly. The existing source code retrieval approaches are regular expression based and anticipate that the software developer querying the code base has an extensive acquaintance with the source code. Unlike keyword or regular expression based source code search which are difficult to remember, software developers should be able to query the code base in a sentential form. Although, performance of the search on text widely depends upon query quality, it succeeds when the quality of the textual query is high. Query quality prediction ahead of query execution on a source code retrieval system will save developers time and effort by notifying him/her when a query is unlikely to perform. This paper assesses the performance of prominent classification algorithms namely Support Vector Machine (SVM), Logistic Regression (LR), Gradient Boosted Tree (GBT) and Decision Tree (DT) to predict the query quality on a data set created from the documentation of the source code files. Experimental results using benchmark open source projects data set demonstrates that Gradient Boosted Tree performs better than others in comparison.
Original language | English |
---|---|
Title of host publication | 2018 International Conference on Advances in Computing, Communications and Informatics, ICACCI 2018 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 1115-1119 |
Number of pages | 5 |
ISBN (Electronic) | 9781538653142 |
DOIs | |
Publication status | Published - 30-11-2018 |
Event | 7th International Conference on Advances in Computing, Communications and Informatics, ICACCI 2018 - Bangalore, India Duration: 19-09-2018 → 22-09-2018 |
Conference
Conference | 7th International Conference on Advances in Computing, Communications and Informatics, ICACCI 2018 |
---|---|
Country/Territory | India |
City | Bangalore |
Period | 19-09-18 → 22-09-18 |
All Science Journal Classification (ASJC) codes
- Artificial Intelligence
- Computer Networks and Communications
- Computer Science Applications
- Information Systems