Skip to content

Latest commit

 

History

History
33 lines (26 loc) · 1.38 KB

README.md

File metadata and controls

33 lines (26 loc) · 1.38 KB

Crawlers

This repo will not continue, you look this repo

All university crawlers, crawling university page then insert mongodb

MongoDB Document is like json on the bottom;

{
  "site": "ytuce.maliayas.com",
  "authorName": "Ali Mehmet",
  "authorLink": "ce.yildiz.edu.tr/alimehmet",
  "titleName": "Exam Questions",
  "titleLink": "ce.yildiz.edu.tr/alimehmet/news/123AB45",
  "content": "Exam is hard",
  "id": "ASD12123SDSF9IASKDASD",
  "date": "27.01.2017",
  "clock": "20:00",
  "status": "new"
}
Content should hash with md5, then assign id

University Computer Science or Engineer List

University Crawling Site Status
Yildiz Technical https://ytuce.maliayas.com/ Ok
Istanbul http://ce.istanbul.edu.tr/ Nope
Pamukkale http://www.pamukkale.edu.tr/bilgisayar Nope
Istanbul Technical http://www.bb.itu.edu.tr/ Nope