Browser Fuzzing

浏览器Fuzzing，private project。 so only functional codes will be pushed.

Sample_downloader（single thread） ————download repositories from github as web-samples 用来下载web样本

distributor.py 脚本用来提取下载好的仓库中的HTML，CSS，JS文件。

同时会对HTML文件中的JS，CSS部分做替换提取。

设置选项

146行 repo_path 是要提取的仓库路径

147行 output_dir 是要输出的目录路径。

_split 设置是否要为各个仓库单独创建文件夹

html2vectorMatrix.py 是函数文件

main.py里有使用示例，如何使用请看main.py

index.html是测试文件

all_tag是html标签字典


#coding:utf8

from html2vectorMatrix import *


tagList = getTagList() # 读取本地字典

html_str = readFile('./index.html') # 读取本地index.html文件

vector_matrix = generateMatrix(html_str,tagList) # 获取向量矩阵

对于不在字典中的token，会将其字符串输出，可以以此来分类或者扩充字典


html_string = matrix2string(vector_matrix,tagList) 

print(html_string)


analyzeMatrix(vector_matrix,tagList)

以上~ 函数示例，查看main文件即可

Provide feedback