Forked from tillson/gist:620e8ef87bc057f25b0a27c423433fda
Created
May 25, 2020 13:59
-
-
Save shamrocksu88/89a938e64a75551b5a70d5f03f66daa7 to your computer and use it in GitHub Desktop.
Revisions
-
tillson renamed this gist
May 10, 2020 . 1 changed file with 0 additions and 0 deletions.There are no files selected for viewing
File renamed without changes. -
tillson created this gist
May 10, 2020 .There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters. Learn more about bidirectional Unicode charactersOriginal file line number Diff line number Diff line change @@ -0,0 +1,14 @@ from pydriller import RepositoryMining import re import base64 foundSet = set() for commit in RepositoryMining('./').traverse_commits(): for mod in commit.modifications: if mod.source_code_before != None: regex = re.findall(r"<text encoding=\"base64\">[^>]+</text>", mod.source_code_before) for result in regex: based = str(base64.b64decode(result[len("<text encoding='base64'>"):-len("</text>")])) if based not in foundSet: print(based) foundSet.add(based + "\n")