#CyberSecurityNLP : Machine-based Text Analytics of National Cybersecurity Strategies.

X Close

Prev | Next

Cybersecurity Analysis based on Machine Learning

by Haojunzhi Yu 03/01/2018 03:33 AM GMT

{{:upVoteCount}}

Move idea from "Expert Review" stage to:

Collapse

Do you want to send this idea to AdaptiveWork?

Collapse

Do you want to send this idea to Portfolios?

Parent structure code

Collapse

Which workspace template do you wish to use?

Collapse

I accept the terms and conditions (see side bar). I understand all content I am submitting must be licensed under an open-source software or Creative Commons license as described in the Terms and Conditions:

Description

We will focus on following improvements:

Data collection

Transfer data from PDF to text files.

Text extraction

Extract text contents from the documents.

Sentence tokenizing

Remove stop words, group sentences with similar meanings, and mark groups with certain labels.

Classification of category and subcategories

Classify each group based on meaning of sentences.

5. Interface for easily indicating the location of files

Implementation Method:

Tools/Platforms: Anaconda(Jupyter), SPSS Modeler

Python Packages: NLTK, pdfminer3k, Scikit Learn, etc

Main challenges:

Building proper dictionary

Classifying the policies

Making user-friendliness interface

Timeline:

Before February 28th: Idea
Before March 10th: Data collection & Methodology & Tools choosing
Before April 25th: Realization & Results
Before May 3th: Improvements
Final results & presentation

Expected Outcomes:

Building a user-friendliness dictionary

Co-authors to your solution

Zijing Yu, Qinruo Wu, Zihao Wang

Link to your concept design and documentation (Required by the final day of the Submission & Collaboration phase)

Link to an online working solution or prototype (Required by the final day of the Submission & Collaboration phase):

Link to a video or screencast of your solution or prototype (Required by the final day of the Submission & Collaboration phase):

Link to source code of your solution or prototype above. (If you submitted a link to an online solution or prototype, or to a video of your solution of prototype, you must provide a link to the source code. This item is required by the final day of the submission phase):

Tags: Cybersecurity,Classification,Tokenization,Interface,Text Analytics,Data Extraction

Move this Idea

Close this idea

When closing an idea, you must determine whether the idea has exited successfully or unsuccessfully.

Was the idea selected?

What is the Primary annual Impact?*

Quantify based on your selection*

What is the annual Secondary Impact?

Quantify based on your selection

What will the next steps be?*

Cancel Submit

Add Team Members

*Required

Cancel Add Now

Done

Help to Improve This Idea.

life cycle stages

33%

User Tasks

Required for graduation.
Task	Assigned to	Due Date	Status
Approval	Jorge Martinez-Navarrete	06/16/2018	Completed on 05/04/2018
Judge review	Nicolas Engel	05/16/2018	Incomplete
Judge review	Atef Elhady	05/16/2018	Incomplete

Terms & Conditions

Help to Improve This Idea.

legal.notice.title

View Idea

Cybersecurity Analysis based on Machine Learning

Move idea from "Expert Review" stage to:

Do you want to send this idea to AdaptiveWork?

Do you want to send this idea to Portfolios?

Which workspace template do you wish to use?

Move this Idea

Close this idea

Copy idea to another community

Team Members

Add Team Members

Comments

Help to Improve This Idea.

Tasks

Comparable Ideas

Activities

Terms & Conditions

Help to Improve This Idea.

legal.notice.title

Inbox

View Idea

Cybersecurity Analysis based on Machine Learning

Move idea from "Expert Review" stage to:

Do you want to send this idea to AdaptiveWork?

Do you want to send this idea to Portfolios?

Which workspace template do you wish to use?

Move this Idea

Close this idea

Copy idea to another community

Team Members

Add Team Members

Comments

Help to Improve This Idea.

Tasks

Comparable Ideas

Activities