Merge branch 'master' of github.com:Submitty/submitty.github.io

bmcutler · bmcutler · commit 2808db1747ba · 2018-02-15T15:01:27.000-05:00
diff --git a/_docs/instructor/commonAST.md b/_docs/instructor/commonAST.md
@@ -0,0 +1,68 @@
+---
+title: commonAST
+category: Instructor
+order: 9
+---
+
+commonast.py is a static analysis tool that counts programming language constructs in source files.
+The tool runs in one of two modes: **count mode** and **print mode**.
+
+### Count Mode
+
+```
+python commonast.py lang nodeType arg filename1 filename2 .... filenamen 
+```
+will output the number of ```nodeType``` nodes, restricted by the argument ```arg```, found in the source files ```filename1``` through ```filenamen```, which are written in language ```lang```. It can also output further information about the AST, depending on the ```outputOption``` (see Print Mode).
+
+Supported ```lang```s: 
+* -py 
+* -cpp 
+
+
+Supported ```nodeType```s: 
+* -For counts the number of for loops
+    * Supported args:  
+        * -Void 
+        
+* -While counts the number of while loops
+    * Supported args:  
+        * -Void 
+        
+* -Call counts the number of calls (of a certain name or just calls in general)
+    * Supported args:  
+        * -Void 
+        * A specific function name, to count only calls to that function (example: count only the calls to print)
+
+(The infrastructure exists to count many more node types. They will be added to this list once they have been tested against Sam's tool to confirm they are correct.)
+
+You can count nodes in any number of files. The number returned is the total count of matching nodes across all of the files.
+
+### Print Mode
+
+```
+python commonast.py outputOption lang filename 
+```
+
+Supported ```outputOption```s: 
+* -json or -JSON 
+
+### Example Calls: 
+
+```python commonast.py -py -For -Void hw1.py```
+Counts the number of for loops in hw1.py.
+
+```python commonast.py -py -Call check1 hw1.py```
+Counts the number of calls to the function "check1" in hw1.py. The function "check1" may or may not be defined in hw1.py.
+
+```python commonast.py -json -cpp hw1.cpp```
+Prints a JSON representation of the AST of hw1.cpp.
+
+
+### Additional Install Steps:
+Running this on Python files works with the standard Submitty install. To run it on C++ files, some additional install steps are required:
+1. Navigate to ```/usr/local/submitty/GIT_CHECKOUT_SUBMITTY/.setup```.
+2. Run ```git checkout commonAST``` to switch to the correct branch.
+3. Run ```python clangInstall.py```. This could take a few hours.
+4. Navigate to ```/usr/local/submitty/GIT_CHECKOUT_AnalysisTools/```.
+5. Run ```git checkout commonASTInitial``` to switch to the correct branch.
+6. Run ```sudo /usr/local/submitty/.setup/INSTALL_SUBMITTY.sh``` to complete the installation.
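
To illustrate the kind of result count mode reports for Python input, here is a minimal sketch built on Python's standard `ast` module. It is not the commonast.py implementation: the function name `count_nodes` and its parameters are hypothetical, and it assumes a call is matched by name only when it is a plain-name call such as `check1()`.

```python
import ast

# Hypothetical mapping from the documented nodeType names to ast classes.
NODE_KINDS = {"For": ast.For, "While": ast.While, "Call": ast.Call}


def count_nodes(source, node_type, call_name=None):
    """Count For, While, or Call nodes in a string of Python source.

    call_name=None plays the role of -Void here (assumption): no filter.
    """
    wanted = NODE_KINDS[node_type]
    total = 0
    for node in ast.walk(ast.parse(source)):
        if not isinstance(node, wanted):
            continue
        if node_type == "Call" and call_name is not None:
            # Only plain-name calls are matched; method calls are skipped.
            func = node.func
            if not (isinstance(func, ast.Name) and func.id == call_name):
                continue
        total += 1
    return total


sample = """
for i in range(3):
    print(i)
while False:
    pass
check1()
"""

for_loops = count_nodes(sample, "For")
all_calls = count_nodes(sample, "Call")
check1_calls = count_nodes(sample, "Call", "check1")
print(for_loops, all_calls, check1_calls)
```

Summing the result over several files would mirror the documented behaviour of returning one total for all filenames.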
diff --git a/_docs/instructor/static_analysis.md b/_docs/instructor/static_analysis.md
@@ -17,7 +17,7 @@ given assignment, supplying the type of feature to count, the feature itself,
 any number of source files, and optional configuration flags.  For example:
 
 ```
-"submitty_count --language python call print *.py"
+submitty_count --language python call print *.py
 ```
 
 _Note: `submitty_count` is an alias for a program installed on the
@@ -32,7 +32,7 @@ This example will output the number of calls to the function ``print`` in all
 of the Python source files in the current directory. Another example:
 
 ```
-"submitty_count -l c token Goto main.cpp"
+submitty_count -l c token Goto main.cpp
 ```
 
 This second example will output the number of occurrences of the token ``goto`` in the
@@ -48,7 +48,88 @@ ___
 
 ## Countable Features
 Currently, three feature types can be counted: tokens, nodes, and function calls.
+The countable features contained in a given file can be identified using ``submitty_diagnostics``, for example as follows:
 
+```
+/usr/local/submitty/SubmittyAnalysisTools/diagnostics -l python file.py
+```
+
+This tool outputs JSON data by default.
+An interactive view of the data can be produced by specifying HTML format:
+
+```
+/usr/local/submitty/SubmittyAnalysisTools/diagnostics -l python --format html file.py
+```
+
+For example, if you would like to count additions, but are unsure of which token to count, you could use a test file like:
+
+```
+# file.py
+print(1 + 1)
+```
+
+Running `/usr/local/submitty/SubmittyAnalysisTools/diagnostics -l python file.py` on this file will produce the following output:
+
+```
+{
+    "/absolute/path/to/file.py": {
+        "tokens": [
+            {
+                "end_col": 6,
+                "token": "Identifier",
+                "start_line": 2,
+                "start_col": 1,
+                "end_line": 2
+            },
+            {
+                "end_col": 7,
+                "token": "LeftParen",
+                "start_line": 2,
+                "start_col": 6,
+                "end_line": 2
+            },
+            {
+                "end_col": 8,
+                "token": "IntegerLiteral",
+                "start_line": 2,
+                "start_col": 7,
+                "end_line": 2
+            },
+            {
+                "end_col": 10,
+                "token": "Plus",
+                "start_line": 2,
+                "start_col": 9,
+                "end_line": 2
+            },
+            {
+                "end_col": 12,
+                "token": "IntegerLiteral",
+                "start_line": 2,
+                "start_col": 11,
+                "end_line": 2
+            },
+            {
+                "end_col": 13,
+                "token": "RightParen",
+                "start_line": 2,
+                "start_col": 12,
+                "end_line": 2
+            }
+        ],
+        "nodes" : { ... node data here ... }
+    }
+}
+```
+
+The ``token`` fields specify tokens that can be given to ``submitty_count``.
+Notice that a token ``Plus`` is present between two ``IntegerLiteral`` tokens.
+You can verify that this is the right token by checking the ``start_line``, ``end_line``, ``start_col``, and ``end_col`` fields, which give the line and column at which each token begins and ends within the file.
+Once you are sure that the token is correct, you could count it within student submissions with ``submitty_count``:
+
+```
+submitty_count -l python token Plus *.py
+```
 
 ### Tokens
 
@@ -107,12 +188,6 @@ should be the first tool considered when writing an assignment that
 requires static analysis.  Only seek out more advanced options when
 necessary.
 
-__TODO: Insert instructions to produce the intermediate tokens so the
-instructor user can experiment.__
-
-__TODO: Insert link to list of valid tokens that can counted.__
-
-
 ### Nodes
 
 The next level of analysis enables counting _nodes_ within a parse tree, which
@@ -146,15 +221,15 @@ what kind of literal is present. This enables the counting of
 specific classes of node. For example:
 
 ```
-"submitty_count -l python literal *.py"
+submitty_count -l python literal *.py
 ```
 
 If run upon the code fragment from the start of this section, this will yield 3,
 counting all literals used within the code. Contrast:
 which will return `3`.  In contrast:
 
 ```
-"submitty_count -l python integer *.py"
+submitty_count -l python integer *.py
 ```
 
 will return `2`, as it will only count the integer literals.
@@ -169,12 +244,6 @@ counting approach. However, these features have different nodes in the parse
 tree, so by counting nodes with certain tags it is possible to easily
 distinguish them.
 
-__TODO: Insert instructions to produce a human readable version of the
-parse tree so the instructor user can experiment.__
-
-__TODO: Insert link to valid tags (& nodes?) that can counted.__
-
-
 ### Function Calls
 
 This method is a bit higher level: it attempts via a language-dependent method
@@ -185,9 +254,5 @@ method at RPI is determining the number of calls to the ``print`` function
 present in Python code, for example:
 
 ```
-"submitty_count call print -l py *.py"
+submitty_count call print -l py *.py
 ```
-
-__TODO: Insert instructions to produce a human readable version of the
-functions found in a specific program(?) so the instructor user can experiment.__
-
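
The token list printed by the diagnostics tool can also be summarized programmatically. The sketch below assumes the JSON shape shown in the example output in static_analysis.md (a mapping from file path to an object with a `tokens` list). The helper name `tally_tokens` is hypothetical, and the embedded data is abbreviated from that example.

```python
import json
from collections import Counter


def tally_tokens(diagnostics_json):
    """Return a Counter of token names across all files in the output."""
    data = json.loads(diagnostics_json)
    counts = Counter()
    for file_info in data.values():
        # Each entry is assumed to carry a "token" field naming the token.
        for entry in file_info.get("tokens", []):
            counts[entry["token"]] += 1
    return counts


# Abbreviated stand-in for the diagnostics output of `print(1 + 1)`.
example_output = json.dumps({
    "/absolute/path/to/file.py": {
        "tokens": [
            {"token": "Identifier", "start_line": 2, "start_col": 1},
            {"token": "LeftParen", "start_line": 2, "start_col": 6},
            {"token": "IntegerLiteral", "start_line": 2, "start_col": 7},
            {"token": "Plus", "start_line": 2, "start_col": 9},
            {"token": "IntegerLiteral", "start_line": 2, "start_col": 11},
            {"token": "RightParen", "start_line": 2, "start_col": 12},
        ]
    }
})

token_counts = tally_tokens(example_output)
for name, count in sorted(token_counts.items()):
    print(name, count)
```

A summary like this makes it easier to pick which token name to pass to ``submitty_count``.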