Custom PMD Rules

by Tom
Copeland
04/09/2003

A Review of PMD

A few weeks ago, O'Reilly Network ran an article on PMD, an open source, Java static-analysis tool sponsored under the umbrella of the Defense Advanced Research
Projects Agency (DARPA) project "Cougaar." That article covered some of the basics of PMD--it's built on an Extended Backus Naur Format (EBNF) grammar, from which JavaCC generates a parser
and JJTree generates an Java Abstract Syntax Tree (AST), and comes with a number of ready-to-run rules that you can run on your own source code. You can also write your own rules
to enforce coding practices specific to your organization.

In this article, we'll take a closer look at the AST, how it is generated, and some of its complexities. Then we'll write a custom PMD rule to find the creation of Thread objects. We'll write this custom rule two ways,
first in the form of a Java class, and then in the form of an XPath expression.

The AST

Recall from the first article that the Java AST is a tree structure that represents a chunk of Java source code. For example, here's a simple code snippet and the corresponding AST:

Source Code	Abstract Syntax Tree
`Thread t = new Thread();`	`FieldDeclaration Type Name VariableDeclarator VariableDeclaratorId VariableInitializer Expression PrimaryExpression PrimaryPrefix AllocationExpression Name Arguments`

Here we can see that the AST is a standard tree structure: a hierarchy of nodes of various types. All of the node types and their valid children are defined in the EBNF grammar file. For example, here's the definition of a FieldDeclaration:

void FieldDeclaration() :

{

}

{

  ( "public"            { ((AccessNode) jjtThis).setPublic( true ); }

  | "protected"         { ((AccessNode) jjtThis).setProtected( true ); }

  | "private"           { ((AccessNode) jjtThis).setPrivate( true ); }

  | "static"            { ((AccessNode) jjtThis).setStatic( true ); }

  | "final"             { ((AccessNode) jjtThis).setFinal( true ); }

  | "transient"         { ((AccessNode) jjtThis).setTransient( true ); }

  | "volatile"          { ((AccessNode) jjtThis).setVolatile( true ); } )*

  Type() VariableDeclarator() ( "," VariableDeclarator() )* ";"

}

A FieldDeclaration is composed of a Type followed by at least one VariableDeclarator; for example, int x,y,z = 0;. A FieldDeclaration may also be preceeded by a couple of different modifiers, that is, Java keywords like transient or private.
Since these modifiers are separated by a pipe symbol and followed by an asterisk, any number can appear in any order. All of these grammar rules eventually can be traced back to the Java Language Specification (JLS) (see the Referencessection
below).

A Custom Rule

Now that we've reviewed the AST a bit more, let's write a custom PMD rule. As mentioned before, we'll assume we're writing Enterprise Java Beans, so we shouldn't be using some of the standard Java library classes. We shouldn't open a FileInputStream,
start a ServerSocket, or instantiate a new Thread. To make sure our code is safe for use inside of an EJB container, let's write a rule that checks for Thread creation.

Writing a Custom Rule as a Java Class

Let's start by writing a Java class that traverses the AST. From the first article, recall that JJTree generates AST classes that support the Visitor pattern. Our class will register for callbacks when it hits a certain
type of AST node, then poke around the surrounding nodes to see if it's found something interesting. Here's some boilerplace code:

// Extend AbstractRule to enable the Visitor pattern

// and get some handy utility methods

public class EmptyIfStmtRule extends AbstractRule {

}

If you look back up at the AST for that initial code snippet--Thread t = new Thread();--you will find an AST type called an AllocationExpression. Yup, that sounds like what we're
looking for: allocation of newThread objects. Let's add in a hook to notify us when it hits a new [something] node:

public class EmptyIfStmtRule extends AbstractRule {

    // make sure we get a callback for any object creation expressions

    public Object visit(ASTAllocationExpression node, Object data){

       return super.visit(node, data);

    }

}

We've put a super.visit(node,data) in there so the Visitor will continue to visit children of this node. This lets us catch allocations within allocations, i.e., new Foo(new Thread()). Let's add in an if statement to exclude array allocations:

public class EmptyIfStmtRule extends AbstractRule {

    public Object visit(ASTAllocationExpression node, Object data){

    // skip allocations of arrays and primitive types:

    // new int[], new byte[], new Object[]

        if ((node.jjtGetChild(0) instanceof ASTName) {

            return super.visit(node, data);

        }

    }

}

We're not concerned about array allocations, not even Thread-related allocations like Thread[] threads = new Thread[];. Why not? Because instantiating an array of Thread object
references doesn't really create any new Thread objects. It just creates the object references. We'll focus on catching the actual creation of the Thread objects. Finally, let's
add in a check for the Thread name:

public class EmptyIfStmtRule extends AbstractRule {

    public Object visit(ASTAllocationExpression node, Object data){

        if ((node.jjtGetChild(0) instanceof ASTName &&

        ((ASTName)node.jjtGetChild(0)).getImage().equals("Thread")) {

            // we've found one!  Now we'll record a RuleViolation and move on

            ctx.getReport().addRuleViolation(

                createRuleViolation(ctx, node.getBeginLine()));

        }

        return super.visit(node, data);

    }

}

That about wraps up the Java code. Back in the first article, we described a PMD ruleset and the XML rule definition. Here's a possible ruleset definition containing the rule we just wrote:

<?xml version="1.0"?>

<ruleset name="My company's EJB checker rules">

  <description>

The Design Ruleset contains a collection of rules that find questionable designs.

  </description>

  <rule name="DontCreateThreadsRule"

        message="Don't create threads, use the MyCompanyThreadService instead"

        class="org.mycompany.util.pmd.DontCreateThreadsRule">

    <description>

Don't create Threads, use the MyCompanyThreadService instead.

    </description>

    <example>

<![CDATA[

 Thread t = new Thread(); // don't do this!

]]>

    </example>

  </rule>

</ruleset>

You can put this ruleset on your CLASSPATH or refer to it directly, like this:

java net.sourceforge.pmd.PMD /path/to/src xml /path/to/ejbrules.xml

Writing a Custom Rule as an XPath Expression

Recently Daniel Sheppard enhanced PMD to allow rules to be written using XPath. We won't explain XPath completely here--it would require a large book--but generally speaking, XPath is a way of querying an XML document. You can write an XPath query to get a
list of nodes that fit a certain pattern. For example, if you have an XML document with a list of departments and employees, you could write a simple XPath query that returns all the employees in a given department, and you wouldn't need to write DOM-traversal
or SAX-listener code.

Future Plans

There's still a lot of work to do on PMD. Now that this XPath infrastructure is in place, it might be possible to write an interactive rule editor. Ideally, you could open a GUI, type in a code snippet, select certain AST nodes, and an XPath expression that
finds those nodes would be generated for you. PMD can always use more rules, of course. Currently, there are over 40 feature requests on the web site just waiting for someone to implement them. Also, PMD has a pretty weak symbol table, so it occasionally picks
up a false positive. There's plenty of room for contributors to jump in and improve the code.

Conclusion

This article has presented a more in-depth look at the Java AST and how it's defined. We've written a PMD rule that checks for Thread creation using two techniques--a Java class and an XPath query. Give PMD a try and
see what it finds in your code today!

EBS Custom Password Rules
https://blogs.oracle.com/manojmadhusoodanan/entry/custom_password_rules Custom Password Rules By Man ...
[引]雅虎日历控件 Example: Two-Pane Calendar with Custom Rendering and Multiple Selection
本文转自:http://yuilibrary.com/yui/docs/calendar/calendar-multipane.html This example demonstrates how t ...
Android Weekly Notes Issue #235
Android Weekly Issue #235 December 11th, 2016 Android Weekly Issue #235 本期内容包括: 开发一个自定义View并发布为开源库的完 ...
Fedora 24中的日志管理
Introduction Log files are files that contain messages about the system, including the kernel, servi ...
Rails sanitize
The SanitizeHelper module provides a set of methods for scrubbing text of undesired HTML elements. T ...
关于 ant 不同渠道自动打包的笔记
必要的java.android.ant文件及循环打包用到的ant的jar 下载Ant(这里的Ant不是eclipse和android SDk里面自带的ant) 官方下载地址:http://a ...
windows下Android利用ant自动编译、修改配置文件、批量多渠道，打包生成apk文件
原创文章,转载请注明:http://www.cnblogs.com/ycxyyzw/p/4535459.html android 程序打包成apk,如果在是命令行方式,一般都要经过如下步骤: 1.用a ...
[Architect] ABP(现代ASP.NET样板开发框架) 翻译
所有翻译文档,将上传word文档至GitHub 本节目录: 简介代码示例支持的功能 GitHub 简介 ABP是“ASP.NET Boilerplate Project (ASP.NET样板项目) ...
[Tool] 使用StyleCop验证命名规则
[Tool] 使用StyleCop验证命名规则前言微软的MSDN上,有提供了一份微软的命名方针,指引开发人员去建立风格一致的程序代码. http://msdn.microsoft.com/zh-t ...
Android Studio 和 Gradle
由于以前没做过什么java项目,在使用Android Studio时遇到了Gradle,真是一头雾水,决定总结一下. 具体的使用方法请参看:http://www.cnblogs.com/youxilu ...

随机推荐

LogBack 没有打印日志
背景: 某日进行测试,新增了一行日志(项目使用的是logback) 报错: 无,就是不打印日志解决: 经过仔细查看代码,发现之前的人写代码的时候在其它类里面,将 private final Log ...
ES7学习笔记（五）动态映射
通常情况下,我们使用ES建立索引的步骤是,先创建索引,然后定义索引中的字段以及映射的类型,然后再向索引中导入数据.而动态映射是ES中一个非常重要的概念,你可以直接向文档中导入一条数据,与此同时,索引. ...
Storybook version8 智能化构建组件文档与单元测试
根据官方文档说法,storybook 是一个独立构建前端UI组件与页面的车间. Storybook is a frontend workshop for building UI components ...
小tips：使用vuecli2脚手架配置vant自定义主题
一:工程安装less.less-loader 配置版本如下: "devDependencies": { "less": "^3.0.4", ...
城市时空预测的统一数据管理和综合性能评估 [实验、分析和基准]《Unified Data Management and Comprehensive Performance Evaluation for Urban Spatial-Temporal Prediction [Experiment, Analysis & Benchmark]》
2023年11月1日,还有两个月,2023年就要结束了,希望在结束之前我能有所收获和进步,冲呀,老咸鱼. 论文:Unified Data Management and Comprehensive Pe ...
TypeScript 高级教程 – TypeScript 类型体操 (第三篇)
前言在第一部 – 把 TypeScript 当强类型语言使用和第二部 – 把 TypeScript 当编程语言使用后, 我们几乎已经把 TypeScript 的招数学完了. 第三部就要开始做 ...
26岁女生转行车载测试1年，月入15K~
年前有朋友找工作,跟我说简历改了车载后,收到的打招呼翻了几倍,如今车载测试前景非常广阔,因为越来越多的汽车厂商正在开发新的可智能化的汽车,他们需要测试这些汽车的性能,安全性以及可靠性.车载测试技术可以 ...
BOOST 定时器 stop探究
Asio是一个建立在Boost所提供的相关组件之上的异步的网络库,可以运行在Win/Linux/Unix等各种平台之上. 不过随着C++11的发布,其对Boost的依赖也越来越少,作者又做了一个不依赖 ...
[OI] 平衡树
1. 二叉查找树二叉查找树的思想和优先队列比较像,都是把若干个数据按一定规则插到一棵树里,然后就可以维护特定的信息. 在优先队列的大根堆实现里,我们让每棵子树的根节点都大于它的儿子,这样就可以保证根 ...
Java项目笔记（三）
一.前端传参类似以下格式,对象中包含一个对象,后台此时接收option为stirng类型 curriculumid question answer option {optionOne ,optionT ...

Custom PMD Rules

A Review of PMD

The AST

A Custom Rule

Writing a Custom Rule as a Java Class

Writing a Custom Rule as an XPath Expression

Future Plans

Conclusion

Credits

References

Custom PMD Rules的更多相关文章

随机推荐

热门专题