A Simplified C# Grammar

A language grammar is a set of small rules that can be combined to produce a syntax.

The many small rules in the C# programming language is its grammar. These rules specify the accepted way to write C# code. Individually, each rule in itself is not enough to produce complete working code. Instead, the rules are like building blocks, offering various ways to assemble or construct your code.

Take, for example, some of the grammar around variable declarations and program statements. A simplified grammar for declaring a variable in C# would look like this:

* dataType variableName*

We would have to supply both a data type and a variable name to declare a variable. Thus, if we wanted to store a whole number to represent a count, we could use the int keyword and write the following:

int count

By itself, this variable declaration is not enough to produce a complete instruction in C#. We have to combine the variable declaration grammar with the grammar for a program statement in order to make a single instruction telling the computer to create the variable. In general, a program statement must be a statement that ends with a semicolon (;). In other words, the program statement grammar requires the semicolon as “punctuation”, much in the same way that English sentences must end in a period. Thus, our completed syntax for creating a variable would be

int count;

The formal grammar for C# is actually quite large and complex. In fact, the grammars for most programming languages are so complex that they require another “language” to describe the grammar. An early and fairly standard language or means to express grammars is the Bakus-Naur Form, or BNF.

For our purposes, we will present a much more simplified grammar. The following sections express the bulk of the C# grammar used in this introduction to programming fundamentals in C#.

Common Grammar Elements

Most of the grammar rules in C# are quite short, defining the order of keywords, identifiers and symbols. In the following grammars, the use of square brackets ([]) indicate an optional part of the syntax for the grammar rule; those square brackets are not actual symbols in the final syntax of the grammar. With each of the rules is a very brief explanation. Later chapters will go into more depth and provide examples of how to use these grammars.

Program Statements and Statement blocks

Individual instructions are known as Program Statements. The instructions can be short and simple, or they can be long an complex. In either case, the program statement must end in a semicolon (;). These individual instructions run sequentially, one after the other, so the order of individual instructions is important.

Besides individual instructions, we can group individual statements into Statement Blocks. A statement block is a set of zero or more program statements that are enclosed in a set of curly braces ({ }). Statement blocks are frequently used with Flow Control Statements (such as the if and for statements).

Variable Declarations

Before a variable can be used, it must be declared. Declaring a variable tells the compiler to

set aside room in memory to store information,
treat that data as a specific data type, and
refer to that information by the variable name

The grammar for declaring variables is as follows.

1 dataType variableName [= expression] [, ...]

A Variable Declaration defines a new variable where: dataType is any built-in or programmer-defined data type.; variableName is; an optional initial value may be assigned, as denoted by [= expression], where expression is any valid C# expression whose final data type matches the variable’s data type. When a variable is declared and initialized at the same time, it is called a Variable Initialization.; additional variable names (with or without initial values) can be declared using a comma-separated list. All variables are of the same data type as the first variable in the list.

Assignment Operation

1 variableName assignmentOperator expression

Assignment Operations are operations where a value is assigned or stored in a variable.: variableName is the name of the variable that will receive/store the value.; assignmentOperator is one of the following:
= Equals
+= Plus-Equals
-= Minus-Equals
*= Multiply-Equals
/= Divide-Equals
%= Modulus-Equals; expression is any valid C# expression whose final data type matches the variable’s data type.; An assignment operation is made into an assignment statement by adding a semicolon to the end of the operation. For example,
total = price * quantity;

Expressions

An Expression is any combination of literal values, variable names, operators (such as the arithmetic operators), and/or method calls (where the method returns a value). When an expression is processed by the computer, a single value is produced. This value can then be used in whatever operation the expressions occurs. For example, the value might be passed into a method as part of a method call, or it might be placed in a variable as part of an assignment statement.

Namespace Declaration and Using Statements

1 namespace Name
2 {
3     // Classes, enumerations, or other programmer-defined data types
4 }

A Namespace Declaration groups programmer-defined data types where: The Name of the namespace can be one or more dot-separated names. For example, the following are all valid names for namespaces.
System
System.Collections
MyGame
MyGame.GameRules

Namespaces are used to group classes and other programmer-defined data types into a single named group. The reason for grouping programmer-defined data types into namespaces is to prevent what are called “naming collisions”. A naming collision is where two or more classes or other programmer-defined data types are given the same name. In other words, you cannot have two classes named Circle in the same namespace. However, if you place those two classes in different namespaces, then that is acceptable because the compiler will then be able to distinguish between the two classes based on the namespace they belong to.

Whenever a class or other data type is placed in a namespace, that namespace becomes part of the fully-qualified name of the data type. For example, if a class named Circle is placed in a namespace called Geometry.Shapes, then the fully qualified name of the class is Geometry.Shapes.Circle.

Namespaces allow us to isolate our classes and other data types into groups. All of the classes/data types in a given namespace can automatically reference each other. To reference or use data types in other namespaces, we must either use their fully-qualified names or include them through the Using Statement.

1 using NamespaceName;

The using statement allows access to all the data types in referenced namespace. Using statements are typically placed at the beginning of a file. The NamespaceName is simply the complete name of the namespace that contains the classes or data types that we want to access.

Classes and Class Members

As an object-oriented language, classes play a very prominent part of the code we write in C#. It is within classes, for example, that we place variables (also called fields) and methods (which are “named sets of instructions”). One of the first things that classes give us developers is a context or scope for the code that we write. Classes are also building blocks, acting as blueprints for new and complex data types that we as programmers can create as we develop richer and more complex computer programs. Classes permeate all the code that we write in C# and are so fundamental that you can’t even write a “Hello World” program without them.

Class Definition

1 [accessModifier] class ClassName
2 {
3   // FieldDeclarations
4   // PropertyDeclarations
5   // Constructors
6   // MethodDeclarations
7 }

A Class Definition describes a new data type where: [accessModifier] is either public{format: csharp} or internal{format:csharp}. If no access modifier is provided, then the default modifier is internal{format:csharp}.; ClassName is the programmer-supplied name for the class (in TitleCase format); FieldDeclarations, PropertyDeclarations, Constructors and MethodDeclarations are all optional and can appear in any order.
See the related grammars below to see how they are defined.

Field Declarations

1 [accessModifier] [static] dataType _FieldName [= constantExpression];

A Field Declaration identifies a static or instance member variable of the class where: [accessModifier] is either public{format: csharp}, private{format: csharp}, protected{format: csharp}, or internal{format:csharp}. If no access modifier is provided, then the default modifier is private{format:csharp}.; [static]{format:csharp} is an optional keyword. If present, the field is shared among all instances of the class. If absent (which is the common case) then the field is an instance member and one is created every time an object based on the class is created.; dataType is any built-in data type or the name of a programmer-defined data type.; _FieldName is a the name you give to the member variable. By convention, private fields are given an underscore as a prefix to the name.; constantExpression is an optional expression that generates data whose value can be determined at compile time. Being a constant expression does not mean that the field is a constant, only that the initial value stored in the field is a constant and can be known at compile time before the program runs.

Property Declarations

Properties are a kind of cross between methods and fields. On one hand, they are used in the same way that fields are. When you want to assign (or set) a value to a property, you place the property on the left side of the assignment operator. When you want to use (or get) the properties value, you simply reference the property name just as you would a field name.

Internally, however, the get and set operations are like the bodies of a method, where you can place instructions to retrieve or change the data in the class or object.

Explicitly Implemented Property

Explicitly implemented properties are properties where the programmer supplies the getter and setter implementations. The bodies of the getter and setter may reference a field (known as a backing store) that holds the actual information. In these cases, the property is working to provide a controlled access to the underlying field’s data.

In other situations, a property may merely have a getter where the body of the getter derives or calculates a value to return from some other source, such as a calculation.

 1 [accessModifier] [static] dataType PropertyName
 2 {
 3     get
 4     {
 5         // Body of getter
 6     }
 7     set
 8     {
 9         // Body of setter
10     }
11 }

A Property Declaration identifies a static or instance member of the class where: [accessModifier] is either public{format: csharp}, private{format: csharp}, protected{format: csharp}, or internal{format:csharp}. If no access modifier is provided, then the default modifier is private{format:csharp}.; [static]{format:csharp} is an optional keyword. If present, the Property is shared among all instances of the class. If absent (which is the common case) then the Property is an instance member.; dataType is any built-in data type or the name of a programmer-defined data type.; PropertyName is a the name you give to the member property.; Body of getter is a set of instructions that must ultimately return a value of the same data type as the property.; Body of the setter is a set of instructions that can process incoming data that is in the value{format:csharp} keyword. A typical implementation will store that data into the property’s backing store.

Autoimplemented Property

Autoimplemented properties are properties where the compiler takes care of the getter and setter implementations and also supplies a hidden field as the backing store for the property. The default get implementation is to retrieve the value from the backing store while the default set implementation is to place a value into the backing store.

1 [accessModifier] [static] dataType PropertyName { get; set; }

A Property Declaration identifies a static or instance member of the class where: [accessModifier] is either public{format: csharp}, private{format: csharp}, protected{format: csharp}, or internal{format:csharp}. If no access modifier is provided, then the default modifier is private{format:csharp}.; [static]{format:csharp} is an optional keyword. If present, the Property is shared among all instances of the class. If absent (which is the common case) then the Property is an instance member.; dataType is any built-in data type or the name of a programmer-defined data type.; PropertyName is a the name you give to the member property.

Method Declarations

1 [accessModifier] [static] returnType MethodName(ParameterList)
2 {
3     // body of method
4 }

A Method Declaration defines a named set of instructions.: [accessModifier] is either public{format: csharp}, private{format: csharp}, protected{format: csharp}, or internal{format:csharp}. If no access modifier is provided, then the default modifier is private{format:csharp}.; [static]{format:csharp} is an optional keyword. If present, the method is shared among all instances of the class. If absent (which is the common case) then the method is an instance member.; returnType is any built-in data type or the name of a programmer-defined data type. The return type signifies the kind of information that the method will return. If the method does not return any information, then the keyword void is used as the return type.; MethodName is a the name you give to the method.; ParameterList is a comma-separated list of individual variable declarations.

Constructors

1 [accessModifier] ClassName(ParameterList)
2 {
3     // body of constructor
4 }

A Constructor is a set of instructions used when instantiating (creating) an object.: [accessModifier] is either public{format: csharp}, private{format: csharp}, protected{format: csharp}, or internal{format:csharp}. If no access modifier is provided, then the default modifier is private{format:csharp}.; ClassName - All constructors use the name of the class to which they belong as the name of the constructor.; ParameterList is a comma-separated list of individual variable declarations.; Classes never return any information - they are simply blocks of instructions used to set up the initial state of the object.

Flow-Control Statements

If-Else
Case
For and Foreach
While and Do-While

Up next

Interlude 1 - Console Input/Output