Programming Topics + CVu Journal Vol 17, #5 - Oct 2005

Browse in :

All > Topics > Programming
All > Journals > CVu > 175
Any of these categories - All of these categories

Note: when you create a new publication type, the articles module will automatically use the templates user-display-[publicationtype].xt and user-summary-[publicationtype].xt. If those templates do not exist when you try to preview or display a new article, you'll get this warning :-) Please place your own templates in themes/yourtheme/modules/articles . The templates will get the extension .xt there.

Title: Let's Do C# and MySQL - Part 1 - MySQL

Author: Administrator

Date: 02 October 2005 06:00:00 +01:00 or Sun, 02 October 2005 06:00:00 +01:00

Summary:

Before I start, this is going to be fun. Understand? Fun. Not dull, but fun. I intend writing this in a banana suit with my feet in a bucket of warm rice pudding[1]. That's how much fun it's going to be!

Body:

Before I start, this is going to be fun. Understand? Fun. Not dull, but fun. I intend writing this in a banana suit with my feet in a bucket of warm rice pudding^[1]. That's how much fun it's going to be!

Objectives

What is the point of the series of articles? I intend it to be a learning progression to take you from the very basics of an application through to implementation. I've chosen to write a small MySQL front end in C# to do this. There will be a progression from setting up the MySQL database, testing using a command line application, moving it over to a System.Windows.Forms application and then adding in functionality. I will also look at using db4o. The question is though - why C#?

The MySQL API is very well documented and is accessible through many different programming languages with the minimum of fuss. I've chosen C# as I'm yet to find any really good tutorials for using the language. Sure, MSDN contains a great deal of information, but without the linking arguments, the information is almost context less.

Let's Make a Start

You will need a copy of MySQL installed on your computer to benefit from this series of articles and some form of C# compiler. The code works fine with both Mono and Microsoft C# - I have not tested the C# using gnu.NET. When compiling the code, Microsoft users should use csc and Mono users mcs. Any additional dlls to be used will be listed as -r:System.Drawing -r:System.Windows.Forms (though if you are using Visual Studio, these are normally be hidden).

Setting up MySQL

The first thing we'll need is a working MySQL database. I'm not going to assume that you've set up a database before now, but there are effectively two things you need to decide. What's going into it and how it's laid out. Take the following list of elements:

Element	Atomic Symbol	Description URL	Mass	Atomic Number
Copper	Cu	metals/copper	63.546	29
Chlorine	Cl	gases/chlorine	35.5	17
Neptunium	Np	actinides/neptunium	237	93

I could have one very large database which would contain parameters such as the period number, CAS registry, element block, shell filling, picture URL, if it is radioactive, man made and so on. This would be fine, but very messy and a pain to administer too.

A simpler solution is to break the database down and to do that, we need a couple of generic categories.

The list above gives three types of element: metal, gas and man-made. They are good enough and more can be added as and when I need them (such as trans-metal, halide and trans-actinide). But what of the actual data? For that, a second table is created which is linked to the generic first table. In the second table, the specifics of the element are held (such as discovery, relative atomic weight, shell filling, etc.).

Graphically, the database looks like this.

For the Element Info column, ID represents the classification ID - for example, metal = 1, gas = 2. Name is the generic name, Description a generic description and Image a generic picture. Element Info links to The Element in what is known as a 1 to many (sometimes known as 1 to infinity) relationship (1 item in the generic table can link to many items in the second table, as long as they match the type of the generic item). The Element list is straight forward to understand which again is linked to a third database (again via a 1 to many relationship) for other information.

Okay, what you should now be asking is why the other information should be in its own table?

When you think about it, the answer to that is simple. Say a new element is discovered (or created). While it is entirely possible to enter all of the details into one big table, discovery of elements and the dates are sometimes contested (especially for the trans-actinides where there was originally contested names for elements 104 to 111) - after the dust settles, it may be that the original name has to be changed, the discoverer changed and a few other details to boot. It is far simpler to have all of the "additional" information elsewhere so that the main database is still correct.

When considering a MySQL database, it could also be thought of as spreadsheet in style with columns and rows. A column will always be of a particular type which will be associated with a row. For example

The white blocks indicate the type associated. It's not quite how things are done, but is a convenient way to represent the database. (In reality, it's actually a two column table with the ID in column 1 and the type in column 2 - my representation is just handy - well, to me at least!)

MySQL Data Types

If you think of the MySQL server as a form of computer, you can quickly see that objects within the database can be thought of as variables and all variables need to be of a specific type (such as int, char and float). MySQL has specific data types for the components of a database. Effectively,there are three main types for MySQL : Date & Time, Numeric and String.

Date and Time

Field name	What it does	Range
DATE	It's a date. MySQL allows dates to be inputted as either a string or numeric.	1000-01-01 to 9999-12-31. Always in yyyy-mm-dd format.
DATETIME	Time and date. As with date, the fields can be entered as strings or numerics. Always displayed a yyyy-mm-dd HH:MM:SS	1000-01-01 00:00:00 to 9999-12-31 23:59:59
TIME	Time - has the format HH:MM:SS and can be set using strings or numeric values	-838:59:59 to 838:59:59
TIMESTAMP	This allows a timestamp to be added when either update or insert are used. TIMESTAMP is returned as a string of the format yyyy-mm-dd HH:MM:SS. TIMESTAMP will automatically add the time and date of the most recent operation if you don't set it.	1970-01-01 00:00:00 to midway through 2037.
YEAR[(2/4)]	Gives the year as either 2 or 4 digit values. Default 4 digit.	0000 and between 1901-2155 for 4 digit and 70-69 for 2 digit (represents 1970 to 2069).

String Types

Field name	What it does & Options	Range
CHAR(M)	A character string of size M. It is right space padded. Options BINARY (also written as BYTE), ASCII (assigns latin1), UNICODE (assigns ucs2). Trailing spaces are removed when retrieved. If M > 255, it is automatically converted to SMALLTEXT (From version 4.1.0) - this applies to other large values. CHAR can be preceded by NATIONAL. This tells the database to use the default character set for that nation.	0 - 255
CHAR	Synonym for CHAR(1)
VARCHAR(M)	A variable length string of maximum size M. Can be preceded by NATIONAL. Options : BINARY (stores the string as a binary set)	M = 0 to 255 (MySQL < 5.0.3)
BINARY(M)	Roughly the same as CHAR(M)
VARBINARY(M)	Roughly the same as VARCHAR(M) except is stored as binary byte string instead of non-binary character bytes.
TINYBLOB	A small blob column.	255 bytes
TINYTEXT	A small text column	255 bytes
BLOB[(M)]	A blob column. The server will create the smallest possible blob column for the size. If M < 255 then a TINYBLOB is used.	65535 bytes
TEXT[(M)]	A text column. The server will create the smallest possible blob column for the size. If M is < 255, then a TINYTEXT is used.	65535 bytes
MEDIUMBLOB	A medium sized blob column	16,777,215 bytes
MEDIUMTEXT	A medium sized text column	16,777,215 bytes
LONGBLOB	A somewhat large blob column	4,294,967,295 bytes (4 Gb)
LONGTEXT	A large text column	4,294,967,295
ENUM(c1,c2,..)	An enumeration column. It is very similar to C enumerations with the strings being held internally as integer values.	Max. 65535 enumerations
SET(c1, c2,...)	A string which can have more than one value (0 to n). The strings (c1, c2 to cn) are held internally as integers.	Max. 64 strings.

Numeric Values

Field name	What it is, options and aliases	Range
BIT [(M)]	A bit field type. If M is not specified, it is taken as 1.	1 - 64
TINYINT[(M)]	A very small integer. Can be signed (default) or UNSIGNed. Can also be set with ZEROFILL.	0 - 255 (unsigned) or -127 to 128 (signed).
BOOL, BOOLEAN	This is a synonym for TINYINT(1).	0 = false, non-zero = true.
SMALLINT[(M)]	A small integer range. Can be signed (default) or UNSIGNed. Can also be set with ZEROFILL.	-32767 - 32768 or 0 - 65535
MEDIUMINT[(M)]	A medium range integer. Can be signed (default) or UNSIGNed. Can also be set with ZEROFILL.	-8388608 - 8388607 or 0 - 16777215
INT[(M)]	A normal sized integer. Can be signed (default) or UNSIGNed. Can also be set with ZEROFILL. Can also be written as INTEGER.	-2147483648- 2147483647 or 0 - 4294967295
BIGINT[(M)]	A big integer. Can be signed (default) or UNSIGNed. Can also be set with ZEROFILL. Use with care (see the notes after this table).	-9223372036854775808 - 92233720368547758 or 0 - 18446744073709551615
FLOAT[(p)]	A floating point value. Can be signed (default) or UNSIGNed. Can also be set with ZEROFILL.	p = 0 - 24, single precision, p = 25 - 53, double precision
FLOAT[(M,D)]	A floating point figure. Can be signed (default) or UNSIGNed. Can also be set with ZEROFILL. M is the display width, D is the number of decimal places. If no values are set or FLOAT(p = 25 to 53) is not used (M can be blank), the value is single precision. Can also be written as REAL.	-3.402823466E+38 to -1.175494351E-38, 0 and 1.175494351E-38 to 3.402823466E+38
DOUBLE[(M,D)]	A double precision value. Can be signed (default) or UNSIGNed. Can also be set with ZEROFILL. M is the display width, D is the number of decimal places. Also can be written as DOUBLE PRECISION.	-1.7976931348623157E+308 to -2.2250738585072014E-308, 0, and 2.2250738585072014E-308 to 1.7976931348623157E+308
DECIMAL[(M[,D]) (MySQL 5.03)	A packed exact fixed point figure of length M and with D digits. If D is omitted, 0 is used (there can be no decimals or fractions with this value). If M is omitted, 10 is used. Can also be UNSIGNed and ZEROFILL. Also written as DEC[(M[,D]), FIXED[M[,D]) and NUMERIC[(M[,D]).	Max D = 30, Max M = 64
DECIMAL[(M[,D]) (pre MySQL 5.03)	An unpacked exact fixed point figure of length M and with D digits (it acts more like a CHAR column). All parameters and synonyms for the above apply.	Max D = 30, Max M = 64

Notes for BLOB

A BLOB is a Binary Large OBject - it can be just about any piece of binary data (picture, video or audio are examples). A BLOB can be useful in some circumstances (saving of avatars or to replace a directory of binary objects) but can also be a hindrance (increases the size of the database and may take longer to serve up the data).

Notes for BIGINT

All arithmetic is done using signed BIGINT or DOUBLE values, so you shouldn't use unsigned big integers larger than 9223372036854775807 (63 bits) except with bit functions! If you do that, some of the last digits in the result may be wrong because of rounding errors when converting a BIGINT value to a DOUBLE.

MySQL 4.0 can handle BIGINT in the following cases:

When using integers to store big unsigned values in a BIGINT column.
In MIN(col_name) or MAX(col_name), where col_name refers to a BIGINT column.
When using operators (+, -, *, and so on) where both operands are integers.
You can always store an exact integer value in a BIGINT column by storing it using a string. In this case, MySQL performs a string-to-number conversion that involves no intermediate double-precision representation.
The -, +, and * operators use BIGINT arithmetic when both operands are integer values. This means that if you multiply two big integers (or results from functions that return integers), you may get unexpected results when the result is larger than 9223372036854775807.

Setting Up the Database for Next Time

MySQL can be set up from the command line or by using something like mysqlcc [_1] or phpMyAdmin [_2]. Being a bit of a traditionalist, I'll go with the terminal windoUw (mainly as it also means that you don't need to download anything else!).

To call up MySQL, open a terminal window and type mysql -u root -p. This tells MySQL that you want to start up with the user "root" and with a password. The actual nuts and bolts of setting up MySQL isn't that hard.

If you've never used MySQL before, then the default password is nothing - just hit the return key. This is a hideous security hole and one which needs to be rectified as soon as you log in.

Once in, you'll see something similar to the image on the right.

If you've not set a password, then enter the following:

UPDATE user set password = PASSWORD("newpassword") 
where user = 'root'; FLUSH PRIVILEGES; exit;

(In this instance, newpassword is a password of your choice)

You now have a new password for the root user on your MySQL server. FLUSH PRIVILEGES clears all privileges on the server and exit does what it says on the tin - it exits the MySQL terminal.

That done, login to the server again. This time, let's examine what tables come by default.

mysql> show databases;

This will produce a table containing all of the databases the MySQL server is currently serving (right). You can see there is only 1 database on the MySQL server this series is being composed on. mysql is the database which contains details about users, passwords and the such.

When show databases; didn't show up the name of the database to be created, that gave the green light to say the database can be created.

Before creating the database (and as is common with creating anything on a computer), the database, linking, names and types should be written on paper first.

I have already specified earlier the names which I will use for the columns, but what about the types?

ID (here) is an INT and is also given the special type of PRIMARY_KEY when defining. The tables below show all of the types for all of the names.

The PRIMARY_KEYs all link to each other. KEY(1) is the "root" key which links to KEY(2). KEY(3) is linked from Others ID to the Catalogue table.

Category Info	Type	Size	Special
ID	INT		PRIMARY_KEY (1)
NAME	VARCHAR	30
DESCRIPTION	VARCHAR	30
IMAGE	VARCHAR	30

Element Info	Type	Size	Special
ID	LONGINT
Element Category	INT		PRIMARY_KEY (2)
Name	VARCHAR	50
Symbol	VARCHAR	5
Image	VARCHAR	50
Atomic weight	FLOAT(1)
Atomic Number	INT
Classification	VARCHAR	50
CAS registry	LONGINT
Block	VARCHAR	5	NON_ZERO
Outer shell	VARCHAR	50

Other	Type	Size	Special
ID	INT		PRIMARY_KEY (3)
Discovered	INT
Colour	VARCHAR	20
Oxidation states	VARCHAR	30
Allotropes	VARCHAR	200

At the end of the second table, there is an "others" id. This is the anchor from which the ID in the Others table is attached. It would be pointless to try and add further to these tables as they are pretty much in their clearest format.

VARCHAR has been used for the images. While I could have used BLOB, using VARCHAR allows me to say, "okay, the URL for the images is going to be fixed, this is just the filename".

Finally, I have "Last ordered" set as a DATE which is ZEROFILLed. This means I can set the date myself which is more useful than reliance on a TIMESTAMP (useful as it means I can say that I had the stock before the database was set up!)

Creating the Database

Simple enough.

mysql> create database theelements;
mysql> use theelements;
mysql> create table catagoryinfo (
  `id` smallint(6) NOT NULL auto_increment,
  `name` varchar(30) NOT NULL default '',
  `description` varchar(30) NOT NULL default '',
  `image` varchar(50) NOT NULL default '',
  PRIMARY KEY  (`id`)
);

mysql> create table elementinfo (
  `id` longint NOT NULL auto_increment,
  `elementcatagory` int NOT NULL,
  `name` varchar(50) NOT NULL default '',
  `symbol` varchar(5) NOT NULL default '',
  `image` varchar(50) NOT NULL default '',
  `atweight` float(1) NOT NULL,
  `atnumber` int NOT NULL,
  `classification` varchar(50) NOT NULL default '',
  `casreg` longint NOT NULL,
  `block` varchar(5) NOT NULL,
  `outershell` varchar(5) NOT NULL default '',
  PRIMARY KEY  (`id`)
);

mysql> create table other(
  `id` smallint(6) NOT NULL auto_increment,
  `discovered` int NOT NULL,
  `colour` varchar(20) NOT NULL default '',
  `oxstates` varchar(50) NOT NULL default '',
  `allotropes` varchar(200) NOT NULL default '',
  PRIMARY KEY  (`id`)
);

That's the tables set. Next stage is to get the data in. As the datasets are somewhat large for this article, I've placed them on my website [_3] - you will need to grab the file article1.zip and de-archive it; I would suggest saving it to your CSD^[2]. There are three files: catinfo.csv, elemitems.csv and other.csv. These will need importing into your database and it is very easy to do.

mysql> LOAD DATA LOCAL infile 'catinfo.csv' INTO TABLE catagoryinfo 
FIELDS TERMINATED BY ',' LINES TERMINATED BY '\n' (id, name, description, image);

Repeat for the other files. The only difference being to change the filename and the contents of the () to reflect the names given to the table rows. Once imported, you should get the following message back from the MySQL server

Query OK, 10 rows affected, 0 warnings (0.51 secs)
Records: 10 Deleted: 0 Skipped: 0 Warnings: 0

This means the data has been read in happily. A quick test to show that the data is indeed in the table should confirm this

mysql> select name,description from catagoryinfo;

This will display all of the category names and their descriptions from the catagoryinfo table. Similar tests can be performed for the other tables.

I'll leave this article at this point. Next time, I'll cover SQL manual insertion, manual alteration and manual deletion and introduce the C# command line interface which I will subsequently be developing.

I must thank the following people for helping me out on this project:

Paul Grenyer and Patrick De Ridder - both of whom have tested the compiled binaries under the .NET framework to ensure they have worked.
Novell - for the production and development of Mono [_4].
Steve Hopley - my MySQL tutor at St. Helens College [_5] and a good friend who helped me design the database these articles are based on.
Dr. Alex Woods - my former Physical Chemistry lecturer at Liverpool John Moores University, who inspired this due to his slightly "forgetful" nature.
Pippa Hennessy & Laurence Murphy - proof reading.

Disclaimer

No furry animals were hurt during the production of this article. I did step on something while chasing some information, but it didn't squeak, so I'm not going to count that.

Webliography

[_1] http://www.mysql.com/products/mysqlcc/

[_2] http://www.phpmyadmin.net/home_page/

[_3] http://www.all-the-johnsons.co.uk/accu/articles

[_4] Novell. The Mono Project. http://www.mono-project.com

[_5] St Helens College. http://www.sthelens.ac.uk

^[1] Your definition of fun may vary from that of the author. The ACCU cannot be held responsible for any discomfort felt that you may experience in the pursuit of what you consider fun. Rice pudding does not mix well with electricity, but is very nice with freshly made blackcurrant and gooseberry jam.

^[2] On my Linux machines, that would be /home/paul. I would suggest for those using Windows using C:\ rather than anywhere else.

Notes:

More fields may be available via dynamicdata ..