LDP/LDP/ref/docbook/CVS-BestPractices/CVS-BestPractices.sgml

822 lines
28 KiB
Plaintext

<!DOCTYPE ARTICLE PUBLIC "-//OASIS//DTD DocBook V4.1//EN" [
<!-- Document version -->
<!ENTITY DOCVERSION "0.2">
<!-- File Includes -->
<!ENTITY GFDL-FILE SYSTEM "gfdl.sgml">
<!-- Text substitution macros -->
<!ENTITY CVS "Concurrent Versions System">
<!ENTITY OPENSOURCE "Open Source">
<!ENTITY CVSAB "CVS">
<!ENTITY SCMAB "SCM">
<!ENTITY SCM "Software configuration management">
<!ENTITY MYEMAIL "vivekv at users.sourceforge.net">
]>
<article>
<title>CVS Best Practices</title>
<articleinfo>
<author>
<firstname>Vivek</firstname>
<surname>Venugopalan</surname>
<affiliation>
<address><email>vivekv at users.sourceforge.net</email></address></affiliation>
</author>
<revhistory>
<revision>
<revnumber>0.2</revnumber>
<date>2001-11-27</date>
<authorinitials>vv</authorinitials>
<revremark>Incorporated first round of feedback and some minor fixes</revremark></revision>
<revision>
<revnumber>0.1</revnumber>
<date>2001-11-20</date>
<authorinitials>vv</authorinitials>
<revremark>Created</revremark>
</revision>
</revhistory>
</articleinfo>
<abstract>
<indexterm>
<primary>CVS Best Practices</primary>
</indexterm>
<para>This article explores some of the best practices that can be adopted
while using CVS as the configuration management tool in your software
projects.
</para>
</abstract>
<sect1 id="section1-intro">
<title>Introduction</title>
<blockquote>
<attribution>Henry David Thoreau (1817-1862)
</attribution>
<literallayout>Men have become the tools of their tools.
</literallayout>
</blockquote>
<para>This article outlines some of the best practices that can be adopted
when &CVS; is used as the configuration management tool in your software
project. </para>
<para>&CVS; (&CVSAB;) is an <ulink
url="http://www.opensource.org">&OPENSOURCE;</ulink> configuration management
tool that is now being looked at seriously by many commercial organizations as
a viable alternative to other commercial &SCM; tools. </para>
<para>This spotlight on &CVSAB; has led to the inevitable question of best
practices for deploying &CVSAB; as the backbone &SCMAB; tool for large
software development projects. Having answered this question many times
verbally as a bunch of <quote>gotchas</quote> on &CVSAB;, it was time to put
down on paper some of the best practices that will work well for
&CVSAB; based projects. </para>
<!-- Section2: copyright -->
<sect2 id="copyright">
<title>Copyright Information</title>
<para> This document is Copyright &copy; 2001 Vivek Venugopalan. Permission
is granted to copy, distribute and/or modify this document under the terms of
the <link linkend="gfdl"><citetitle>GNU Free Documentation
License</citetitle></link>, Version 1.1 or any later version published by the
Free Software Foundation with no Invariant Sections, no Front-Cover Texts, and
no Back-Cover Texts. A copy of the license can be found in <xref
linkend="gfdl">. </para>
<para> This document may be reproduced and distributed in whole or in part, in
any medium physical or electronic, as long as this copyright notice is
retained on all copies. Commercial redistribution is allowed and encouraged;
however, the author would like to be notified of any such distributions.
</para>
<para> All translations, derivative works, or aggregate works incorporating
this document must be covered under this copyright notice. That is, you may
not produce a derivative work from this document and impose additional
restrictions on its distribution. Exceptions to these rules may be granted
under certain conditions; please contact the author at the address given
below. </para>
<para> In short, we wish to promote dissemination of this information through
as many channels as possible. However, we do wish to retain copyright on the
document, and would like to be notified of any plans to redistribute the same.
</para>
<!-- Section2: disclaimer -->
<sect2 id="disclaimer"> <title>Disclaimer</title>
<para> No liability for the contents of this document can be accepted. Use
the concepts, examples and other content at your own risk. As this is a new
edition of this document, there may be errors and inaccuracies that may of
course be damaging to your system. Proceed with caution, and although this is
highly unlikely, the author(s) do not take any responsibility for that.
</para>
<para> All copyrights are held by their respective owners, unless specifically
noted otherwise. Use of a term in this document should not be regarded as
affecting the validity of any trademark or service mark. </para>
<para> Naming of particular products or brands should not be seen as
endorsements. </para>
<para> You are strongly recommended to take a backup of your system before
major installation and backups at regular intervals. </para> </sect2>
<!-- Section2: newversions-->
<sect2 id="newversions">
<title>New Versions</title>
<para> The version number of this document is &DOCVERSION;. </para>
<para> The latest version of this document can be obtained from </para>
<orderedlist>
<listitem>
<para>
<ulink url="http://www.geocities.com/vivekv/cvs-bestpractices/index-cvs-bestpractices.html">My website</ulink>
</para>
</listitem>
<listitem>
<para>
<ulink url="http://www.linuxdoc.org/doc.html">The linux documentation project</ulink>
</para>
</listitem>
</orderedlist>
</sect2>
<!-- Section2: credits -->
<sect2 id="credits">
<title>Credits</title>
<para>The list of people who have provided inputs and information for this
paper in no particular order are.
<orderedlist>
<listitem>
<para>Jens-Uwe Mager
</para>
</listitem>
<listitem>
<para>Jorgen Grahn
</para>
</listitem>
</orderedlist>
</para>
</sect2>
<!-- Section2: feedback -->
<sect2 id="feedback">
<title>Feedback</title>
<para> Feedback is most certainly welcome for this document. Without your
submissions and input, this document wouldn't exist. Please send your
additions, comments and criticisms to the following email address :
<email>&MYEMAIL;</email>. </para> </sect2>
</sect1>
<sect1 id="section1-focusareas">
<title>Focus Areas</title>
<para>The focus areas for best practice are
</para>
<orderedlist>
<listitem>
<para>GUI Tools
</para>
<itemizedlist>
<listitem>
<para>Use GUI tools for &CVSAB; client
</para>
</listitem>
</itemizedlist>
</listitem>
<listitem>
<para>Developer Sandbox
</para>
<itemizedlist>
<listitem>
<para>Keep System clocks in Sync
</para>
</listitem>
<listitem>
<para>Do not share the sandbox
</para>
</listitem>
<listitem>
<para>Do not work outside the sandbox
</para>
</listitem>
<listitem>
<para>Stay in sync with the repository
</para>
</listitem>
<listitem>
<para>Cleanup after completion
</para>
</listitem>
<listitem>
<para>Check-in often
</para>
</listitem>
</itemizedlist>
</listitem>
<listitem>
<para>Server Configuration
</para>
<itemizedlist>
<listitem>
<para>&CVSAB; Server side scripting
</para>
</listitem>
<listitem>
<para>&CVSAB; Server notification
</para>
</listitem>
</itemizedlist>
</listitem>
<listitem>
<para>Branching and Merging
</para>
<itemizedlist>
<listitem>
<para>Assign ownership to trunk and branches
</para>
</listitem>
<listitem>
<para>Tag each release
</para>
</listitem>
<listitem>
<para>Create a branch after each release
</para>
</listitem>
<listitem>
<para>Make bug fixes to branches only
</para>
</listitem>
<listitem>
<para>Make patch releases from branches only
</para>
</listitem>
</itemizedlist>
</listitem>
<listitem>
<para>Change propagation
</para>
<itemizedlist>
<listitem>
<para>Merge branch with the trunk after each release
</para>
</listitem>
</itemizedlist>
</listitem>
<listitem>
<para>Software Builds
</para>
<itemizedlist>
<listitem>
<para>Build early and build often
</para>
</listitem>
<listitem>
<para>Automate build process completely
</para>
</listitem>
<listitem>
<para>All necessary files must be checked in before a build
</para>
</listitem>
</itemizedlist>
</listitem>
<listitem>
<para>Institutionalize in the Organization
</para>
<itemizedlist>
<listitem>
<para>Implement change management
</para>
</listitem>
<listitem>
<para>Make &CVSAB; usage part of developer's objectives
</para>
</listitem>
<listitem>
<para>Collect metrics on &CVSAB; usage
</para>
</listitem>
</itemizedlist>
</listitem>
</orderedlist>
</sect1>
<!-- Using GUI Tools -->
<sect1 id="section1-guitools">
<title>Using <acronym>GUI</acronym> Tools</title>
<para>The traditional interface available for CVS is the command-line client.
There has also been a slew of GUI client applications that can
<quote>talk</quote> to a &CVSAB; server. These GUI clients provide a
<quote>point and click</quote> interface to the &CVSAB; repository. This
paper recommends using such GUI clients during the initial deployment of
&CVSAB; in an organization.</para>
<para>Developers typically use integrated development environments that have
the CM tools integrated into them. These tools minimize the learning for the
developers about the intricacies of &CVSAB; usage and instead allow them to be
productive from day one. Developers who are accustomed to other CM tools will
find the &CVSAB; command-line interface daunting. The adoption and usage of
&CVSAB; can be improved by using GUI tools for &CVSAB; clients. </para>
<para> GUI tools for &CVSAB; are available at <ulink
url="www.cvsgui.org">www.cvsgui.org</ulink>. GUI interfaces are available for
most of the popular platforms (Windows, Mac and Linux). In addition, on the
Windows platform there is an <acronym>SCC</acronym> extension that allows
integration of &CVSAB; as the configuration control tool with popular
IDE.</para>
</sect1>
<!-- Developer Sandbox -->
<sect1 id="section1-devsandbox">
<title>Developer Sandbox</title>
<para>The developer <quote>sandbox</quote> is where each developer keeps his
or her working copy of the code base. In &CVSAB; this is referred to as the
working directory. This is where they build, test and debug the modules that
they are working on. A sandbox can also be the area where the staging build
or the production build is done. Changes made in the work area are checked
into the &CVSAB; repository. In addition, changes made in the repository by
others have to be updated in the sandbox on a regular basis. </para>
<para>The best practices related to developers sandbox are:
</para>
<sect2 id="section2-clockinsync">
<title>Keep System clocks in Sync</title>
<para>&CVSAB; tracks change to source files by using the timestamp on the
file. If each client system date and time is not in sync, there is a
definite possibility of &CVSAB; getting confused. Thus system clocks must be
kept in sync by use of a central time server or similar mechanism.
</para>
<para>&CVSAB; is designed from ground up to handle multiple timezones. As
long as the host operating system has been setup and configured correctly,
&CVSAB; will be able to track changes correctly.
</para>
</note>
</sect2>
<sect2 id="section2-dontshare">
<title>Do not share the sandbox</title>
<para>Sandboxes have to be unique for each developer or purpose. They
should not be used for multiple things at the same time. A sandbox can be a
working area for a developer or the build area for the final release. If
such sandboxes are shared, then the owner of the sandbox will not be aware
of the changes made to the files resulting in confusion. </para>
<para>In &CVSAB;, the sandbox is created automatically when a working copy is
checked out for a &CVSAB; project using the <command>cvs checkout
{project-name}</command> command. </para>
<para>In very large projects, it does not make sense for the developers to
check-out the entire source into the local sandbox. In such cases, they can
take the binaries generated by the build team on a regular basis for all those
components of the application that is not changed by them and only check-out
the parts that are built by the developer. </para>
<para>For example, in a Java project, the build team can keep the results of
their last successful build in a standard location in the form of JAR files on
the network file servers. Individual developers will use a standard classpath
setup that has the network drives mounted on standard paths. Thus the
developers will automatically get the latest version of the files as required
by them.</para>
</sect2>
<sect2 id="section2-workinside">
<title>Do not work outside the sandbox</title>
<para>The sandbox can be thought of as a controlled area within which &CVSAB;
can track for changes made to the various source files. Files belonging to
other developers will be automatically updated by &CVSAB;. Thus the developer
who lives within the sandbox will stand to gain a lot of benefits of
concurrent development. This benefit cannot be achieved for work done outside
a sandbox. </para>
</sect2>
<sect2 id="section2-syncup">
<title>Stay in Sync with the repository</title>
<para>To gain the benefits of working within a sandbox as mentioned above,
the developer must keep his or her sandbox in sync with the main
repository. A regular <command>cvs update</command> with the appropriate
tag or branch name will ensure that the sandboxes are kept up to date. </para> </sect2>
<sect2 id="section2-cleanupatcompletion">
<title>Cleanup after completion</title>
<para>Make sure that the sandbox is cleaned up after completion of the change.
This can be done in &CVSAB; by using the <command>cvs release</command>
command. This ensures that no old version of the code exists in the
development sandbox. As explained previously, pre-built binaries from the
build team can be used to ensure that all the parts of the application are
available to the developer without the need for a complete compilation in the
sandbox.
</para>
</sect2>
<sect2 id="section2-checkin">
<title>Check-in Often</title>
<para>To help other developers keep their code in sync with your code, you
must check-in your code often into the &CVSAB; repository. The best practice
would be to check-in soon as a piece of code is completed, reviewed and
tested, check-in the changes using a <command>cvs commit</command> to ensure
that your changes are committed to the &CVSAB; repository.
</para>
<para> &CVSAB; promotes concurrent development. Concurrent development is
possible only if all the other developers are aware of the ongoing changes on
a regular basis. </para>
<note>
<para>One of the <quote>bad</quote> practices that commonly occur is sharing
of source code files between developers by email. This works against most of
the best practices mentioned above. To share source code updates between two
developers, &CVSAB; must be used as the communication medium. This will
ensure that &CVSAB; is <quote>aware</quote> of the code change and can track
them. Thus, audit trail can be established if necessary. </para>
</note>
</sect2>
</sect1>
<sect1 id="section1-serverconfig">
<title>Server Configuration</title>
<para>This section deals with best practices for &CVSAB; server side setup and
configuration.
</para>
<sect2 id="section2-scripting">
<title>Server side scripting</title>
<para>Server side scripting refers to the ability to make &CVSAB; server
execute certain scripts when an event occurs. I am currently not aware of any
best practices in this area. Suggestions are welcome.
</para>
</sect2>
<sect2 id="section2-notification">
<title>Server Notification</title>
<para>The &CVSAB; server can be configured to notify through e-mails in the
event of a commit happening. This can be used to verify if code commits are
occurring during the course of a release build. If such commits occur, based
on the project policy, such commits can be ignored or the entire build
automatically restarted.</para>
</sect2>
</sect1>
<sect1 id="section1-branchmerge">
<title>Branching and Merging</title>
<para> Branching in &CVSAB; splits a project's development into separate,
parallel histories. Changes made on one branch do not affect the other
branches. Branching can be used extensively to maintain multiple versions
of a product for providing support and new features. </para>
<para> Merging converges the branches back to the main trunk. In a merge,
CVS calculates the changes made on the branch between the point where it
diverged from the trunk and the branch's tip (its most recent state), then
applies those differences to the project at the tip of the trunk. </para>
<sect2 id="section2-branchowner">
<title>Assign Ownership to Trunk and Branches</title>
<para>The main trunk of the source tree and the various branches should have a
owner assigned who will be responsible for. </para>
<orderedlist>
<listitem>
<para>Keep the list of configurable items for the branch or trunk.
</para>
<para>The owner will be the maintainer of the contents list for the branch or
trunk. This list should contain the item name and a brief description about
the item. This list is essential since new artifacts are always added to or
removed from the repository on an ongoing basis. This list will be able to
track the new additions/deletions to the repository for the respective branch.
</para>
</listitem>
<listitem>
<para>Establish a working policy for the branch or trunk.
</para>
<para>The owner will establish policies for check-in and check-out. The
policy will define when the code can be checked in (after coding or after
review etc.,). Who is responsible to merge changes on the same file and
resolve conflicts (the author or the person who recently changed the file).
</para>
</listitem>
<listitem>
<para>Identify and document policy deviations
</para>
<para>Policies once established tend to have exceptions. The owner will be
responsible for identifying the workaround and tracking/documenting the same
for future use. </para>
</listitem>
<listitem>
<para>Responsible for merge with the trunk
</para>
<para>The branch owner will be responsible for ensuring that the changes in
the branch can be successfully merged with the main trunk at a reasonable point
in time. </para>
</listitem>
</orderedlist>
</sect2>
<sect2 id="section2-tagrelease"> <title>Tag each release</title> <para>After
each release, the entire code base must be tagged with an identifier that
can help in uniquely identifying the release. A tag gives a label to the
collection of revisions represented by one developer's working copy
(usually, that working copy is completely up to date so the tag name is
attached to the <quote>latest and greatest</quote> revisions in the
repository). </para>
<para>The identifier for the tag should provide enough information to
identify the release at any point in time in the future. One suggested tag
identifier is of the form. </para>
<literallayout>
<literal>release_</literal>{major version #}_{minor version #}
</literallayout>
</sect2>
<sect2 id="section2-branchatrelease">
<title>Create a branch after each release</title>
<para>After each software release, once the &CVSAB; repository is tagged, a
branch has to be immediately created. This branch will serve as the bug fix
branch for that release. This branch is created only if the release is not
a bug fix or patch release. Patches that have to be made for this release
at any point in time in the future will be developed on this branch. The
main trunk will be used for ongoing product development. </para>
<para>With this arrangement, the changes in the code for the ongoing
development will be on the main trunk and the branch will provide a separate
partition for hot fixes and bug fix releases. </para>
<para>The identifier for the branch name can be of the form. </para>
<literallayout>
<literal>release_</literal>{major version #}_{minor version #}<literal>_patches</literal>
</literallayout>
</sect2>
<sect2 id="section2-bugfixbranches">
<title>Make bug fixes to branches only</title>
<para>This practice extends from the previous practice of creating a
separate branch after a major release. The branch will serve as the code
base for all bug fixes and patch release that have to be made. Thus, there
is a separate repository <quote>sandbox</quote> where the hot fixes and
patches can be developed apart from the mainstream development. </para>
<para>This practice also ensures that bug fixes done to previous releases do
not mysteriously affect the mainstream version. In addition, new features
added to the mainstream version do not creep into the patch release
accidentally. </para>
</sect2>
<sect2 id="section2-patchesfrombranches">
<title>Make patch release from branches only</title>
<para>Since all the bug fixes for a given release are done on its
corresponding branch, the patch releases are made from the branch. This
ensures that there is no confusion on the feature set that is released as
part of the patch release. </para>
<para>After the patch release is made, the branch has to be tagged using the
release tagging practice (see <xref linkend="section2-tagrelease">). </para>
</sect2>
</sect1>
<sect1 id="section1-chgpropagation">
<title>Change Propagation</title>
<para>Change propagation practices explore how changes made to one version of
the application are migrated to other living versions of the application.
</para>
<sect2 id="section2-mergebugfix">
<title>Merge branch with the trunk after release</title>
<para>After each release from a branch, the changes made to the branch
should be merged with the trunk. This ensures that all the bug fixes made
to the patch release are properly incorporated into future releases of the
application. </para>
<para>This merge could potentially be time consuming depending on the amount
of changes made to the trunk and the branch being merged. In fact, it will
probably result in a lot of conflicts in &CVSAB; resulting in manual merges.
After the merge, the trunk code base must be tested to verify that the
application is in proper working order. This must be kept in mind while
preparing the project schedule. </para>
<para>In the case of changes occuring on branches for a long period of time,
these changes can be merged to the main branch on a regular basis even
before the release is made. The frequency of merge is done based on certain
logical points in the branch's evolution. To ensure that duplicate merging
does not occur, the following practice can be adopted. </para>
<para>In addition to the branch tag, a tag called {branch_name}_MERGED
should be created. This is initially at the same level as the last release
tag for the branch. This tag is then <quote>moved</quote> after each
intermediate merge by using the <command>-F</command> option. This
eliminates duplicate merging issues during intermediate merges. </para>
</sect2>
</sect1>
<sect1 id="section1-softwarebuild">
<title>Software Builds</title>
<para>This section deals with the best practices for software builds. Build
is the process of creating the application binaries for a software release.
They are done in a periodic manner in development teams to provide baseline
binaries for daily work. </para>
<sect2 id="section2-bebo">
<title>Build Early and Build Often (<acronym>BEBO</acronym>) </title>
<para>A slight variation of this adage has been around in the &OPENSOURCE;
community called "Release Early and Release Often" for quite some time
albeit for a different reason. BEBO helps a development team identify
issues that can arise from checking in the wrong files. BEBO will address
integration issues at the application level that might have slipped passed
individual developer builds. It will also improve the team morale when they
see a working version of the application.</para>
<para>Builds must be done on a regular basis. There should be a dedicated
resource(s) assigned to do the same. The entire project team must be
trained to view the daily build as an important activity and not as a chore.
Builds must be completed without any failures on a regular basis. Build
failures must be a rare event and should be treated with utmost seriousness.
The project team should ensure that successful builds are top priority on
their agenda. The seriousness can be emphasised by setting up a penalty for
breaking the build. </para>
</sect2>
<sect2 id="section2-automate">
<title>Automate Build Process completely</title>
<para>Another key practice for software builds is to automate the build
process completely. The automation process must also include automatic
retrieval of the right source files from the &CVSAB; repository. This
ensures that the build process is completely repeatable and consistent. In
addition, the chances of a build with the wrong version of the application
source files are reduced to a large degree. </para>
<para>By automating the build process, the task of building often becomes
less burdensome. </para>
</sect2>
<sect2 id="section2-ensurecheckin">
<title>All necessary files must be checked before build</title>
<para>This adage sounds trivial at first but this problem is very common
even with experienced development teams due to oversight. The problem of
oversight cannot be easily addressed since the onus is on the individual
developer to ensure that his or her file has been checked in. This practice
should be drummed into the team in the form of trainings and pre-build
announcements to ensure that the right version of source code is available
in the repository. </para>
<para>Automated build process as explained above will help in catching this
problem to a certain degree since they will automatically take the source
code from the &CVSAB; repository and perform the software build. Any missed
items will surface during the build process itself (makefiles etc.,) or
during the regression testing of the product (older version of the file
checked in). </para>
<para>A penalty based system can be setup to handle wrong check-in. Having a
kitty for a post project party to which each person who makes a wrong check-in
will contribute a fixed amount will act a good penalty system. </para>
</sect2>
</sect1>
<sect1 id="section1-instprocess">
<title>Institutionalize &CVSAB; in the Organization</title>
<para>Here we will look at the best practices for institutionalizing &CVSAB;
usage in the organization.
</para>
<sect2 id="section2-chngmgmt">
<title>Implement Change Management Process</title>
<para>All organizations must implement a good Change management process
(<acronym>CMP</acronym>). A good CMP will define how changes are received
and recorded, tracked, executed and delivered. &CVSAB; provides version
control for your project. Change management addresses the <quote>bigger
picture</quote> of how enhancements and bugs are received, tracked and
closed. &CVSAB; will play a smaller but a very important part in this
entire picture. With a formal change management process in place in the
organization, tools such as &CVSAB; will be looked at as aiding this process
instead of acting as a general development overhead. </para>
<para>Change management is quite a vast topic that cannot be done justice
here. Please look up some websites on change management. </para>
</sect2>
<sect2 id="section2-objectives">
<title>Make &CVSAB; Usage part of Objectives</title>
<para>To institutionalize &CVSAB;, it can be made as part of the annual
objectives for the developer to use it as part of his or her project. In addition,
it can also be made as part of the objective for the project manager to deploy
&CVSAB; in his or her project. </para>
<para>Compliance of this can then be reviewed as part of the appraisal cycle
for the employee. </para>
</sect2>
<sect2 id="section2-metrics">
<title>Collect metrics on &CVSAB; usage</title>
<para>&CVSAB; usage metrics can be collected in terms of percentage of
deployment in the organization, project size handled etc., This information
will spur other line managers and program managers to look at &CVSAB; as a
tool that will aid them in their daily operations. </para>
</sect2>
</sect1>
<sect1 id="section1-conclusion">
<title>Conclusion</title>
<para>These best practices are meant to help software teams get a head start
on using &CVSAB; for their development. The ideas presented here have to be
constantly reviewed and evolved. I would like this to be a growing and
evolving document. Please send your comments and ideas to
<email>&MYEMAIL;</email> </para>
</sect1>
<!-- Include the GNU FDL -->
&GFDL-FILE;
</article>