API Design Principles: Difference between revisions

From Qt Wiki
Jump to navigation Jump to search
No edit summary
 
No edit summary
Line 1: Line 1:
'''English''' [[API-Design-Principles-Russian|Русском]]


=<span class="caps">API</span> Design Principles=
One of Qt’s most reputed merits is its consistent, easy-to-learn, powerful <span class="caps">API</span>. This document tries to summarize the know-how we’ve accumulated on designing Qt-style <span class="caps">API</span>s. Many of the guidelines are universal; others are more conventional, and we follow them primarily for consistency with existing <span class="caps">API</span>s.
Although these guidelines are aimed primarily at public <span class="caps">API</span>s, you are encouraged to use the same techniques when designing internal <span class="caps">API</span>s, as a courtesy to your fellow developers.
You may also be interested to read Jasmin Blanchette’s [http://www4.in.tum.de/~blanchet/api-design.pdf Little Manual of <span class="caps">API</span> Design] ''[.in.tum.de]'' or its predecessor [http://doc.qt.digia.com/qq/qq13-apis.html Designing Qt-Style C++ <span class="caps">API</span>s] ''[doc.qt.digia.com]'' by Matthias Ettrich.
==Six Characteristics of Good <span class="caps">API</span>s==
An <span class="caps">API</span> is to the programmer what a <span class="caps">GUI</span> is to the end-user. The ‘P’ in <span class="caps">API</span> stands for “Programmer”, not “Program”, to highlight the fact that <span class="caps">API</span>s are used by programmers, who are humans.
In his [http://doc.qt.nokia.com/qq/qq13-apis.html Qt Quarterly 13 article about <span class="caps">API</span> design] ''[doc.qt.nokia.com]'', Matthias tells us he believes that <span class="caps">API</span>s should be minimal and complete, have clear and simple semantics, be intuitive, be easy to memorize, and lead to readable code.
====Be minimal====
A minimal <span class="caps">API</span> is one that has as few public members per class and as few classes as possible. This makes it easier to understand, remember, debug, and change the <span class="caps">API</span>.
====Be complete====
A complete <span class="caps">API</span> means the expected functionality should be there. This can conflict with keeping it minimal. Also, if a member function is in the wrong class, many potential users of the function won’t find it.
====Have clear and simple semantics====
As with other design work, you should apply the principle of least surprise. Make common tasks easy. Rare tasks should be possible but not the focus. Solve the specific problem; don’t make the solution overly general when this is not needed. (For example, QMimeSourceFactory in Qt 3 could have been called QImageLoader and have a different <span class="caps">API</span>.)
====Be intuitive====
As with anything else on a computer, an <span class="caps">API</span> should be intuitive. Different experience and background leads to different perceptions on what is intuitive and what isn’t. An <span class="caps">API</span> is intuitive if a semi-experienced user gets away without reading the documentation, and if a programmer who doesn’t know the <span class="caps">API</span> can understand code written using it.
====Be easy to memorize====
To make the <span class="caps">API</span> easy to remember, choose a consistent and precise naming convention. Use recognizable patterns and concepts, and avoid abbreviations.
====Lead to readable code====
Code is written once, but read (and debugged and changed) many times. Readable code may sometimes take longer to write, but saves time throughout the product’s life cycle.
Finally, keep in mind that different kinds of users will use different parts of the <span class="caps">API</span>. While simply using an instance of a Qt class should be intuitive, it’s reasonable to expect the user to read the documentation before attempting to subclass it.
==Static Polymorphism==
Similar classes should have a similar <span class="caps">API</span>. This can be done using inheritance where it makes sense — that is, when run-time polymorphism is used. But polymorphism also happens at design time. For example, if you exchange a QProgressBar with a QSlider, or a QString with a QByteArray, you’ll find that the similarity of <span class="caps">API</span>s makes this replacement very easy. This is what we call “static polymorphism”.
Static polymorphism also makes it easier to memorize <span class="caps">API</span>s and programming patterns. As a consequence, a similar <span class="caps">API</span> for a set of related classes is sometimes better than perfect individual <span class="caps">API</span>s for each class.
In general, in Qt, we prefer to rely on static polymorphism than on actual inheritance when there’s no compelling reason to do otherwise. This keeps the number of public classes in Qt down and makes it easier for new Qt users to find their way around in the documentation.
==Good==
<nowiki>:QDialogButtonBox and QMessageBox have similar </nowiki><span class="caps">API</span>s for dealing with buttons (addButton(), setStandardButtons(), etc.), without publicly inheriting from some “QAbstractButtonBox” class.
==Bad==
<nowiki>:QAbstractSocket is inherited both by QTcpSocket and QUdpSocket, two classes with very different modes of interaction. Nobody seems to have ever used (or been able to use) a QAbstractSocket pointer in a generic and useful way.</nowiki>
==Dubious==
<nowiki>:QBoxLayout is the base class of QHBoxLayout and QVBoxLayout. Advantage: Can use a QBoxLayout and call setOrientation() in a toolbar to make it horizontal/vertical. Disadvantages: One extra class, and possibility for users to write ((QBoxLayout *)hbox)-&gt;setOrientation(Qt::Vertical), which makes little sense.</nowiki>
==Property-Based <span class="caps">API</span>s==
Newer Qt classes tend to have a “property-based <span class="caps">API</span>”. E.g.:
By ''property'', we mean any conceptual attribute that’s part of the object’s state — whether or not it’s an actual Q_PROPERTY. When practicable, users should be allowed to set the properties in any order; i.e., the properties should be orthogonal. For example, the preceding code could be written
For ‘‘convenience‘‘, we can also write
Similarly, for QRegExp, we have
To implement this type of <span class="caps">API</span>, it pays off to construct the underlying object lazily. E.g. in QRegExp’s case, it would be premature to compile the “***.*” pattern in setPattern() without knowing what the pattern syntax will be.
Properties often cascade; in that case, we must proceed carefully. Consider the “default icon size” provided by the current style vs. the “iconSize” property of QToolButton:
Notice that once we set iconSize, it stays set; changing the current style doesn’t change a thing. This is '''good'''. Sometimes, it’s useful to be able to reset a property. Then there are two approaches:
* pass a special value (such as QSize(), -1, or Qt::Alignment(0)) to mean “reset”
* have an explicit resetFoo() or unsetFoo() function
For iconSize, it would be enough to make QSize() (i.e., QSize(-1, -1)) mean “reset”.
In some cases, getters return something different than what was set. E.g. if you call widget-&gt;setEnabled(true), you might still get widget-&gt;isEnabled() return false, if the parent is disabled. This is OK, because that’s usually what we want to check (a widget whose parent is disabled should be grayed out too and behave as if it were disabled itself, at the same time as it remembers that deep inside, it really is “enabled” and waiting for its parent to become enabled again), but must be documented properly.
==C++ Specifics==
===Value vs. Object===
===Pointers vs. References===
Which is best for out-parameters, pointers or references?
Most C++ books recommend references whenever possible, according to the general perception that references are “safer and nicer” than pointers. In contrast, we at Qt Software tend to prefer pointers because they make the user code more readable. Compare:
Only the first line makes it clear that there’s a high probability that h, s, and v will be modified by the function call.
===Virtual Functions===
When a member function of a class is declared virtual in C++, it’s primarily to allow customizing the behavior of the function through overriding it in a custom subclass. The purpose of making the function virtual is so existing calls to that function will visit your code path instead. If nobody outside of the class calls this function, you should be very careful before you declare it as virtual.
When QTextEdit was ported from Qt 3 to Qt 4, almost all virtual functions were removed. Interestingly (but not unexpected), there were no big complaints Why? Because Qt 3 didn’t make use of polymorphism for QTextEdit; Qt 3 doesn’t call these functions – you do. In short, there was no reason to subclass QTextEdit and reimplement these functions unless you called these functions yourself. If you needed polymorphism in your application outside of Qt, you would add polymorphism yourself.
====Avoiding virtual functions====
In Qt, we try to minimize the number of virtual functions for a number of reasons. Each virtual call complicates bugfixing through inserting an uncontrolled node in the call graph (making the outcome somewhat unpredictable). People do crazy things from inside a reimplementation of a virtual function, such as:
* sending events
* emitting signals
* reentering the event loop (e.g., by opening a modal file dialog)
* deleting the object (i.e., somehow causing “delete this”)
There are many other reasons to avoid excessive use of virtual functions:
* you cannot add, move or remove virtual functions without breaking BC
* you cannot easily override a virtual function
* compilers can almost never optimize or inline calls to virtual functions
* calling the function requires a v-table lookup, making it 2-3 times slower than a normal function
* virtual functions make the class hard to copy by value (possible, but very messy and discouraged)
Experience has taught us that a class with no virtual functions tends to have fewer bugs and generally causes less maintenance.
A general rule of thumb is that unless we as a toolkit and primary users of this class call that function, it should probably not be virtual.
====Virtualness vs. copyability====
Polymorphic objects and value-type classes are not good friends.
Classes with virtual functions must declare a virtual destructor to avoid memory leaks as the base class is destroyed without cleaning up data in the subclass.
If you want to be able to copy and assign to a class, or compare by value, you probably need a copy constructor, an assignment operator and an equals-operator.
If you create subclasses of this class, unexpected things can start happening in your code. Normally, if there are no virtual functions and no virtual destructor, people cannot not create a subclass and rely on polymorphism. However if you add virtual functions, or a virtual destructor, there suddenly becomes a reason to create the subclass, and now things get complicated. ''At first glance it’s easy to think you can simply declare virtual operators''. But wandering down this path can and will lead to chaos and destruction (read: unreadable code). Studying the following example:
(this section is under construction)
===Constness===
C++ provides the keyword “const” to signify that something will not change or have side effects. This applies to simple values, to pointers and what’s pointed to, and as a special attribute to functions that don’t change the state of the object.
Note however that const does not provide much value in itself – many languages don’t even provide any “const” keyword, but that doesn’t automatically render them deficient for that reason. In fact, if you remove function overloads and use search and replace to remove all occurrances of the keyword “const” from your C++ source code, it’s very likely to compile and work just fine. It’s important to keep a pragmatic approach to the use of “const”.
Let’s walk through some areas that use “const” that are relevant to <span class="caps">API</span> design in Qt:
====Input arguments: const pointers====
Const functions that take input pointer arguments almost always take const pointer arguments.
If the function is really declared const, it means it will neither have side effects, nor alter the visible state of its object. So why should it require a non-const input argument? Remember that const functions are often called from within other const functions, and from there, non-const pointers are hard to come by (without a const_cast, and we really like to avoid const_cast where we can).
Before:
QWidget declares many const functions that take non-const pointer input arguments. Note that the function is allowed to modify the widget, but not itself. Functions like these are often accompanied by const_casts. It would have been nice if these functions took const pointer arguments:
After:
Note that we fixed this in QGraphicsItem, but QWidget must wait until Qt 5:
====Return values: const values====
The result of calling a function that does not return a reference is an R-value.
Non-class R-values always have cv-unqualified type. So even if it is syntactically<br /> possible to add a “const” on them it does not make much sense as it won’t change <br /> anything regarding access rights.<br /> Most modern compilers will print a warning when compiling such code.
When adding a “const” to a class type R-values access to non-const<br /> member functions is prohibited as well as direct manipulation of it members.
Not adding a “const” allows such access, but is rarely needed as the changes<br /> end with the life time of the R-value object, which will usually happen at the<br /> end of the full-[removed]loosely spoken “at the next semicolon”).
Example:
====Return values: pointers vs. const pointers====
On the subject of whether const functions should return pointers or const pointers, this is where most people find that the concept of “const correctness” falls apart in C++. The problem starts when const functions, which do not modify their own state, return a non-const pointer to a member. The simple act of returning this pointer does not affect the object’s visible state, nor does it change the state of its responsibilities. But it does give the programmer indirect access to modify the object’s data.
This example shows one of the many ways to circumvent constness using const functions that return non-const pointers:
Functions that return const pointers do protect against this (perhaps unwanted / unexpected) side-effect, at least to a certain degree. But which functions would you prefer to return a const pointer, or a list thereof? If we take the const-correct approach, every const function that returns a pointer to one of its members (or a list-of-pointers-to-members), must return a const pointer. In practise this unfortunately leads to unusable <span class="caps">API</span>s:
In Qt we use the non-const pattern almost exclusively. We’ve chosen a pragmatic approach: Returning const pointers is more likely to result in excessive use of const_cast than what problems arise from abusing non-const pointer return types.
====Return values: by value or const reference?====
If we hold a copy of the object to return, returning a const reference is the fastest approach; however, this restrains us later on if we want to refactor the class. (Using the d-pointer idiom, we can change the memory representation of Qt classes at any time; but we cannot change a function’s signature from “const QFoo &amp;” to “QFoo” without breaking binary compatibility.) For this reason, we generally return “QFoo” rather than “const QFoo &amp;”, except in a few cases where speed is critical and refactoring isn’t an issue (e.g. QList::at()).
====Const vs. the state of an object====
Const correctness is a vi-emacs discussion in C++, because the topic is broken in several areas (such as pointer-based functions).
But the general rule is that a const function does not alter the visible state of a class. State means “me and my responsibilities”. That’s doesn’t mean that non-const functions change their own private data members, nor that const functions cannot. But that the function is active, and has visible side effects. const functions in general do not have any visible side effects. Like:
A delegate is responsible for drawing onto something else. Its state includes its responsibilities, and therefore includes the state of what it draws upon. Asking it to draw does have side effects; it changes the appearance (and with that, the state) of the device it’s painting on. Because of that, it does not make sense that paint() is const. Neither does it make sense that any of Interview’s paint()s or QIcon’s paint() are const. Nobody would call QIcon::paint() from inside a const function unless they explicily want to void the constness of that function. And in that case, an explicit const_cast is better.
The const keyword does no “work” for you. Consider removing them rather than having overloaded const/non-const versions of a function.
==<span class="caps">API</span> Semantics and Documentation==
What should you do when you pass -1 to a function? etc…
Warnings/fatals/etc
<span class="caps">API</span>s need quality assurance. The first revision is never right; you must test it. Make use cases by looking at code which uses this <span class="caps">API</span> and verify that the code is readable.
Other tricks include having somebody else use the <span class="caps">API</span> with or without documentation and documenting the class (both the class overview and the individual functions).
==The Art of Naming==
Naming is probably the single most important issue when designing an <span class="caps">API</span>. What should the classes be called? What should the member functions be called?
===General Naming Rules===
A few rules apply equally well to all kinds of names. First, as I mentioned earlier, do not abbreviate. Even obvious abbreviations such as “prev” for “previous” don’t pay off in the long run, because the user must remember which words are abbreviated.
Things naturally get worse if the <span class="caps">API</span> itself is inconsistent; for example, Qt 3 has activatePreviousWindow() and fetchPrev(). Sticking to the “no abbreviation” rule makes it simpler to create consistent <span class="caps">API</span>s.
Another important but more subtle rule when designing classes is that you should try to keep the namespace for subclasses clean. In Qt 3, this principle wasn’t always followed. To illustrate this, we will take the example of a QToolButton. If you call name(), caption(), text(), or textLabel() on a QToolButton in Qt 3, what do you expect? Just try playing around with a QToolButton in Qt Designer:
* The name property is inherited from QObject and refers to an internal object name that can be used for debugging and testing.
* The caption property is inherited from QWidget and refers to the window title, which has virtually no meaning for QToolButtons, since they usually are created with a parent.
* The text property is inherited from QButton and is normally used on the button, unless useTextLabel is true.
* The textLabel property is declared in QToolButton and is shown on the button if useTextLabel is true.
In the interest of readability, name is called objectName in Qt 4, caption has become windowTitle, and there is no longer any textLabel property distinct from text in QToolButton.
Documenting is also a good way of finding good names when you get stuck: just try to document the item (class, function, enum value, etc.) and use your first sentence as inspiration. If you cannot find a precise name, this is often a sign that the item shouldn’t exist. If everything else fails and you are convinced that the concept makes sense, invent a new name. This is, after all, how “widget”, “event”, “focus”, and “buddy” came to be.
===Naming Classes===
Identify groups of classes instead of finding the perfect name for each individual class. For example, All the Qt 4 model-aware item view classes are suffixed with View (QListView, QTableView, and QTreeView), and the corresponding item-based classes are suffixed with Widget instead (QListWidget, QTableWidget, and QTreeWidget).
===Naming Enum Types and Values===
When declaring enums, we must keep in mind that in C++ (unlike in Java or C#), the enum values are used without the type. The following example shows illustrates the dangers of giving too general names to the enum values:
In the last line, what does Insensitive mean? One guideline for naming enum types is to repeat at least one element of the enum type name in each of the enum values:
When enumerator values can be OR’d together and be used as flags, the traditional solution is to store the result of the OR in an int, which isn’t type-safe. Qt 4 offers a template class QFlags&lt;T&gt;, where T is the enum type. For convenience, Qt provides typedefs for the flag type names, so you can type Qt::Alignment instead of QFlags&lt;Qt::AlignmentFlag&gt;.
By convention, we give the enum type a singular name (since it can only hold one flag at a time) and the “flags” type a plural name. For example:
In some cases, the “flags” type has a singular name. In that case, the enum type is suffixed with Flag:
===Naming Functions and Parameters===
The number one rule of function naming is that it should be clear from the name whether the function has side-effects or not. In Qt 3, the const function QString::simplifyWhiteSpace() violated this rule, since it returned a QString instead of modifying the string on which it is called, as the name suggests. In Qt 4, the function has been renamed QString::simplified().
Parameter names are an important source of information to the programmer, even though they don’t show up in the code that uses the <span class="caps">API</span>. Since modern <span class="caps">IDE</span>s show them while the programmer is writing code, it’s worthwhile to give decent names to parameters in the header files and to use the same names in the documentation.
===Naming Boolean Getters, Setters, and Properties h3.===
Finding good names for the getter and setter of a bool property is always a special pain. Should the getter be called checked() or isChecked()? scrollBarsEnabled() or areScrollBarEnabled()?
In Qt 4, we used the following guidelines for naming the getter function:
* Adjectives are prefixed with is-. Examples:
** isChecked()
** isDown()
** isEmpty()
** isMovingEnabled()
* However, adjectives applying to a plural noun have no prefix:
** scrollBarsEnabled(), not areScrollBarsEnabled()
* Verbs have no prefix and don’t use the third person (-s):
** acceptDrops(), not acceptsDrops()
** allColumnsShowFocus()
* Nouns generally have no prefix:
** autoCompletion(), not isAutoCompletion()
** boundaryChecking()
* Sometimes, having no prefix is misleading, in which case we prefix with is-:
** isOpenGLAvailable(), not openGL()
** isDialog(), not dialog() (From a function called dialog(), we would normally expect that it returns a QDialog **.)
The name of the setter is derived from that of the getter by removing any is prefix and putting a set at the front of the name; for example, setDown() and setScrollBarsEnabled(). The name of the property is the same as the getter, but without the is prefix.
==Avoiding Common Traps==
===The Convenience Trap===
It is a common misconception that the less code you need to achieve something, the better the <span class="caps">API</span>. Keep in mind that code is written more than once but has to be understood over and over again. For example,
is much harder to read (and even to write) than
===The Boolean Parameter Trap===
Boolean parameters often lead to unreadable code. In particular, it’s almost invariably a mistake to add a bool parameter to an existing function. In Qt, the traditional example is repaint(), which takes an optional bool parameter specifying whether the background should be erased (the default) or not. This leads to code such as
which beginners might read as meaning, “Don’t repaint!”
The thinking is apparently that the bool parameter saves one function, thus helping reducing the bloat. In truth, it adds bloat; how many Qt users know by heart what each of the next three lines does?
A somewhat better <span class="caps">API</span> might have been
In Qt 4, we solved the problem by simply removing the possibility of repainting without erasing the widget. Qt 4’s native support for double buffering made this feature obsolete.
Here are a few more examples:
An obvious solution is to replace the bool parameters with enum types. This is what we’ve done in Qt 4 with case sensitivity in QString. Compare:
===The Copy Cat Trap===
==Case Studies==
===QProgressBar===
To show some of these concepts in practice, we’ll study the QProgressBar <span class="caps">API</span> of Qt 3 and compare it to the Qt 4 <span class="caps">API</span>. In Qt 3:
The <span class="caps">API</span> is quite complex and inconsistent; for example, it’s not clear from the naming that reset(), setTotalSteps(), and setProgress() are tightly related.
The key to improve the <span class="caps">API</span> is to notice that QProgressBar is similar to Qt 4’s QAbstractSpinBox class and its subclasses, QSpinBox, QSlider and QDial. The solution? Replace progress and totalSteps with minimum, maximum and value. Add a valueChanged() signal. Add a setRange() convenience function.
The next observation is that progressString, percentage and indicator really refer to one thing: the text that is shown on the progress bar. Usually the text is a percentage, but it can be set to anything using the setIndicator() function. Here’s the new <span class="caps">API</span>:
By default, the text is a percentage indicator. This can be changed by reimplementing text().
The setCenterIndicator() and setIndicatorFollowsStyle() functions in the Qt 3 <span class="caps">API</span> are two functions that influence alignment. They can advantageously be replaced by one function, setAlignment():
If the programmer doesn’t call setAlignment(), the alignment is chosen based on the style. For Motif-based styles, the text is shown centered; for other styles, it is shown on the right hand side.
Here’s the improved QProgressBar <span class="caps">API</span>:
===QAbstractPrintDialog &amp; QAbstractPageSizeDialog===
Qt 4.0 saw the apparition of two classes QAbstractPrintDialog and QAbstractPageSizeDialog that served as base classes for QPrintDialog and QPageSizeDialog. This served no purpose at all, since none of Qt’s <span class="caps">API</span>s take a QAbstractPrint- or -PageSizeDialog pointer as an argument and perform some operation on it. Using qdoc trickery, we’ve hidden them, but they’re the prototypical examples of needless abstract classes.
This is not to say ''good'' abstraction is wrong, and indeed QPrintDialog probably should have a factory or some other mechanism for changing it – as evidenced by the #ifdef <span class="caps">QTOPIA</span>_PRINTDIALOG in its declaration.
===QAbstractItemModel===
The details of the problems with model/view in Qt 4 are documented well elsewhere, but an important generalization is that “QAbstractFoo” should not just be the union of all possible subclasses you can think of at the time of writing. Such “union of all things” base classes are almost never a good solution. QAbstractItemModel commits this error – it is really just QTreeOfTablesModel, with the consequently complicated <span class="caps">API</span> that causes… and which is then ''inherited by all the nicer subclasses'',
Just adding abstraction does not make an <span class="caps">API</span> better automatically.
===QLayoutIterator &amp; QGLayoutIterator===
In Qt 3, creating a custom layout involved subclassing both QLayout and QGLayoutIterator (“G” stands for generic). A QGLayoutIterator subclass instance pointer was wrapped in a QLayoutIterator, which users could use like any other iterator class. QLayoutIterator made it possible to write code like this:
In Qt 4, we killed QGLayoutIterator classes (and their internal subclasses for box and grid layouts) and instead asked the QLayout subclasses to reimplement itemAt(), takeAt(), and count().
===QImageSink===
Qt 3 had a whole set of classes that allowed images to be incrementally read and passed to an animation – the QImageSource/Sink/QASyncIO/QASyncImageIO classes. Since all these were ever used for was animated QLabels, it was total overkill.
The lesson is not to add abstraction to aide some very vague future possibility. Keep it simple. When those future things come, it will be a lot easier to factor them into a simple system than into a complex one.
===other Qt3 vs. Qt4?===
===QWidget::setWindowModified(bool)===
===Q3Url vs. QUrl===
===Q3TextEdit vs. QTextEdit===
How all those virtual functions went a-goner…
===Qt’s Clipping Story (naming of clipping fns)===
When you set the clip rect, you actually set a region (should be setClipRegion(QRect) instead of setClipRect()).
(on the right, how it should have been…)

Revision as of 14:43, 23 February 2015