Template Metaprogramming

技术2022-05-11 93

Template Metaprogramming 1. 何谓 Metaprogramming? Metaprogram: program that manipulates another program. Metaprogramming is not a new concept: Compiler is a metaprogram: manipulates your code and produces code in a lower level code Preprocessor YACC 这个容易理解，就像 Metadata 是“关于数据的数据”一样，Meta 这个前缀本身是很能说明问题的，如果把编译器也看成是一个 Metaprogram ，那么这个概念的确是没有什么神秘的，计算机里面的绝大多数程序，干的事情都不外乎把这种数据变成另外一种数据。 2. Metaprogramming 的几种途径 One approach: external to the language that is being manipulated Another approach: Domain language and host language are the same 第一种就是说的 YACC之类的了，当然，命令解释器也当仁不让的可以算是。第二种么，Preprocessor 当然算是了，template 则是重头戏了。 3. 举几个简单的例子先？ CppTM 领域最经典最简单的例子莫过于计算阶乘了，它简单而有用，同时体现了 CppTM 的递归本质。我想绝大多数人当年学习递归的时候也是从这个例子开始的： #include <iostream> using namespace std; template <int N> struct factorial { static const int value = N * factorial<N - 1>::value; }; template<> struct factorial<0> { static const int value = 1; }; int main() { cout << "factorial<10>: " << factorial<10>::value << endl; cout << "sizeof char[factorial<4>::value]: " << sizeof(char[factorial<4>::value]) / sizeof(char) << endl; } 输出： factorial<10>: 3628800 sizeof char[factorial<4>::value]: 24 这只是一个回顾，上面的简单程序就不用我解释了吧？下面的这个 remove_cv 算法会去掉参数类型的 const 和 volatile 修饰符（如果有的话），Boost type_traits 就是这么干的。 #include <iostream> using namespace std; template <class T> struct remove_cv { typedef T type; }; template <class T> struct remove_cv<const volatile T> { typedef T type; }; template <class T> struct remove_cv<const T> { typedef T type; }; template <class T> struct remove_cv<volatile T> { typedef T type; }; int main() { cout << "remove_cv<const int>: " << typeid(remove_cv<const int>::type).name() << endl; cout << "remove_cv<volatile int>: " << typeid(remove_cv<volatile int>::type).name() << endl; cout << "remove_cv<const volatile int>: " << typeid(remove_cv<const volatile int>::type).name() << endl; } 输出： remove_cv<const int>: int remove_cv<volatile int>: int remove_cv<const volatile int>: int 这个也很简单，但是非常有用。最后再来一个，相当有用的，它让我们可以在编译期间把一个数字作为二进制数来解释： #include <iostream> using namespace std; template <unsigned long N> struct binary { static unsigned const value = binary<N/10>::value << 1 | N; }; template <> struct binary<0> { static unsigned const value = 0; }; int main() { cout << "binary<1>: " << binary<1>::value << endl; cout << "binary<11>: " << binary<11>::value << endl; cout << "binary<101>: " << binary<101>::value << endl; cout << "binary<111>: " << binary<111>::value << endl; cout << "binary<1011101>: " << binary<1011101>::value << endl; } 输出： binary<1>: 1 binary<11>: 3 binary<101>: 5 binary<111>: 7 binary<1011101>: 93 不过上面这个程序不容错，换句话说，如果你写 binary<123>::value ，编译器不会阻止你，还会给出一个愚蠢的答案。如果要做一个容错的解释器，只需要玩一个小小的把戏（本人原创）： #include <iostream> using namespace std; namespace aux{ // 对于 0 和 1 以外的数，都不定义 value ，这样在出现 0 1 之外的数 // 的时候，编译器会抱怨找不到 value template <unsigned long N> struct binary {}; template <> struct binary<1> { static unsigned const value = 1; }; template <> struct binary<0> { static unsigned const value = 0; }; } template <unsigned long N> struct binary { static unsigned const value = binary<N/10>::value << 1 | aux::binary<N>::value; }; template <> struct binary<0> { static unsigned const value = 0; }; int main() { cout << "binary<1>: " << binary<1>::value << endl; cout << "binary<11>: " << binary<11>::value << endl; cout << "binary<101>: " << binary<101>::value << endl; cout << "binary<111>: " << binary<111>::value << endl; cout << "binary<1011101>: " << binary<1011101>::value << endl; // 你可以 uncomment 下面这一行看看会发生什么 //cout << "binary<123>: " << binary<123>::value << endl; } 输出还是一样，但是如果你写了 binary<123>::value 这样的东西，编译器就会抱怨了： error C2039: 'value' : is not a member of 'aux::binary<N>' with [ N=2 ] // bla bla bla 好了，例子够多了，我们可以稍微总结一下。CppTM 的好处在于：把很多计算放到编译期间完成，使得运行效率大为提高由于计算在编译期间完成，很多错误也可以在编译期间发现，程序员不用到了程序开始跑了才进入痛苦的调试有一些事情，比如 remove_cv ，在运行期间还的确不那么好做 4. 我怎么开始 Metaprogamming 呢？在我们开始学习编程的时候，首先学到的是赋值、条件判断、循环等等。在 CppTM 中，这些有了变化，如下：循环 --> 模板递归条件判断 --> 模板偏特化赋值 --> 没有，变量的值一旦确定就不会变化（这对于 functional programming 是常事）函数输入输出 --> 类型和常量其实这些特征并不是什么旁门左道，正好相反，它具有 functional programming 的特征，符合图灵机模型（感兴趣的话可以看这篇 paper: C++ Templates are Turing Complete）。 5. 模板偏特化是个好东西，但是我每次都要把 if...then...else 映射成它，岂不是要累死？况且这编码量也太大... 这是个好问题，好在在计算机科学里面有一句箴言：You can solve everything by adding an extra layer of abstraction. 使用模板偏特化进行条件判断如此常用，我们完全应该把它抽象出来以备重用： template <bool Cond, class Then, class Else> struct if_ { typedef Then type; }; template <class Then, class Else> struct if_ <false, Then, Else> { typedef Else type; }; 简单吧？虽然简单，但是我们从此却可以在更高的抽象层面上看问题，我们摆脱了用模板偏特化思考，现在可以直接用 if...then...else 来思考了。那么，上面解释二进制数的程序就变成了下面这样： #include <iostream> using namespace std; template <bool Cond, class Then, class Else> struct if_ { typedef Then type; }; template <class Then, class Else> struct if_ <false, Then, Else> { typedef Else type; }; namespace aux{ struct one { static unsigned const value = 1; }; struct zero { static unsigned const value = 0; }; struct other {}; } template <unsigned long N> struct binary { static unsigned const value = if_ <N/10 == 0, aux::zero, binary<N/10> >::type::value << 1 | if_ <N == 0, aux::zero, if_ <N == 1, aux::one, aux::other>::type >::type::value; }; int main() { cout << "binary<1>: " << binary<1>::value << endl; cout << "binary<11>: " << binary<11>::value << endl; cout << "binary<101>: " << binary<101>::value << endl; cout << "binary<111>: " << binary<111>::value << endl; cout << "binary<1011101>: " << binary<1011101>::value << endl; // 你可以 uncomment 下面这一行看看会发生什么 //cout << "binary<123>: " << binary<123>::value << endl; } 输出还是一样，但是现在不仅程序长度减少，而且相关的逻辑也用我们熟悉的 if...then...else 的方式来表达。多一层抽象果然威力强大！当然，循环还是要用模板递归的。 6. 流程控制的问题解决了，下面呢？我们是不是需要一些容器，像 STL 那样？如果要表达一个装“类型”的容器，你会怎么做？象下面这样么？ struct types { typedef int t1; typdef long t2; typedef std::vector<double> t3; } 大概只要稍微明智一点，你就会马上放弃这个想法，它太没有通用性了。真正的启示来自于 Lisp ，在 Lisp 中，表是最重要的数据结构，几乎是“万物皆表”。一个表由一个头和一个尾组成，可以嵌套，空表用 nil 表示。这种简单的概念却有着不可思议的表达能力。如果我们用 C++ 来模拟，就是这样： template <class First, class Rest> struct cons { typedef First first; typedef Rest rest; }; struct nil {}; 现在我们需要表示一个类型列表就有章可循了： typedef cons<int, cons<long, cons<std::vector<double>, nil> > > a_type_list; 它是递归的，从而我们可以很容易的用递归的方式来对它们作协操作，用来操作它们的，就是 Metafunction，前面的 if_ 就是一个 Metafunction。例如我们想选择两个类型中比较大的一个，可以写一个 choose_larger Metafunction： template <typename T1, typename T2> struct choose_larger { typedef typename if_ <(sizeof(T1) > sizeof(T2)), T1, T2>::type type; }; 我们再次看到，由于有了 if_ ，我们的生活变得轻松多了。让我们继续向前发展，我们不想仅仅停留在两个类型比大小上，我们希望选择一个 type list 里面最大的那一个： template <typename T> struct largest; template <typename First, typename Rest> struct largest<cons<First, Rest> > : choose_larger<First, typename largest<Rest>::type> {}; template< typename First > struct largest<cons<First,nil> > { typedef First type; }; 其实上面的也可以用上 if_ ，只不过我们现在还没有写出判断一个类型是否为 nil 的 Metafunction ，这是件很简单的事情，大家可以自己去写写看。有了它们，得到一个 type list 里面最大的元素就轻而易举了： #include <vector> #include <iostream> using namespace std; //... 上面的那些 Metafunction int main() { typedef cons<int, cons<long, cons<std::vector<double>, nil> > > type_list; cout << typeid(largest<type_list>::type).name() << endl; } 输出： class std::vector<double,class std::allocator<double> > 7. 的确很棒，但是这跟我们当年学数据结构以后，没事就写个链表玩玩差不多，在生产环境中可不能这样，用什么通用的方法么？终于到了这一步，在“重复发明轮子”足够多次以后，终于就有人会出来发明通用轮子的，MPL 就是一个很好的尝试，当然 Loki 也算是。还记得 if_ 让我们尝到的甜头么？聪明人在看到了这些甜头以后，是决不会止步不前的，Dave Abrahams 和 Aleskey Gurtovoy 就是这样的聪明人，他们发明了 MPL 。留到下一篇好了。

专利

最新回复(0)