项目作者: akkadotnet

项目描述 :
Polymorphic serialization for .NET
高级语言: C#
项目地址: git://github.com/akkadotnet/Hyperion.git
创建时间: 2016-12-12T17:43:18Z
项目社区:https://github.com/akkadotnet/Hyperion

开源协议:Apache License 2.0

下载


Hyperion

Join the chat at https://gitter.im/akkadotnet/Hyperion

A high performance polymorphic serializer for the .NET framework.

Current status: BETA (v0.9.14).

License

Licensed under Apache 2.0, see LICENSE for the full text.

Polymorphic serializations

Hyperion was designed to safely transfer messages in distributed systems, for example service bus or actor model based systems.
In message based systems, it is common to receive different types of messages and apply pattern matching over those messages.
If the messages does not carry over all the relevant type information to the receiveing side, the message might no longer match exactly what your system expect.

Consider the following case:

  1. public class Envelope
  2. {
  3. //untyped property
  4. public object Payload {get;set;}
  5. //other properties..
  6. ...
  7. }
  8. ...
  9. var envelope = new Envelope { Payload = (float)1.2 };

If you for example are using a Json based serializer, it is very likely that the value 1.2 will be deserialized as a double as Json has no way to describe the type of the decimal value.
While if you use some sort of binary serializer like Google Protobuf, all messages needs to be designed with a strict contract up front.
Hyperion solves this by encoding a manifest for each value - a single byte prefix for primitive values, and fully qualified assembly names for complex types.

Surrogates

Sometimes, you might have objects that simply can’t be serialized in a safe way, the object might be contextual in some way.
Hyperion can solve those problems using “Surrogates”, surrogates are a way to translate an object from and back to the context bound representation.

  1. var surrogate = Surrogate.Create<IMyContextualInterface,IMySurrogate>(original => original.ToSurrogate(), surrogate => surrogate.Restore(someContext));
  2. var options = new SerializerOptions(surrogates: new [] { surrogate });
  3. var serializer = new Serializer(options);

This is essential for frameworks like Akka.NET where we need to be able to resolve live Actor References in the deserializing system.

Whitelisting Types On Deserialization

Sometimes we need to limit the types that are allowed to be deserialized for security reasons. For this reason, you can either pass a class instance that implements the ITypeFilter interface into the SerializerOptions or use the TypeFilterBuilder class to build a TypeFilter that Hyperion can use to filter out any possibly harmful injection attack during deserialization.

using the ITypeFilter interface:

  1. public sealed class TypeFilter : ITypeFilter
  2. {
  3. public ImmutableHashSet<string> FilteredTypes { get; }
  4. internal TypeFilter(IEnumerable<Type> types)
  5. {
  6. FilteredTypes = types.Select(t => t.GetShortAssemblyQualifiedName()).ToImmutableHashSet();
  7. }
  8. public bool IsAllowed(string typeName)
  9. => FilteredTypes.Any(t => t == typeName);
  10. }

using the TypeFilterBuilder convenience builder:

  1. var typeFilter = TypeFilterBuilder.Create()
  2. .Include<AllowedClassA>()
  3. .Include<AllowedClassB>()
  4. .Build();
  5. var options = SerializerOptions.Default
  6. .WithTypeFilter(typeFilter);
  7. var serializer = new Serializer(options);

Convert Whitelist To Blacklist

To do blacklisting instead of whitelisting a list of types, you will need to do a slight modification to the TypeFilter class.

  1. public sealed class TypeFilter : ITypeFilter
  2. {
  3. public ImmutableHashSet<string> FilteredTypes { get; }
  4. internal TypeFilter(IEnumerable<Type> types)
  5. {
  6. FilteredTypes = types.Select(t => t.GetShortAssemblyQualifiedName()).ToImmutableHashSet();
  7. }
  8. public bool IsAllowed(string typeName)
  9. => FilteredTypes.All(t => t != typeName);
  10. }

Version Tolerance

Hyperion has been designed to work in multiple modes in terms of version tolerance vs. performance.

  1. Pre Register Types, when using “Pre registered types”, Hyperion will only emit a type ID in the output stream.
    This results in the best performance, but is also fragile if different clients have different versions of the contract types.
  2. Non Versioned, this is largely the same as the above, but the serializer does not need to know about your types up front. it will embed the fully qualified typename
    in the output stream. this results in a larger payload and some performance overhead.
  3. Versioned, in this mode, Hyperion will emit both type names and field information in the output stream.
    This allows systems to have slightly different versions of the contract types where some fields may have been added or removed.

Hyperion has been designed as a wire format, point to point for soft realtime scenarios.
If you need a format that is durable for persistence over time.
e.g. EventSourcing or for message queues, then Protobuf or MS Bond is probably a better choice as those formats have been designed for true version tolerance.

Performance

Hyperion has been designed with a performance first mindset.
It is not the most important aspect of Hyperion, Surrogates and polymorphism is more critical for what we want to solve.
But even with its rich featureset, Hyperion performs extremely well.

  1. Hyperion - preregister types
  2. Serialize 312 ms
  3. Deserialize 261 ms
  4. Size 38 bytes
  5. Total 573 ms
  6. Hyperion - no version data
  7. Serialize 327 ms
  8. Deserialize 354 ms
  9. Size 73 bytes
  10. Total 681 ms
  11. Hyperion - preserve object refs
  12. Serialize 400 ms
  13. Deserialize 369 ms
  14. Size 73 bytes
  15. Total 769 ms
  16. MS Bond
  17. Serialize 429 ms
  18. Deserialize 404 ms
  19. Size 50 bytes
  20. Total 833 ms
  21. Hyperion - version tolerant
  22. Serialize 423 ms
  23. Deserialize 674 ms
  24. Size 195 bytes
  25. Total 1097 ms
  26. Protobuf.NET
  27. Serialize 638 ms
  28. Deserialize 721 ms
  29. Size 42 bytes
  30. Total 1359 ms
  31. Jil
  32. Serialize 1448 ms
  33. Deserialize 714 ms
  34. Size 123 bytes
  35. Total 2162 ms
  36. Net Serializer
  37. Serialize 1289 ms
  38. Deserialize 1113 ms
  39. Size 39 bytes
  40. Total 2402 ms
  41. Json.NET
  42. Serialize 3767 ms
  43. Deserialize 5936 ms
  44. Size 187 bytes
  45. Total 9703 ms
  46. Binary formatter
  47. Serialize 10784 ms
  48. Deserialize 11374 ms
  49. Size 362 bytes
  50. Total 22158 ms

This test was run using the following object definition:

  1. public class Poco
  2. {
  3. public string StringProp { get; set; } //using the text "hello"
  4. public int IntProp { get; set; } //123
  5. public Guid GuidProp { get; set; } //Guid.NewGuid()
  6. public DateTime DateProp { get; set; } //DateTime.Now
  7. }

Big disclaimer: The above results change drastically depending on your contracts, e.g. using smaller messages favor both NetSerializer and Jil.
There is no “best” or “fastest” serializer, it all depends on context and requirements.