-
Notifications
You must be signed in to change notification settings - Fork 72
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
implementation for dns log #606
Comments
Hi @wdweng, The simplest thing you could do is convert your data to newline-delimited JSON then ingest that. That way everything should work for you out of the box without having to change code. If you do want to directly ingest DNS logs there is a way to do that (talked about in my masters thesis) but it isn't very user friendly at the moment. You will have to write a parser and a serializer for your DNS logs following a certain programming model. Additionally you will have to change some parts of the code that currently assume every record is a JSON object, in particular here at ingestion, here during serialization, and here during search (and also here during search). Note that this is purely an issue with how the code is written right now -- the archive format itself can handle cases where records are not JSON. When it comes to actually writing your parser and serializer you will first have to add a type to this enum -- this is the type that gets encoded into the Merged Parse Tree and indicates what type of structure is being represented. For writing the parser hopefully this can act as reference -- in particular note the For serialization it might be a bit more difficult to replicate what we do since the code is very optimized for serializing JSON. Here in the code the variable Going forwards this should all become much simpler, but unfortunately support for custom parsing and serialization is not very mature right now. |
thank you very much, very helpful |
Request
my graduation is about dns log compression and search
i have read your paper and found that the json version very suitable for dns log
so i want to develop one for dns log
Possible implementation
dns log is a txt file and is semi structured which has format
time--CIP--RIP--QType--QName--Resource Records very similar to json files
resource records have different length, and most values in each field are not repetitive
i want to change the code in clp-s to fit dns log input
The text was updated successfully, but these errors were encountered: