Release EDDN - Elite Dangerous Data Network. Trading tools sharing info in an unified way.

Snake Man · Jan 27, 2015

Seems bpc2eddn sending has stopped now, finally

wolverine2710 · Jan 28, 2015

Wrt poisoning the well - by accident or on purpose. EDDN is a relay and will be kept simple. No authentication etc. BUT I´m wondering. Currently FD is collecting ideas for a web-api - and poisoning the well is hopefully something they consider. Suppose FD would be using OAuth2/OpenIDx or something similar, haskeys like used in EVE. With OAuth they would be doing the authentication (hopefully separate from the store username/password). Would that be helpfull. I´m no security buff and don´t know if OAuth2 could be setup in such a way that third party tools/websites can NOT collect the username/password combinations. If safe that is something what could be considered, not only for EDDN but also by other tools. The tools would ONLY accept usernames from FD. Is that something which could work? Perhaps the security guys could say something about that.

jamesremuscat · Jan 28, 2015

I've blocked the two IPs that the BPC2EDDN spam was coming from.

If the author wants to contact me and explain what he was doing, I would appreciate it, and would gladly assist in resolving his/her problems.

Corvusnix · Jan 29, 2015

jamesremuscat said:
I've blocked the two IPs that the BPC2EDDN spam was coming from.

If the author wants to contact me and explain what he was doing, I would appreciate it, and would gladly assist in resolving his/her problems.

I'm sorry guys!
I thought it would be a good idea uploading current changes of the BPC database.
I just took the recent data updates of BPC and faked the demand value with a fixed "999999", since this is not available for the BPC.
I'm really sorry.

Andargor · Jan 29, 2015

Corvusnix said:
I'm sorry guys!
I thought it would be a good idea uploading current changes of the BPC database.
I just took the recent data updates of BPC and faked the demand value with a fixed "999999", since this is not available for the BPC.
I'm really sorry.

There he is!

Now, I don't run EDDN (James does) and in general I just meddle, so here's my humble and friendly advice

It would have been really cool to discuss your idea here, and please feel free to do so, I think your idea is really good, although there are a few issues with it, namely:

- Are Slopey and/or his users comfortable with the fact you are redistributing the data they collected? Maybe dropping them a line in Slopey's thread would be a good idea

- How are the values assigned? Dates and such. You mentioned demand being fudged since BPC doesn't fill that in. Also, dates and times are really important to a lot of apps.

- May I suggest you ask Slopey to fill in demand data and provide an option to allow the data to be redistributed to the EDDN?

Please don't let this discourage you from innovating, but a really good group of people are in this thread, and we all want EDDN to work

Corvusnix · Jan 29, 2015

Andargor said:
- Are Slopey and/or his users comfortable with the fact you are redistributing the data they collected? Maybe dropping them a line in Slopey's thread would be a good idea

Human knowledge belongs to the universe.

Andargor said:
- How are the values assigned? Dates and such. You mentioned demand being fudged since BPC doesn't fill that in. Also, dates and times are really important to a lot of apps.

Dates and times are grabbed from BPC data timestamp:
BPC data:

Code:

206411,Wuthielo Ku,          ,Tarter Dock,Medicines,Basic Medicines,395,0,465,0,,,,,29-Jan-2015 04:15,True

converted data:

Code:

{"header": {"softwareVersion": "0.1.0.0", "softwareName": "BPC2EDDN", "uploaderID": "allBPCusers"}, "$schemaRef": "[posting URL not allowed to me]", "message": {"buyPrice": 0, "timestamp": "2015-01-29T04:15:00+00:00", "stationStock": 0, "systemName": "Wuthielo Ku", "stationName": "Tarter Dock", "demand": 999999, "sellPrice": 465, "itemName": "Basic Medicines"}}

Andargor said:
- May I suggest you ask Slopey to fill in demand data and provide an option to allow the data to be redistributed to the EDDN?

My poor English skills combined with not enough time unfortunately will be a problem.

kfsone · Jan 30, 2015

Any reason there isn't a keep alive? As with EMDN, it seems people just write a "if I haven't heard from you in 30s I'll reconnect". A simple "!" sent when you haven't forwarded anything for ... 30 or 60s would probably save you traffic and load.

kfsone · Jan 30, 2015

I was writing an email explaining some Python stuff to someone and I got a bit carried away. If anyone needs a Python 3.4 implementation of a listener I've added one to the TradeDangerous repo:

https://bitbucket.org/kfsone/traded...cc4c732c5d6627c3d83c41/misc/eddn.py?at=master

example use:

Trivial example:

Code:

    import eddn
    listener = eddn.Listener()
    while True:
        batch = listener.get_batch()
        if batch:
            print("Got batch of %d" % len(batch))

More advanced example:

Code:

    import eddn

    listener = eddn.Listener(
        minBatchTime=3,         # Allow at least 3-s for a batch,
        maxBatchTime=5,         # But allow upto 5s,
        reconnectTimeout=300,   # Reconnect after 5 minutes without data,
        burstLimit=500,         # Drain upto 500 prices between polls,
    )

    def handle_listener_error(e):
        print("Listener Error:", e)

    def process_batch(batch):
        stations = set()
        items = set()
        software = set()
        for price in batch:
            stations.add(price.station)
            items.add(price.item)
            software.add(price.software + ":" + price.version)
        print("Batch: %d entries" % len(batch))
        print("Stations: %s" % (','.join(stations)))
        print("Items: %s" % (','.join(items)))

    print("Listening for 100 batches")
    while listener.stats['batches'] < 100:
        batch = listener.get_batch(onerror=handle_listener_error)
        if batch:
            process_batch(batch)
        stats, errors = listener.stats, listener.errors
        if errors or (listener.stats['batches'] % 5) == 0:
            print("Stats:")
            for stat in sorted(stats.keys()):
                print("  {:<20s} {:>10n}".format(stat, stats[stat]))
        if errors:
            print("ERRORS:")
            for error in sorted(errors.keys()):
                print("  {:<20s} {:>10n}".format(error, errors[error]))
            listener.clear_errors()

    listener.reset_counters()

If you want to display the price data in same order it shows in UI,

Code:

print(marketPrice.item, marketPrice.sell, marketPrice.buy, marketPrice.demand, marketPrice.stock)

Snake Man · Jan 30, 2015

Heh ookay, not sure if you have reversed these but sell is sell to market, buy is buy from market.

It cant be any simpler than that. I'm hoping your description was just confusing and you do it like it is in Elite.

Edit: ok I give up you edited your post while I was writing mine

kfsone · Jan 30, 2015

Snake Man said:
Heh ookay, not sure if you have reversed these but sell is sell to market, buy is buy from market.

It cant be any simpler than that. I'm hoping your description was just confusing and you do it like it is in Elite.

Edit: ok I give up you edited your post while I was writing mine

I think the code excerpt probably clarified it.

Snake Man · Jan 30, 2015

Is your eddn.py better than the client.py supplied by EDDN wiki?

maddavo · Jan 30, 2015

Hi all,

I've started a thread about price sanity checking at : https://forums.frontier.co.uk/showthread.php?t=106902

I thought the discussion was independent enough from EDDN to make a new thread, although probably most tools are using EDDN as the source of the price data that needs to be checked. Basically, I am interested to know what people think about excluding/ignoring/deleting data when it's bad, what is considered bad, different ways of calculating what's bad etc etc.

I'd appreciate if some of you clever people would contribute your thoughts and/or algorithms.

Cheers,
Maddavo

wolverine2710 · Jan 30, 2015

kfsone said:
Any reason there isn't a keep alive? As with EMDN, it seems people just write a "if I haven't heard from you in 30s I'll reconnect". A simple "!" sent when you haven't forwarded anything for ... 30 or 60s would probably save you traffic and load.

Not totally sure what you mean. Do you mean that EDDN should send a near empty message (JSON) when for say 30-60 seconds nobody has uploaded something to EDDN? A near empty message would not break the subscribers, sending a simple "!" might depending on how a listener is being written.

wolverine2710 · Jan 30, 2015

Snake Man said:
Is your eddn.py better than the client.py supplied by EDDN wiki?

I had a brief look at it. If I've interpreted it correctly (not a Python guy) Kfsone's tool does not process each EDDN message separately, it buffers them and then processes them in one go. This makes it more efficient when for example writing to an sqlite database. In an earlier post here (week or so ago I think) he described the benefits of that in detail.

Edit: Dug up his posts in this thread. See here and here.

kfsone · Jan 30, 2015

Snake Man said:
Is your eddn.py better than the client.py supplied by EDDN wiki?

The wiki "client" is a sample client (see spoiler) that demonstrates the overall workflow to get data from EDDN but then just dumps the json to the console. The "eddn.py" I wrote provides an API for doing it properly so you can write your own client without having to learn about zmq.

Wiki "client"

Code:

import zlib
import zmq.green as zmq
import simplejson
import sys


def main():
    context = zmq.Context()
    subscriber = context.socket(zmq.SUB)

    subscriber.setsockopt(zmq.SUBSCRIBE, "")
    subscriber.connect('tcp://eddn-relay.elite-markets.net:9500')

    while True:
        market_json = zlib.decompress(subscriber.recv())
        market_data = simplejson.loads(market_json)
        print market_data
        sys.stdout.flush()

if __name__ == '__main__':
    main()

My "eddn.py"

Code:

"""
Utilities for reading from the Elite Dangerous Data Network.

Example usages:

    # Simple:
    import eddn
    listener = eddn.Listener()
    while True:
        batch = listener.get_batch()
        if batch:
            print("Got batch of %d" % len(batch))

    # Advanced:
    import eddn

    listener = eddn.Listener(
        minBatchTime=3,         # Allow at least 3-s for a batch,
        maxBatchTime=5,         # But allow upto 5s,
        reconnectTimeout=300,   # Reconnect after 5 minutes without data,
        burstLimit=500,         # Drain upto 500 prices between polls,
    )

    def handle_listener_error(e):
        print("Listener Error:", e)

    def process_batch(batch):
        stations = set()
        items = set()
        software = set()
        for price in batch:
            stations.add(price.station)
            items.add(price.item)
            software.add(price.software + ":" + price.version)
        print("Batch: %d entries" % len(batch))
        print("Stations: %s" % (','.join(stations)))
        print("Items: %s" % (','.join(items)))

    print("Listening for 100 batches")
    while listener.stats['batches'] < 100:
        batch = listener.get_batch(onerror=handle_listener_error)
        if batch:
            process_batch(batch)
        stats, errors = listener.stats, listener.errors
        if errors or (listener.stats['batches'] % 5) == 0:
            print("Stats:")
            for stat in sorted(stats.keys()):
                print("  {:<20s} {:>10n}".format(stat, stats[stat]))
        if errors:
            print("ERRORS:")
            for error in sorted(errors.keys()):
                print("  {:<20s} {:>10n}".format(error, errors[error]))
            listener.clear_errors()

    listener.reset_counters()
"""

# Copyright (C) Oliver 'kfsone' Smith <oliver@kfs.org> 2015
#
# Conditional permission to copy, modify, refactor or use this
# code is granted so long as attribution to the original author
# is included.

import json
import time
import zlib
import zmq

from collections import defaultdict
from collections import namedtuple


class MarketPrice(namedtuple('MarketPrice', [
        'system',
        'station',
        'item',
        'buy',
        'sell',
        'demand',
        'stock',
        'timestamp',
        'uploader',
        'software',
        'version',
        ])):
    pass


class Listener(object):
    """
    Provides an object that will listen to the Elite Dangerous Data Network
    firehose and capture messages for later consumption.

    Rather than individual upates, prices are captured across a window of
    between minBatchTime and maxBatchTime. When a new update is received,
    Rather than returning individual messages, messages are captured across
    a window of potentially several seconds and returned to the caller in
    batches.

    Attributes:
        zmqContext          Context this object is associated with,
        minBatchTime        Allow at least this long for a batch (ms),
        maxBatchTime        Don't allow a batch to run longer than this (ms),
        reconnectTimeout    Reconnect the socket after this long with no data,
        burstLimit          Read a maximum of this many messages between
                            timer checks

        subscriber          ZMQ socket we're using
        stats               Counters of nominal events
        errors              Counters of off-nominal events
        lastRecv            time of the last receive (or 0)
    """

    uri = 'tcp://eddn-relay.elite-markets.net:9500'
    supportedSchema = 'http://schemas.elite-markets.net/eddn/commodity/1'

    def __init__(
        self,
        zmqContext=None,
        minBatchTime=5.,    # seconds
        maxBatchTime=10.,   # seconds
        reconnectTimeout=180.,  # seconds
        burstLimit=200,
    ):
        assert burstLimit > 0
        if not zmqContext:
            zmqContext = zmq.Context()
        self.zmqContext = zmqContext
        self.subscriber = None

        self.minBatchTime = minBatchTime
        self.maxBatchTime = maxBatchTime
        self.reconnectTimeout = reconnectTimeout
        self.burstLimit = burstLimit

        self.reset_counters()
        self.connect()


    def connect(self):
        """
        Start a connection
        """
        # tear up the new connection first
        if self.subscriber:
            self.subscriber.close()
            del self.subscriber
        self.subscriber = newsub = self.zmqContext.socket(zmq.SUB)
        newsub.setsockopt(zmq.SUBSCRIBE, b"")
        newsub.connect(self.uri)
        self.lastRecv = time.time()
        self.lastJsData = None


    def disconnect(self):
        del self.subscriber


    def clear_errors(self):
        self.errors = defaultdict(int)


    def reset_counters(self):
        self.clear_errors()
        self.stats = defaultdict(int)


    def wait_for_data(self, softCutoff, hardCutoff):
        """
        Waits for data until maxBatchTime ms has elapsed
        or cutoff (absolute time) has been reached.
        """

        now = time.time()

        cutoff = min(softCutoff, hardCutoff)
        if self.lastRecv < now - self.reconnectTimeout:
            if self.lastRecv:
                self.errors['reconnects'] += 1
            self.connect()
            now = time.time()

        nextCutoff = min(now + self.minBatchTime, cutoff)
        if now > nextCutoff:
            return False

        timeout = (nextCutoff - now) * 1000     # milliseconds

        # Wait for an event
        events = self.subscriber.poll(timeout=timeout)
        if events == 0:
            return False
        return True


    def get_batch(self, onerror=None):
        """
        Greedily collect deduped prices from the firehose over a
        period of between minBatchTime and maxBatchTime, with
        built-in auto-reconnection if there is nothing from the
        firehose for a period of time.

        As json data is decoded, it is stored in self.lastJsData.

        Parameters:
            onerror
                None or a function/lambda that takes an error
                string and deals with it.

        Returns:
            A list of MarketPrice entries based on the data read.
            Prices are deduped per System+Station+Item, so that
            if two entries are received for the same combination,
            only the most recent with the newest timestamp is kept.

        Errors:
            Errors are acculumated in the .errors dictionary. If you
            supply an 'onerror' function they are also passed to it.
        """
        now = time.time()
        hardCutoff = now + self.maxBatchTime
        softCutoff = now + self.minBatchTime

        # hoists
        supportedSchema = self.supportedSchema
        sub = self.subscriber
        stats, errors = self.stats, self.errors

        # Prices are stored as a dictionary of
        # (sys,stn,item) => [MarketPrice]
        # The list thing is a trick to save us having to do
        # the dictionary lookup twice.
        batch = defaultdict(list)

        while self.wait_for_data(softCutoff, hardCutoff):
            # When wait_for_data returns True, there is some data waiting,
            # possibly multiple messages. At this point we can afford to
            # suck down whatever is waiting in "nonblocking" mode until
            # we reach the burst limit or we get EAGAIN.
            bursts = 0
            for _ in range(self.burstLimit):
                self.lastJsData = None
                try:
                    zdata = sub.recv(flags=zmq.NOBLOCK, copy=False)
                    stats['recvs'] += 1
                except zmq.error.Again:
                    break

                bursts += 1

                try:
                    jsdata = zlib.decompress(zdata)
                except Exception as e:
                    errors['deflate'] += 1
                    if onerror:
                        onerror("zlib.decompress: %s: %s"%(type(e), e))
                    continue

                bdata = jsdata.decode()

                try:
                    data = json.loads(bdata)
                except ValueError as e:
                    errors['loads'] += 1
                    if onerror:
                        onerror("json.loads: %s: %s"%(type(e), e))
                    continue

                self.lastJsData = jsdata

                try:
                    schema = data["$schemaRef"]
                except KeyError:
                    errors['schemaref'] += 1
                    if onerror:
                        onerror("missing schema ref")
                    continue
                if schema != supportedSchema:
                    errors['schema'] += 1
                    if onerror:
                        onerror("unsupported schema: "+schema)
                    continue
                try:
                    header = data["header"]
                    message = data["message"]
                    system = message["systemName"].upper()
                    station = message["stationName"].upper()
                    item = message["itemName"].upper()
                    buy = int(message["buyPrice"])
                    sell = int(message["sellPrice"])
                    demand = message["demand"]
                    stock = message["stationStock"]
                    timestamp = message["timestamp"]
                    uploader = header["uploaderID"]
                    software = header["softwareName"]
                    swVersion = header["softwareVersion"]
                except (KeyError, ValueError) as e:
                    errors['json'] += 1
                    if onerror:
                        onerror("invalid json: %s: %s"%(type(e), e))
                    continue

                # We've received real data.
                stats['prices'] += 1

                # Normalize timestamps
                timestamp = timestamp.replace("T"," ").replace("+00:00","")

                # We'll get either an empty list or a list containing
                # a MarketPrice. This saves us having to do the expensive
                # index operation twice.
                oldEntryList = batch[(system, station, item)]
                if oldEntryList:
                    if oldEntryList[0].timestamp > timestamp:
                        stats['timeseq'] += 1
                        continue
                    stats['timeseq']
                else:
                    # Add a blank entry to make the list size > 0
                    oldEntryList.append(None)

                # Here we're replacing the contents of the list.
                # This simple array lookup is several hundred times less
                # expensive than looking up a potentially large dictionary
                # by STATION/SYSTEM:ITEM...
                oldEntryList[0] = MarketPrice(
                    system, station, item,
                    buy, sell,
                    demand, stock,
                    timestamp,
                    uploader, software, swVersion,
                )

            # For the edge-case where we wait 4.999 seconds and then
            # get a burst of data: stick around a little longer.
            if bursts >= self.burstLimit:
                stats['numburst'] += 1
                stats['maxburst'] = max(stats['maxburst'], bursts)
                softCutoff = min(softCutoff, time.time() + 0.5)

        # to get average batch length, divide batchlen/batches.
        # you could do the same with prices/batches except that
        stats['batches'] += 1
        if not batch:
            stats['emptybatches'] += 1
        else:
            stats['batchlen'] += len(batch)

        return [ entry[0] for entry in batch.values() ]

Askarr · Jan 31, 2015

I'd still rather pursue a schema V2 that enables per-station upload.

The buffering seems at first glance overly complex, but I can understand its need if you're writing to SQLlite on disk - it was never really designed for frequent small transactions. There are a variety of other ways to approach the problem, just for devil's advocate's sake - one route would be do the SQLlite initial loads in memory, and/or make multiple smaller DBs for time periods & merge them. As with anything in software, lots of different ways to approach the problems

Nice code - I'd recommend looking into jsonschema though; validating against the schema is more efficient and lets you make fewer hard-coded assumptions about the structure.

kfsone said:
Any reason there isn't a keep alive? As with EMDN, it seems people just write a "if I haven't heard from you in 30s I'll reconnect". A simple "!" sent when you haven't forwarded anything for ... 30 or 60s would probably save you traffic and load.

Not sure why this is necessary. I've had a single connect; while(true) { recv() } (as with the original client.py) operating for weeks. It doesn't appear to have missed any traffic. I'm no expert in 0MQ - could you explain why this shouldn't be working & a reconnect is required?

Askarr · Jan 31, 2015

Forgive the double post, but this is a rather different query. Has anyone else noticed some strange behaviour with the gateway timestamp? I'm seeing JSON of the form:

Code:

{"header": {"softwareVersion": "0.3.8", "gatewayTimestamp": "2015-01-31T03:30:53.213062", "softwareName": "EliteOCR", "uploaderID": "EO0404fa87"}, "$schemaRef": "http://schemas.elite-markets.net/eddn/commodity/1", "message": {"buyPrice": 0, "timestamp": "2015-01-31T11:29:32+00:00", "stationStock": 0, "systemName": "LHS 2441", "stationName": "Asimov Landing", "demand": 145, "sellPrice": 4548, "itemName": "Personal Weapons"}}

Code:

{'header': {'softwareVersion': 'v1.84', 'gatewayTimestamp': '2015-01-31T03:43:54.323841', 'softwareName': 'RegulatedNoise', 'uploaderID': 'e6201d08-2c23-4776-ba70-18c03b535403'}, '$schemaRef': 'http://schemas.elite-markets.net/eddn/commodity/1/test', 'message': {'buyPrice': 0, 'timestamp': '2015-01-31T12:32:00', 'stationStock': 0, 'systemName': 'Kelish', 'stationName': 'Alphonsi Orbital', 'demand': 1855, 'sellPrice': 7136, 'itemName': 'Consumer Technology'}}

It appears the gateway time and EliteOCR time hugely disagree, and I'm leaning towards EliteOCR being right as it looks like current traffic (time of writing being about 11:30 UTC). Someone messing with the gateway's system timezone? It should be in UTC, right - it certainly used to be? (that said, RegulatedNoise's timestamp looks to be an hour off)

Master-Windoo · Feb 1, 2015

Mac

Is there a version of EDDN which runs on a mac?

kfsone · Feb 1, 2015

Askarr said:
Not sure why this is necessary. I've had a single connect; while(true) { recv() } (as with the original client.py) operating for weeks. It doesn't appear to have missed any traffic. I'm no expert in 0MQ - could you explain why this shouldn't be working & a reconnect is required?

ZeroMQ only provides one delivery guarantee - that any message promoted out of it's stack will be a complete message. The Pub/Sub model and some of the language-specific APIs are especially prone to periodically losing the underlying TCP connection and there's no way to tell this as a zmq consumer. It will eventually reconnect but it's "slow".

I ran two instances of a simple C++ recv loop on two AWS instances (one in us-west-1a and one in us-east-1c) until they collected 50,000 updates each. One ran for around 3 minutes longer than the other and there are 530 updates in the us-west-1a sample that weren't in the us-east-1c sample, and 512 updates in the us-east-1c sample that weren't in the other. So each connection was missing roughly 1% of the available data. Of course, it may actually be higher because presumably both had some overlap.

This data loss is presumably silent reconnection going on at the 0MQ layer. There are probably a number of things contributing to it, including the pathological behavior of people experimenting with stuff and having zeromq connections open but not draining them, but you'd have to look at the /proc network stats on the hosting machine to see what pub-backlog is doing to its network queues.

Code:

// g++ -O3 -Wall -Werror -Wpedantic -o test.exe test.cpp 
#include <zmq.hpp>
#include <cstdio>
#include <unistd.h>
#include <fcntl.h>

int main()
{
    zmq::context_t context(1);
    zmq::socket_t socket(context, ZMQ_SUB);

    socket.setsockopt(ZMQ_SUBSCRIBE, "", 0);
    socket.connect("tcp://eddn-relay.elite-markets.net:9500");

    int outf = open("/tmp/zmq.data", O_TRUNC|O_CREAT|O_WRONLY);
    if (outf < 0) {
        printf("Unable to open output file\n");
        return -1;
    }

    size_t dataSize;
    zmq::message_t data;
    for ( size_t packet = 0; packet < 50000; ++packet ) {
        socket.recv(&data);
        dataSize = data.size();
        // Write the length of the data
        write(outf, &dataSize, sizeof(dataSize));
        // Write the actual data
        write(outf, data.data(), dataSize);
    }

    close(outf);
}

Askarr · Feb 1, 2015

kfsone said:
ZeroMQ only provides one delivery guarantee - that any message promoted out of it's stack will be a complete message. The Pub/Sub model and some of the language-specific APIs are especially prone to periodically losing the underlying TCP connection and there's no way to tell this as a zmq consumer. It will eventually reconnect but it's "slow".

Superb explanation. Thank you. I did some digging and came across this post from one of the contributors to libzmq:

There are few solutions to the problem:

1. As mentioned above you can enable the tcp keep alive
2. Make the publisher connect to the subscriber and make the subscriber
bind, if the connection is dead the publisher will recognize that and
reconnect.
3. Use xsub and xpub and sent keep keep alive messages from the subscriber
to the publisher, on the publisher just ignore the messages. If the
connection is dead the subscriber will recognize that when trying to send
the message and the subscriber will reconnect.
4. From the publisher send keep alive messages every X seconds, on the
subscriber if you don't get a message after Y (usually 2*X) close the
existing socket and reconnect.

Out of the 4 I prefer the second if it possible(the amount of subscriber is
fixed or small) and the forth[sic] if not.

I would conclude that the publisher being run for EDDN should indeed send keep-alive messages, and it's our job as subscribers to reconnect some time after that keep-alive period has elapsed.

From that message thread, it seems that there's really two problems - if the publisher is rebooted (or equivalent loss of connection), the subscriber has no direct way to know that it has to re-establish the connection - if the subscriber is rebooted, the publisher has no idea that the connection it's publishing to is dead. The latter means the publisher is sending traffic to a dead connection & is less of a big deal beyond network traffic. The former means subscribers lose data - I'd say that's quite a problem.

While we're on the subject, the other way a subscriber can lose data is if it fails to receive data quickly enough. If the High Water Mark of the queue for the publisher for that subscriber is reached, it will discard messages. Therefore the other thing a subscriber must do is process incoming messages as fast as possible. Do no long-blocking work at all that prevents you from calling recv() again as soon as possible (or equivalent Tornado ioloop callback).

Release EDDN - Elite Dangerous Data Network. Trading tools sharing info in an unified way.

Snake Man

wolverine2710

Tutorial & Guide Writer

jamesremuscat

Corvusnix

Andargor

Corvusnix

kfsone

kfsone

Snake Man

kfsone

Snake Man

maddavo

wolverine2710

Tutorial & Guide Writer

wolverine2710

Tutorial & Guide Writer

kfsone

Askarr

Askarr

Master-Windoo

kfsone

Askarr