Some data: How far to go to find a new system?

"How far do I have to go to find a new system?" This question comes up from time to time, and answer tend to vary a lot. Specific numbers seem to be mostly anecdotes, so I was curious if we could glean some insight into this from the EDSM data. Moreover, to see how this has changed over time, to see the rate at which systems might be "running out".
The biggest weakness there, of course, is that of course not all systems are uploaded on EDSM, so any numbers have to be experimentally verified. There's another important thing to consider: in the EDSM data dumps we do have, the system's timestamp is when it was last updated (with more precise coordinates), not when it was first added. Unfortunately, we don't have the latter in them. However, we do have the EDSM ID number, which is incremental, so at least we can guess fairly well if a system was added recently, or added way back and updates recently. Thanks to Orvidius for this idea, originally I ignored and stripped out the IDs.
Plotting the IDs over time shows rather large gaps, with actually new systems being the closest only once a week or so. Still, over the grand total, that's good enough.

So, with that in mind, the results were surprising, and enough to make me want to go back to the bubble and see for myself. The data is here: https://docs.google.com/spreadsheets/d/1-9mcezzhktEkcBfkS1LdsxPDzFojF7vy43fGLv5zWXs], and I'll link shots of the charts here later.
The point is: these days, on EDSM data, you have to go 250-270 ly to find an unreported system. This was slowly but steadily increasing from 150 ly in 2017. April. (There are some downward spikes that are far too close: those outliers tend to be some populated systems in the bubble getting updated.)

Being highly sceptical that you could actually find a new, entirely untagged system so close, I set out to see how close I could actually get. I picked one of the less popular directions, into the brown dwarf layer. The results: new untagged bodies started appearing around 300 ly out, but my first entirely untagged systems were some 550 ly away.
So, with that, I added a 2.5 multiplier to the data as well, just to note this. I was more curious about the rates of increase, to see how quickly the distance increases. Well, as it turns out, very slowly.

Now, bear in mind that these are strictly new systems, and not cherry-picking: most of them were mass code A, only a couple of mass code B, M dwarfs. The rest were L, T, Y. No non-dwarf stars, no higher mass codes. But the question was how far to go to find a new system, and not how far to a new mass code D system and so on.
If you were to look for ELWs, you'll likely have to go somewhere between 600-800 ly to find a new one.

To put things better into perspective, we should also look at how many jumps these are, and how that changed over time. Going by my thread on how they changed over time, viewed on the case of an Asp Explorer, it's around nine jumps (for the multiplied distances, rounded up). Over time, this changed even less, although the biggest range increase with the introduction of Engineers happened before the data range we have here.


So, there we have it. The answer to how far you have to go isn't in thousands of lightyears, but hundreds instead. Or in terms of time, roughly ten minutes of gameplay. (Unless you stop to scan already-discovered systems.)

Of course, if you start cherry-picking things and filter out stars, then you'd likely end up with results farther than these - much farther for a fresh mass code D system, for example. Those might be worth looking into too, but I'll leave that up to others, as I'm looking into what I find more interesting stuff these days.
Plus this is all very much directional: if you zoom in on heatmaps (especially EDAstro's Alt. Heatmap), it's plain to see. The usual advices of "don't pick a popular direction, fly off the plane" still apply, naturally.
 
Last edited:
Thanks Marx - again, an interesting answer to a simple question. Those single undiscovered systems are closer than expected!

On the other hand, if ppl are not looking for that single undiscovered system, but rather from where the majority of systems becomes undiscovered, we're probably still around the 1kly (off the beaten tracks of course).
 
While I'm one of those who stops and scans already discovered systems - just to see what's there, POIs to visit, etc - I find it very easy to run across undiscovered systems provided you're not being picky about discovering something/anything. I jump out of the bubble, big jumps ('Fastest Routes' setting), to an arbitrary location and then set a new arbitrary goal and merely change the setting for my jumps routing to be short jumps ('Economical Routes' setting) and let it plot... also removing all star type filtering, allowing everything. I'm finding there is plenty of stuff to be discovered out there!
 
There's probably still a few in the bubble. I get undiscovered planets in the bubble still semi-regularly. I've found a couple between the bubble and Betelgeuse and Rigel. Towards the bubble nebula I found a good few. Also out near ngc7288 even though it seems alot of people have been out there.
 
When I left the bubble again a few weeks ago, I tried using the Visited Stars Cache to plot my way out using unvisited systems (from EDSM's perspective). From Shinrarta Dezhra, I only needed to go about five jumps before hitting systems that weren't in EDSM yet, in the direction I chose to travel. At this distance they were mostly tagged in game already, but perhaps not 100%. However, if it's tagged but I'm the submitter for EDSM, I still consider that a half-win. ;)
 
That is a great analysis. I especially like all the reasoning around the fact that certain things can't be seen in the data easily and how to work around it :) .
But why is it going linear? Shouldn't it be more like the third cube over time? Unless the number of players increases heavily. At least the ones reporting to EDSM.

Also, the factor of 2.5 … øhm … in general I wouldn't agree to that after just one "experiment". But it fits rather well with other numbers "out there" and my stomach feeling ;)
 
But why is it going linear? Shouldn't it be more like the third cube over time? Unless the number of players increases heavily. At least the ones reporting to EDSM.
Yeah, the linear regression was just to make it simple, but now that I look at it again, third degree polynomial does work better. Thanks for reminding me of it!
Oh, and the number of players (exploring and reporting to EDSM) increased heavily during DW2, but that fell off after it ended. (More on this at a later time.) However, it doesn't really matter how many uploaders are out there exploring in total: only how many uploaders are close to the bubble does.

Also, the factor of 2.5 … øhm … in general I wouldn't agree to that after just one "experiment". But it fits rather well with other numbers "out there" and my stomach feeling ;)
Yes, it's a quick-and-dirty hack, really. It should be determined better for a serious investigation, but I didn't find this whole matter that important to spend much more time looking into it. (Like with breaking down distances per mass code, star type, and so on.)
 
Thanks, as always, for the data @marx

For the record I tagged a catalog system (HR 5619 - A class) for EDSM at under 400 LY from Sol (on Friday), so there are still some interesting systems that haven't been uploaded yet.

Personally I'm past caring about First Discovered tags, but I'm always pleased to add a new system to EDSM.
 
I remember I found my first when I was still piloting a Hauler. Just a few jumps directly down from the edge of the bubble.
 
There are a surprising number of named HD systems near to bubble which are not explored (as of EDSM)....

I've been thinking of creating a spreadsheet for all of the stars in the HD and HIP catalogs to make sure they're all accounted for, but I have no idea how I'd go about doing that.
 
I've been thinking of creating a spreadsheet for all of the stars in the HD and HIP catalogs to make sure they're all accounted for, but I have no idea how I'd go about doing that.
The HD and HIP catalogs together contain around a half million entries, corresponding to (considering overlap) around 300,000 distinct stars. We don't know the criteria FDev used to pick which of those stars made it into the Elite galaxy. Especially in the Bubble, many of those stars are present but under other names. So the fact that a star exists in one of those catalogs but not in EDSM, does not by itself tell you very much about whether that star is actually in the game. You would need to essentially reproduce FDev's reasoning when they were building the galaxy, to predict which of those stars would be likely to have made it in. That would be an interesting project for someone, but definitely not easy, and making a list of HIP and HD entries would be only the first small step.
 
There are a surprising number of named HD systems near to bubble which are not explored (as of EDSM)....
I wouldn't be surprised at all if they were all explored, even by people you see on EDSM: in their case, they likely flew to them well before journal files were added to the game. Besides, the closer you are to the beaten paths (and nothing is more beaten than the bubble), the more likely that an already visited system is not yet on EDSM.

Only one way to find out though.


As for what Frontier didn't import: they had to convert the catalogue data to whatever format(s) Elite uses, and because of that, there certainly would be errors. Given the number of systems (plus the schedule), it could easily be that whenever the process failed, they would just discard the system, and that's why it's missing. Still plenty of stars to visit, after all.
 
The key criterion for inclusion was to have a distance measurement, which is why the star catalogues we see in game tend to be of the brighter stars or unusual stars at long distance (e.g. pulsars but also some known from gravitational microlensing). There is a fiction diary from the early development days with Michael Brookes where he says that they used the Hipparcos, Gliese and a couple of other star catalogues. Hipparcos was a satellite that measured distances via parallax, so everything in its catalogue has a distance measurement. The Tycho catalogue also derives from Hipparcos data. Star clusters with known distances can include objects from other catalogues that don't always have a distance, or which were small catalogues just of the stars in the cluster.

I have had a little success in the last couple of months looking for untagged bodies within ~400LY of Sol. T Tauri stars are much more likely to have untagged bodies but completely untagged systems are very rare. I've been looking for The Dark Wheel's hidden base, so unmapped 8th moons are also of interest and there are plenty of unmapped bodies in systems with unscoopable or M type primary stars. Untagged bodies tend to be distant and were probably left alone by someone passing through with an old Advanced Discovery Scanner.
 
I found an untagged system (binary M-class, no planets) 515 Ly from Shinrarta several weeks ago.
Sounds about right. The distance from Sol depends on which direction from Shinrarta Dezhra you found it though, but it would be around the estimated 550 ly in the first post. That was back in September, but a cursory look at recent data seems to suggest it's mostly the same these days.
A thought just crossed my mind: it's one fleet carrier jump. (Assuming numbers don't get revised, of course.)
 
Last edited:
I found an untagged system (binary M-class, no planets) 515 Ly from Shinrarta several weeks ago.
Not bad - I think my closest untagged system is the one 620 LY from Sol. I've also found a mostly-untagged system 570 LY out. Both well below the plane.

It might be interesting to separately plot closest discoveries by broad direction - above/below plane, in the plane by quadrant, that sort of thing. Seem like a safe bet that the closest undiscovered system south of the Bubble (i.e. towards Orion and other interesting local destinations) is going to be a good deal farther out than in other directions. For instance, when I first loaded up the big visited stars cache file, it was striking to see that the space between Col 70 and the Pleiades/California corridor is almost totally filled in out to 2000 LY or so. 2000 LY north of the bubble there's mostly undiscovered (to EDSM) systems outside of the narrow lanes leading to Sag A* and Colonia.
 
Back
Top Bottom