But that's what I'm asking, what are the benefits?
First of all, let's be clear that one of the core concepts of Star Citizen is player interaction. CR wants us to interact with everything in 1st person: objects, ships, other players... No loading screens, all seamless, all in 1st person. So this game needs a lot of player animation work: players operating ship controls, getting into and out of vehicles, grabbing and handling things like boxes or a bottle of water, etc.
If you have a faked 1st person view, you have to do everything twice. Every player interaction has to be built for both views and has to look good in both, even though the hands may be in different places in 1st and 3rd person. If you grab a box, you must make that box fit inside your hands in 1st person, then do the same in 3rd person, and try to make it look OK even though you know the box won't be in exactly the same place in the two views. If you pick up a weapon you need 2 weapon models: one for 3rd person, and another for 1st person that is distorted so it looks good in your 1st person field of view. You also need 2 animations, one for each point of view. And if you want the player to grab the weapon and equip it seamlessly, you need some kind of animation that hides the swap from the 3rd person model to the 1st person one.
OK, this looks like a lot of work, right?
If you have unified 1st/3rd person views, like SC does, you only have to make one animation and one model. There's no need for tricks to fake transitions between models, or to make 3rd person objects fit into 1st person views. That means less work when creating content and animations, and fewer potential glitches, which also translates into less work in the long term. And what you see is what others see: it won't happen that you think you're completely covered behind an object, but you get shot anyway because your 3rd person model is posed differently from your 1st person one and is exposed to enemy fire.
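To make the cover problem concrete, here's a rough toy sketch in Python. It is not how SC or any real engine does it; the class names, the height-only check and the numbers are all made up for illustration. The only point is that a faked view keeps two independent values that can disagree, while a unified view derives the camera from the same model everyone else sees.

```python
from dataclasses import dataclass


@dataclass
class FakedViewPlayer:
    """Hypothetical player with a separate 1st person rig and 3rd person model."""
    camera_height: float      # where the local player's view is rendered from
    model_head_height: float  # where other players see (and shoot) the head

    def looks_covered_to_self(self, cover_height: float) -> bool:
        # The local player judges cover from the 1st person camera.
        return self.camera_height < cover_height

    def is_covered_for_others(self, cover_height: float) -> bool:
        # Everyone else judges cover from the 3rd person model.
        return self.model_head_height < cover_height


@dataclass
class UnifiedViewPlayer:
    """Hypothetical player whose camera is attached to the one shared model."""
    model_head_height: float

    @property
    def camera_height(self) -> float:
        # Assumption for the sketch: the camera sits exactly at the model's head.
        return self.model_head_height

    def looks_covered_to_self(self, cover_height: float) -> bool:
        return self.camera_height < cover_height

    def is_covered_for_others(self, cover_height: float) -> bool:
        return self.model_head_height < cover_height


if __name__ == "__main__":
    crate = 1.6  # made-up cover height in metres
    faked = FakedViewPlayer(camera_height=1.4, model_head_height=1.8)
    unified = UnifiedViewPlayer(model_head_height=1.8)
    print("faked:  ", faked.looks_covered_to_self(crate), faked.is_covered_for_others(crate))
    print("unified:", unified.looks_covered_to_self(crate), unified.is_covered_for_others(crate))
```

In the faked case the two answers can differ, which is exactly the "I was behind cover!" situation. In the unified case they can't, because there is only one source of truth.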
Oh, and fewer megabytes to download. And the wow factor, too.