The Story of Project Erebus

Introduction:

Project Erebus has been in a constant state of change over the past several months. What initially started off as a system with 6 water cooled Nvidia GTX 480′s in a over the top gaming motherboard has morphed into a system with 8 ATI 6970 cards housed in a rack mounted server case. While I don’t think it’s safe to say the transformation is complete it has certainly reached a precipitous. In the ever changing world of computing the bar of performance, and speed are in a constant state of escalation. What one days is top of the line, the next is just old news. At least for a short while I have the honor of setting the bar in some respects, even if it is a losing battle.

The build, V 1.0

Erebus version 1, was composed of a EVGA SR-2 Classified motherboard and 6 water cooled GTX 480′s running at 1504Mhz. Water cooling was done so that all 6 cards could fit into the 7 available PCI express slots. The intention was to leave the center slot open for the Magma expansion bay.  The Magma PCI-E expansion chassis provided room for several more cards, however a limitation of the SR-2′s 32bit BIOS prevented more than 8 total cards. The chassis was fitted with two Galaxy O/C cards running at 1520Mhz, slightly faster then the EVGA cards, but not requiring water cooling. With regards to a full Nvidia system this was one of the fastest personally owned single systems at the time. Any application capable of accessing multiple cards was able to utilize all 3,840 stream processors available in all 8 cards.

Version 1.5

In an effort to increase the available bus speed and the hopes that all 10 slots of the TYAN FT72-B7015 motherboard could be used an order was put in with RenderStream a company based out of Austin TX. The board it’s self is much larger than even the already large SR-2. However the bare bones system consist of an entire chassis and power supply. Simply moving the memory, processors and hard drives over to the Tyan board I was up and running in Ubuntu in no time. At nearly the same time Nvida had released the GTX 580′s increasing clocks speeds and unlocking all 512 stream processors from the GF110 core. Immediately 4 EVGA GTX 580 hydro-coppers were ordered with the plans to order 4 more once they were in stock. It was at that time I discovered that in fact not all 10 slots could be used on the Tyan board, even using the Magma expansion bay.  

There was some issues initially with getting all 8 cards to work, but once the BIOS was configured correctly both Windows 7 and Ubuntu 64 were able to detect all 8 cards. The system consisted of 4 GTX 480′s and 4 GTX 580′s any attempt to add another card regardless of the type would cause the system not to post, it had seemed as if the BIOS was unable to address all cards. Once again I was limited to 8 cards, it was a rather big disappointment because I hoped to be able to use the expansion and have a system with potentially 10 or 11 cards. Then there were rumors that Nvidia was to release the GTX 590′s essentially 2 under clocked 580 cores housed on one double slot card. This would give me the potential to have 16 total cards. However shortly before the release of the GTX 590′s I had decided to make a some what dramatic decision.

Version 2.0

After a short discussion in my frequented IRC channel I immediately searched and found a good price on 8 ATI 6970′s. There was some initial concerns if the Tyan motherboard would support ATI cards at all. I only knew of a small handful of people that had ordered the RenderStream system and no one that I was aware of had even tried ATI cards in the system. All the Nvidia cards were removed and the 6970′s were installed. At first I thought that Ubuntu could only detect 6 cards, but as it turns out there was some issues with aticonfig correctly detecting everything.  Manually setting the necessary configs got everything working and the results have been nothing but positive.

Results:

My first test consisted of running a new oclHashcat Beta application called oclHashcat-lite. Oclhc-lite is targeted at new users of the oclhc series of tools. Without the need to learn all the details of regular oclHashcat. In addition oclhc-lite was optimized for single hash performance utilizing the benefits of round reversal and various other hashing algorithm short cuts and GPU techniques to achieve the highest overall possible speeds. First up was the MD5 hashing algorithm the bench-press of benchmarks immediately showed us results just shy of what was predicted.  Here are the results of the test.

MD545B/sec
NTLM70B/sec
MD475.2B/sec
SHA115.4/sec (updated)

To date these are some of the highest speeds achieved by a single personally owned machine.

Video of oclhc-lite record runs

Multi-hash performance obviously suffers and drops down to 14.8B/sec with 6500 hashes in MD5 using regular oclhc. The setup is relatively new and I plan to continue doing several benchmarks and post them here. For now I’m overall pretty pleased with the setup. For the test that I’ve ran ATI has surpassed Nvidia with ease. The only drawback I foresee is having to run a separate Nvidia system to support those CUDA only apps. Many thanks to those that have helped me iron out the kinks and to Atom for make such wonderful elegant tools

DG

 

 

Comments (8)

  1. 3:57 am, July 13, 2011altu  / Reply

    does this sistem erebus 2.0 work with windows 7 or only the 8 GPU`s are detected only in linux all.

    • 8:30 pm, August 1, 2011d3ad0ne  / Reply

      Will work with linux and now with updated drivers should work in windows.

  2. 12:13 am, July 28, 2011Jesus  / Reply

    What settings did you have to change on the BIOS of the TYAN Box? I have this same box with 6 GTX 590 but it wont post if i leave only 4 of them it works fine. I would appreciate it if you could tell me the settings you had change. Thank you in advanced

    • 8:31 pm, August 1, 2011d3ad0ne  / Reply

      It requried me to set it so that onboard vga adapter posted first.

  3. 8:35 am, September 28, 2011manoj  / Reply

    is there SLI support on the TYAN motherboard. If I want to do the same project, I would like to have some of them bridged.

  4. 9:04 pm, July 9, 2012Dave  / Reply

    I have a similar setup

    The Build

    Motherboard: Tyan FT72-B7015
    Processor: Dual Xeon X5680
    RAM: 144GB
    Hard Drive: OCZ Vertex III
    GPU: 8X Zotac GTX 680 2GB

    I have installed every driver version provided by NVIDIA and I’m continuing to have problems initializing the 8th GPU. The GPU’s are all operational and can be moved to any slot and function properly. There is no OC’ing with any of the GPU’s. Windows recocognizes all 8 GPU’s however it is disabling the one on PCI Bus 12? Computer managment says there is a problem and giving Code 43 (worthless info) and disabling hardware. When installing different driver versions the PCI Bus that gets disabled can switch to different GPU’s. This makes me believe that it could be a driver issue with NVIDIA. Is there any BIOS settings that could prevent 8 GPUs from operating properly? Does anyone have any suggestions that I could try? I did initialize the on-board graphics and set as priority and all 8 are running but I cannot access the NVIDIA Control Panel because there is no monitor connected to a NVIDIA GPU. I disabled on-board graphics in computer management and tried to force the PC to default to NVIDIA while keeping the onboard graphics at priority but this didn’t work either. I’m at a loss and not sure where to go from here.

    Here is the thread I started in Toms Hardware

  5. 5:09 am, August 24, 2012height increase insoles  / Reply

    Informative post! Good to see someone who knows what it is all about
    and can additionally produce common sense content for us visitors.
    without doubt looking forward to your next article.

  6. 6:01 am, December 28, 2012Juan  / Reply

    well.. nice site, why u didnt uploaded the video to youtube?, anyway, all i can say for v3, is that cRARk 4 linux with HD7970 drivers 12.6 does 25K pps,[$a $A $1 $!] * & HD6970 does arround 10K. new drivers 12.10 are much faster for FLACCL03 w7x64 800x, but cRARk w7x64 drivers 12.10 with -d1 does the same 25Kpps. 12.11beta11 NET4 have strange peaks while moving the mouse over 2D graphics like msi afterburner, but work a tiny bit faster than 12.10 in flaccl03. New record 25-gpus http://www.geeks3d.com/20121206/a-25-gpu-monster-cracks-passwords-in-opencl/

Leave a Reply

Allowed Tags - You may use these HTML tags and attributes in your comment.

<a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong> <pre lang="" line="" escaped="">