Tachyum running Linux and applications with 64KB pages on FPGA

1 min read

Tachyum has announced that it has successfully achieved the required performance and stability of the Linux operating system and applications running as part of QEMU emulations.

Credit: bluebay2014 - adobe.stock.com

Consequently, the company has said that it will now expand all testing and optimisation processes of its Prodigy software distribution to 64KB page size on the FPGA prototype.

Tachyum first announced its intentions to move to 64KB page size as the default of its Prodigy software distribution – a completely integrated software stack and package available to early adopters and customers as a pre-installed image as part of beta testing – in June.

After completing testing of Linux 6.8 and applications in QEMU software emulation, Tachyum’s move to the 64KB page size is expected to yield up to 10-15% better performance than 4KB pages depending on the application. 4KB support is now only supported for special configurations and will be deprecated in Prodigy 2.

"For decades, the default translation page size on most CPU architectures was 4KB," said Dr. Radoslav Danilak, founder and CEO of Tachyum. “With the amount of data being processed and with application sizes growing, adhering to 4KB size leads to increasingly larger overhead. For increased efficiency, Tachyum decided to make 64KB page size the default in its offerings. This successful transition will ensure that the performance of Prodigy’s software distribution is optimised to meet applications’ needs.”

As a Universal Processor suitable for all workloads, Prodigy-powered data centre servers will be able to seamlessly and dynamically switch between computational domains (such as AI/ML, HPC, and cloud) with a single homogeneous architecture.

By eliminating the need for expensive dedicated AI hardware and dramatically increasing server utilisation, Prodigy reduces CAPEX and OPEX significantly while delivering improved data centre performance, power, and economics.

Prodigy integrates 192 high-performance custom-designed 64-bit compute cores, to deliver up to 4.5x the performance of the highest-performing x86 processors for cloud workloads, up to 3x that of the highest performing GPU for HPC, and 6x for AI applications.