Train Machine Learning algorithms is a complex and demanding work. Google Tensorflow API will allow you to execute the learning tasks using a distributed or GPU accelerated envionment. Also, you can publish your model using Tensorflow server, which allows you a fast movement from training to production.
Tensorflow is installed in Finis Terrae with and without GPU, so you can change the configuration that adapts better to your needs. Also, distributed executions is enabled. Additionally, you can access your training metrics and computational graphs using Tensorboard.
Finis Terrae has been conceived for the efficient resolution of large parallel problems. As a result, Finis Terrae includes more than 7700 cores, distributed in nodes with 2 Intel Xeon E5 Haswell processors. 4 nodes includes 2 additional NVIDIA Tesla K80. These nodes are interconnected by a FDR Infiniband low-latency network. They also have access to a high-performance parallel storage system based on Lustre, capable of providing simultaneously a high capacity (NET 760 Terabytes) and, above all, a high performance (greater than 20 Gigabytes per second). Thus, calculations are not delayed by the disks input/output operations and are speeded up by GPUs when you resquest them.
Access to the system is enabled by using VPN to guarantee security. There is also the possibility of using a web portal and there is a remote visualization environment to allow an easy analysis of your results from your browser.
Consumption is billed based on elapsed core hours, ech core hour includes up to 6GB of memory. The amount of memory can be increased by using additional cores. Service has a setup fee of 200€ per company (only the first time you contract a CESGA service), which includes 3 user accounts and up to 600 GB (+200 scratch). Additional user accounts are billed. User can use any free software application, whenever allowed by the corresponding license T&C (CESGA offers some CUDA-enabled libraries and applications preinstalled). The execution of your jobs is using a batch system based on Slurm without priority. A maximum number of jobs in execution is applied using this service.
Published cost is based on CESGA GPU services. Lower cost applies when you do not use GPUs, following CESGA non-prioritized, prioritized or exclusive HPC services prices. Tensorflow server is on demand using a virtual machine without GPU which is executed on CESGA Cloud services. In this case, prices of this service is applied. To get more information, contact with our Transfer department.