'Functions' dataset collection

Collection of synthetic datasets generated applying continuous and limitated functions to a closed subset of its own domain; there are datasets done with scalar and vectorial functions and/or one-variable and multivariate functions. Datasets are in .csv format with header (each header column corresponds to a name of a variable) and in some case there is also the .arff format (for Weka); there are training/learning files and test files: first ones are bigger than seconds ones and the major part of data contained in test files are not present in training/learning ones.

One-variable real-valued continuous and limited function datasets:   $y=f(x) : {\rm I\!R} \to {\rm I\!R}$

ID Formula Domain Train Info Train Links Test Info Test Links
RtoR/001 $y=\frac{1}{2} x^3 - 2 x^2 - 3 x - 1$ $[-10, 10]$ Instances: 2001, discretization: 0.01 .csv   .arff Instances: 446, discretization: 0.045 .csv   .arff
RtoR/002 $y=\sin x$ $[-2 \pi, 2 \pi]$ Instances: 1257, discretization: 0.01 .csv   .arff Instances: 280, discretization: 0.045 .csv   .arff
RtoR/003 $y=e^x$ $[-5, 5]$ Instances: 1001, discretization: 0.01 .csv   .arff Instances: 223, discretization: 0.045 .csv   .arff
RtoR/004 $y=\sqrt{|x|}$ $[-5, 5]$ Instances: 1001, discretization: 0.01 .csv   .arff Instances: 223, discretization: 0.045 .csv   .arff
RtoR/005 $y=\ln{|x|}$ $[-5, 5]$ Instances: 1001, discretization: 0.01 .csv   .arff Instances: 223, discretization: 0.045 .csv   .arff
RtoR/006 $y=\arctan{|x|}$ $[-5, 5]$ Instances: 1001, discretization: 0.01 .csv   .arff Instances: 223, discretization: 0.045 .csv   .arff
RtoR/007 $y=e^{\sin x}$ $[-2 \pi, 2 \pi]$ Instances: 1257, discretization: 0.01 .csv   .arff Instances: 280, discretization: 0.045 .csv   .arff
RtoR/008 $y=\tanh x$ $[-5, 5]$ Instances: 1001, discretization: 0.01 .csv   .arff Instances: 223, discretization: 0.045 .csv   .arff
RtoR/009 $y=\frac{\sin 2x}{e^\frac{x}{5}}$ $[-20, 20]$ Instances: 4000, discretization: 0.01 .csv   .arff Instances: 889, discretization: 0.045 .csv   .arff

Download of the datasets

The datasets are available on GitHub.
These materials are distributed under MIT license; feel free to use, share, fork and adapt these materials as you see fit.
Also please feel free to submit pull-requests and bug-reports to this GitHub repository or contact me on my social media channels available on the top right corner of this page.