How to efficiently integrate big data without using memory / (How to create big data)
Mostrar comentarios más antiguos
- in a study i will produce large arrays.
- Each array will have at least 500 MB size.
- Each array will have the same number of rows.
- the total size of dataset will be approximately 20 GB or over.
- Somehow I have to create a single variable/array which includes all data and size of 20 GB.
matfile seems a good solution. However when the size of file increases, it gets slower. How can i handle this problem?
9 comentarios
blaat
el 18 de Ag. de 2015
How you store and access big data is strongly dependent on what you need to do with it. Do you need all arrays at the same time? Do you need specific values from these arrays at the same time? Can you somehow partition your calculations to a subset of the data?
Without more information it is very difficult to give advice on your problem.
Mehmet OZC
el 18 de Ag. de 2015
Mehmet OZC
el 18 de Ag. de 2015
blaat
el 18 de Ag. de 2015
If can process the arrays separately, perhaps it would be more convenient to keep them as separated files. Or is there another reason you want a single, large file?
If a single file is required, I would advise against storing everything in a single variable. As far as I know, there is no way of reading only part of an array from a .mat-file, so the file will require 20 GB of memory to load.
Steven Lord
el 18 de Ag. de 2015
Why do you need to write them to a single file? Why not put each in its own file; that way if something were to happen to one of the files you wouldn't lose all of your data?
Mehmet OZC
el 18 de Ag. de 2015
Editada: Mehmet OZC
el 18 de Ag. de 2015
blaat
el 18 de Ag. de 2015
So, if I understand correctly, your problem is this: you want to write 20 GB of data to a single variable in a .mat-file, but it's getting unworkably slow? Or doesn't it work at all?
Mehmet OZC
el 18 de Ag. de 2015
Editada: Mehmet OZC
el 18 de Ag. de 2015
Walter Roberson
el 18 de Ag. de 2015
I wonder if compression is leading to slowdowns? I do not know whether -v7.3 with matfile uses compression; see discussion http://www.mathworks.com/matlabcentral/answers/15521-matlab-function-save-and-v7-3 and http://www.mathworks.com/matlabcentral/answers/137592-compress-only-selected-variables-when-saving-to-mat
Respuesta aceptada
Más respuestas (0)
Categorías
Más información sobre Standard File Formats en Centro de ayuda y File Exchange.
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!