Checking repetition of random data

4 visualizaciones (últimos 30 días)
Student for ever
Student for ever el 4 de En. de 2018
Editada: Student for ever el 9 de En. de 2018
I heed your help please. I made a random data for example T1 = randn (1000,1); T2= randn (1000,1); .... T100=randn (1000,1); and I want check whether there is any repetition for T's if so then remove it. How can I do that ?? Thanks in advance :)
Regards, Ahmed
  11 comentarios
Star Strider
Star Strider el 7 de En. de 2018
@Ahmed — See the documentation on rng (link), and more generally, the discussion on Generate Random Numbers That Are Repeatable (link).
Student for ever
Student for ever el 7 de En. de 2018
Editada: Jan el 7 de En. de 2018
Dear Jan,
I am new in matlab :), May my question is not clear, but your answer it is so close of what I want to do I think. I have 200 timeseries, which were come from parallel computation and I just want make sure that 200 are not repeated (what I mean, if I make plot for them they should give me different graphs). So, I put all the 200 timeseries as a matrix, it will be 200 column , then I just want check these columns not the same.

Iniciar sesión para comentar.

Respuesta aceptada

Jan
Jan el 4 de En. de 2018
Editada: Jan el 8 de En. de 2018
Do not create a list of variables called T1, T2, ... See https://www.mathworks.com/matlabcentral/answers/57445-faq-how-can-i-create-variables-a1-a2-a10-in-a-loop. Use a cell or multidimensional array instead.
I assume your problem is to have no repeated values inside each vector and between all vectors. Then you need 1000*100 different random numbers at first:
ready = false;
while ~ready
Pool = rand(1, 100000);
ready = (length(unique(Pool)) == length(Pool));
end
T = reshape(Pool, 1000, 100);
Maybe this is faster:
ready = all(diff(sort(Pool)));
[EDITED] If all you want is to create a unique set of vectors, and randn was just an example to create test data for the forum:
[T, Idx] = unique(T, 'rows')
[EDITED] And for unique columns:
T = unique(T.', 'rows').'

Más respuestas (2)

Birdman
Birdman el 4 de En. de 2018
Firstly, generate random data as follows:
T=randn(1000,100);
Secondly, as Adam said, use unique function to check repetitions.
Tun=unique(T,'stable');
stable command helps to protect the initial order of values.
  5 comentarios
Jan
Jan el 4 de En. de 2018
A "habit"? :-) I'd suggest to use time consuming methods only, if they are needed for the results.
Birdman
Birdman el 4 de En. de 2018
It is needed for result, exactly.

Iniciar sesión para comentar.


John BG
John BG el 5 de En. de 2018
Hi Ahmed
so far, the supplied answers increase the probability to generate all-different, random Ts.
Each of the answers improves generation randomness, yet if you really want to make sure that all T sequences are different, once generated, let's say you don't really have control on the randomness of the data and the the suggested randn(1000,1) is you model, then there's no other way than comparing them by pairs.
1.
Let be N the amount of T sequences
N=5
2.
then all possible pairs of T sequences are
L=combinator(N,2,'c')
=
1 2
1 3
1 4
1 5
2 3
2 4
2 5
3 4
3 5
4 5
3.
As Jan Simon mentions, sometimes it's more practical to put all data in a structure that can be indexed, instead of working with N different sequence names.
Let be T all your input Ti sequences compiled into a single matrix
T=randi([1 10],N)
T =
8 2 3 9 3
3 5 8 10 9
7 10 3 6 3
7 4 6 2 9
2 6 7 2 3
4.
Checking there are no 2 equal sequences
D=[0 0];
for k=1:1:size(L,1)
if isequal(T(L(k,1),:),T(L(k,2),:))
D=[D;L(k,:)];
end
end
5.
Removing repeated sequences
if size(D,1)>1
D(1,:)=[];
T(D(:,1),:)=[]; % removing one of the repeated identical pairs
end
T
.
Ahmed, I have overwritten some sequences on purpose, so the counter D shows spotted repeated sequences and these simple lines remove all repetition without losing data (when more than one repetition of same given sequence) and it works.
If you find this answer useful would you please be so kind to consider marking my answer as Accepted Answer?
To any other reader, if you find this answer useful please consider clicking on the thumbs-up vote link
thanks in advance for time and attention
John BG
  12 comentarios
Stephen23
Stephen23 el 8 de En. de 2018
Editada: Stephen23 el 8 de En. de 2018
The help clearly states that "Answers can only be accepted by someone other than the author of the question after 7 days of inactivity from the author".
Student for ever
Student for ever el 9 de En. de 2018
Editada: Student for ever el 9 de En. de 2018
Thanks all for helping, your comments its really useful for me. @John BG, I already accept Jan's answer.

Iniciar sesión para comentar.

Categorías

Más información sobre Logical en Help Center y File Exchange.

Etiquetas

Aún no se han introducido etiquetas.

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by