Vu:ensemble de données Manipuler pour tenir compte des mesures répétées
df <- data.frame(
CompanyID=c("Drinkers","Drinkers","Drinkers","Drinkers","Drinkers","Drinkers","Drinkers","Drinkers"
,"Drinkers","Drinkers", "Liquders","Liquders","Liquders","PelletCoffeeCo","PelletCoffeeCo"),
Email= c("[email protected]", "[email protected]","[email protected]","[email protected]", "[email protected]",
"[email protected]", "[email protected]", "[email protected]", "[email protected]", "[email protected]",
"[email protected]","[email protected]","[email protected]","[email protected]",
"[email protected]"),
Day= c("1","2","3","4","5","6","7","8","9","10","1","2","3","1","2"),
var1= c(4,5,5,5,2,3,2,7,6,5,7,6,6,2,3))
je dois comprendre comment se rendre à:
df2 <- data.frame(CompanyID=c("Drinkers","Drinkers","Drinkers","Drinkers","Drinkers","Drinkers","Drinkers","Drinkers"
,"Drinkers","Drinkers", "Liquders","Liquders","Liquders","Liquders","Liquders","Liquders",
"Liquders","Liquders","Liquders","Liquders", "PelletCoffeeCo","PelletCoffeeCo","PelletCoffeeCo",
"PelletCoffeeCo","PelletCoffeeCo","PelletCoffeeCo","PelletCoffeeCo","PelletCoffeeCo",
"PelletCoffeeCo","PelletCoffeeCo"),
Email= c("[email protected]", "[email protected]","[email protected]","[email protected]", "[email protected]",
"[email protected]", "[email protected]", "[email protected]", "[email protected]", "[email protected]",
"[email protected]","[email protected]","[email protected]","[email protected]","[email protected]",
"[email protected]","[email protected]","[email protected]","[email protected]","[email protected]","[email protected]",
"[email protected]","[email protected]","[email protected]","[email protected]",
"[email protected]","[email protected]","[email protected]","[email protected]",
"[email protected]"),
Day= c("1","2","3","4","5","6","7","8","9","10","1","2","3","4","5","6","7","8","9","10",
"1","2","3","4","5","6","7","8","9","10"),
var1= c(4,5,5,5,2,3,2,7,6,5,7,6,6, NA,NA,NA,NA,NA,NA,NA, 2,3,NA,NA,NA,NA,NA,NA,NA,NA))
Explication: J'ai données où je a sondé les gens une fois par jour sur un cours de 10 jours. Dans un monde parfait, j'aurais 10 réponses de chaque participant, notées day1: day10. Cependant, en raison de la non-réponse, certains participants ont donné 3 réponses, d'autres 6, et 10 et ainsi de suite. Je mets les données en place pour lancer un modèle de croissance, et j'ai donc besoin de la colonne Jour pour toujours lire Jour1 - Jour 10, peu importe s'il y a des données pour ces réponses. J'ai essayé de le démontrer en ajoutant NA aux lignes qui n'ont pas tous les 10 jours de données.
Comment procéder?
Merci à l'avance!
Génial! Merci beaucoup. Ça a marché comme sur des roulettes. J'ai un certain nombre d'autres variables, x1: x10, j'espère que ça va fonctionner de la même manière. Pourriez-vous expliquer les fonctions? Je vois comment cela fonctionne, mais je ne sais pas comment fonctionne l'imbrication complète et l'imbrication - et alors pourquoi le besoin d'ajouter l'argument data.frame à la fin? – D500
@ D500 - Pas de problème. Voir l'explication ajoutée ci-dessus. – www