Amino acid dipeptide frequency for Gyrovirus GyV7-SF

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
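As an illustration, dipeptide frequencies can be computed along the following lines. This is a minimal sketch, not the database's actual code: the function name `dipeptide_permille` is hypothetical, and it assumes that overlapping adjacent pairs are counted within each protein (never across protein boundaries) and that any residue outside the 20 standard one-letter codes is mapped to X (Xaa).

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping adjacent pairs are counted, and pairs never
    span two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to 'X' (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs at positions (0,1), (1,2), ...
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (not the real proteome): 5 + 3 = 8 dipeptides in total.
freqs = dipeptide_permille(["MARRSA", "ARRB"])
print(freqs["AR"])  # 2 of 8 pairs -> 250.0
```

In the toy input the non-standard letter B is counted as X, so the pair RX appears once.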

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each one would occur with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue, in the table.
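The expected value follows directly from the number of possible pairs. The sketch below works through that arithmetic and labels a value as over- or underrepresented; the helper `classify` is hypothetical, and using the 400-pair baseline as the colour threshold is an assumption, since the page does not state the exact cut-off.

```python
N_STANDARD = 20

# Uniform expectation in per mille: 1000 / (number of possible dipeptides).
expected_400 = 1000.0 / N_STANDARD ** 2        # 2.5 per mille (0.25%)
expected_441 = 1000.0 / (N_STANDARD + 1) ** 2  # about 2.268 per mille with Xaa

def classify(permille, expected=expected_400):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if permille > expected:
        return "over"    # would be marked red
    if permille < expected:
        return "under"   # would be marked blue
    return "expected"

# Two values taken from the table: ArgArg and AlaCys.
print(classify(16.827))  # over
print(classify(2.404))   # under
```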

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
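For example, the unit conversion can be written as below. The function names are hypothetical and only illustrate the scaling.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0

arg_arg = 16.827  # ArgArg value from the table, in per mille
fraction = permille_to_fraction(arg_arg)  # about 0.016827
percent = permille_to_percent(arg_arg)    # about 1.6827
```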

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second. Each cell shows the frequency ± error in ‰.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 6.01 ± 2.755 | 2.404 ± 0.901 | 4.808 ± 1.801 | 0.0 ± 0.0 | 2.404 ± 0.901 | 4.808 ± 1.801 | 2.404 ± 1.966 | 2.404 ± 1.406 | 4.808 ± 0.85 | 6.01 ± 2.755 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.606 ± 3.674 | 3.606 ± 2.108 | 6.01 ± 2.451 | 7.212 ± 2.821 | 9.615 ± 4.123 | 2.404 ± 1.406 | 1.202 ± 0.703 | 1.202 ± 0.703 | 0.0 ± 0.0 |
| Cys | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.202 ± 0.703 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.202 ± 1.223 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.202 ± 0.703 | 0.0 ± 0.0 | 1.202 ± 1.223 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.404 ± 1.406 | 6.01 ± 4.744 | 1.202 ± 1.223 | 0.0 ± 0.0 | 1.202 ± 0.703 | 1.202 ± 0.703 | 0.0 ± 0.0 |
| Asp | 4.808 ± 1.801 | 2.404 ± 1.656 | 6.01 ± 4.438 | 4.808 ± 3.227 | 2.404 ± 0.901 | 3.606 ± 1.056 | 0.0 ± 0.0 | 1.202 ± 1.223 | 1.202 ± 0.703 | 6.01 ± 4.518 | 0.0 ± 0.0 | 0.0 ± 0.0 | 6.01 ± 1.833 | 0.0 ± 0.0 | 0.0 ± 0.0 | 4.808 ± 3.227 | 3.606 ± 1.056 | 2.404 ± 0.901 | 2.404 ± 1.406 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Glu | 1.202 ± 1.905 | 1.202 ± 1.223 | 3.606 ± 3.668 | 1.202 ± 0.703 | 0.0 ± 0.0 | 2.404 ± 2.445 | 2.404 ± 1.656 | 2.404 ± 1.966 | 2.404 ± 0.901 | 2.404 ± 0.901 | 1.202 ± 0.703 | 2.404 ± 1.966 | 2.404 ± 0.901 | 3.606 ± 3.674 | 1.202 ± 1.223 | 4.808 ± 2.811 | 1.202 ± 1.905 | 1.202 ± 0.703 | 0.0 ± 0.0 | 2.404 ± 1.406 | 0.0 ± 0.0 |
| Phe | 2.404 ± 0.901 | 0.0 ± 0.0 | 1.202 ± 1.223 | 0.0 ± 0.0 | 3.606 ± 2.108 | 1.202 ± 0.703 | 1.202 ± 0.703 | 1.202 ± 1.223 | 0.0 ± 0.0 | 1.202 ± 0.703 | 1.202 ± 0.703 | 4.808 ± 2.811 | 0.0 ± 0.0 | 2.404 ± 0.901 | 7.212 ± 2.112 | 1.202 ± 0.703 | 1.202 ± 0.703 | 1.202 ± 1.223 | 0.0 ± 0.0 | 4.808 ± 2.811 | 0.0 ± 0.0 |
| Gly | 3.606 ± 1.056 | 2.404 ± 0.901 | 4.808 ± 3.227 | 1.202 ± 0.703 | 1.202 ± 0.703 | 8.413 ± 3.507 | 1.202 ± 0.703 | 4.808 ± 1.982 | 3.606 ± 2.029 | 1.202 ± 1.223 | 0.0 ± 0.0 | 2.404 ± 1.406 | 4.808 ± 3.227 | 4.808 ± 1.552 | 4.808 ± 0.85 | 8.413 ± 5.254 | 4.808 ± 0.85 | 3.606 ± 1.056 | 2.404 ± 0.901 | 1.202 ± 1.223 | 0.0 ± 0.0 |
| His | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.202 ± 1.223 | 1.202 ± 1.223 | 0.0 ± 0.0 | 1.202 ± 1.223 | 1.202 ± 0.703 | 3.606 ± 2.029 | 1.202 ± 1.905 | 1.202 ± 1.905 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 4.808 ± 2.811 | 1.202 ± 0.703 | 2.404 ± 1.656 | 1.202 ± 0.703 | 2.404 ± 0.901 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Ile | 2.404 ± 2.445 | 2.404 ± 0.901 | 1.202 ± 0.703 | 1.202 ± 1.223 | 0.0 ± 0.0 | 2.404 ± 3.811 | 1.202 ± 1.223 | 0.0 ± 0.0 | 2.404 ± 1.406 | 2.404 ± 3.811 | 0.0 ± 0.0 | 2.404 ± 1.406 | 4.808 ± 1.552 | 0.0 ± 0.0 | 4.808 ± 1.801 | 0.0 ± 0.0 | 4.808 ± 3.312 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.404 ± 1.656 | 0.0 ± 0.0 |
| Lys | 1.202 ± 0.703 | 0.0 ± 0.0 | 1.202 ± 1.223 | 3.606 ± 3.5 | 3.606 ± 1.056 | 1.202 ± 0.703 | 0.0 ± 0.0 | 1.202 ± 0.703 | 3.606 ± 2.108 | 4.808 ± 3.312 | 0.0 ± 0.0 | 1.202 ± 1.905 | 1.202 ± 0.703 | 2.404 ± 0.901 | 3.606 ± 1.342 | 1.202 ± 0.703 | 8.413 ± 3.772 | 1.202 ± 0.703 | 1.202 ± 0.703 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Leu | 3.606 ± 1.342 | 0.0 ± 0.0 | 4.808 ± 0.85 | 2.404 ± 0.901 | 1.202 ± 0.703 | 8.413 ± 1.849 | 0.0 ± 0.0 | 2.404 ± 3.811 | 0.0 ± 0.0 | 10.817 ± 2.762 | 2.404 ± 1.193 | 2.404 ± 0.901 | 9.615 ± 0.868 | 1.202 ± 0.703 | 8.413 ± 4.48 | 7.212 ± 5.898 | 3.606 ± 1.342 | 2.404 ± 1.656 | 1.202 ± 0.703 | 2.404 ± 1.656 | 0.0 ± 0.0 |
| Met | 3.606 ± 1.686 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.202 ± 1.905 | 1.202 ± 0.703 | 1.202 ± 0.703 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.202 ± 0.703 | 1.202 ± 1.223 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.202 ± 1.905 | 1.202 ± 0.703 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.606 ± 2.108 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.202 ± 0.703 | 0.0 ± 0.0 |
| Asn | 2.404 ± 1.656 | 2.404 ± 0.901 | 0.0 ± 0.0 | 2.404 ± 1.656 | 1.202 ± 1.223 | 2.404 ± 0.901 | 1.202 ± 0.703 | 0.0 ± 0.0 | 2.404 ± 0.901 | 1.202 ± 0.703 | 2.404 ± 1.211 | 1.202 ± 0.703 | 6.01 ± 2.165 | 0.0 ± 0.0 | 2.404 ± 1.656 | 3.606 ± 1.342 | 3.606 ± 2.108 | 2.404 ± 1.406 | 2.404 ± 0.901 | 2.404 ± 2.445 | 0.0 ± 0.0 |
| Pro | 9.615 ± 0.868 | 1.202 ± 1.223 | 1.202 ± 0.703 | 3.606 ± 1.686 | 2.404 ± 0.901 | 7.212 ± 0.57 | 1.202 ± 1.223 | 2.404 ± 0.901 | 0.0 ± 0.0 | 3.606 ± 2.108 | 2.404 ± 1.656 | 6.01 ± 2.165 | 7.212 ± 4.217 | 3.606 ± 2.029 | 4.808 ± 1.552 | 6.01 ± 1.833 | 3.606 ± 2.029 | 6.01 ± 1.262 | 0.0 ± 0.0 | 4.808 ± 2.811 | 0.0 ± 0.0 |
| Gln | 3.606 ± 1.686 | 0.0 ± 0.0 | 4.808 ± 3.227 | 3.606 ± 2.029 | 0.0 ± 0.0 | 3.606 ± 1.056 | 0.0 ± 0.0 | 2.404 ± 1.406 | 2.404 ± 0.901 | 3.606 ± 1.686 | 0.0 ± 0.0 | 1.202 ± 1.905 | 3.606 ± 1.056 | 0.0 ± 0.0 | 2.404 ± 0.901 | 2.404 ± 1.406 | 2.404 ± 2.445 | 1.202 ± 1.905 | 0.0 ± 0.0 | 1.202 ± 0.703 | 0.0 ± 0.0 |
| Arg | 7.212 ± 2.702 | 0.0 ± 0.0 | 3.606 ± 1.056 | 0.0 ± 0.0 | 4.808 ± 2.811 | 6.01 ± 1.833 | 3.606 ± 1.686 | 1.202 ± 1.905 | 6.01 ± 1.262 | 4.808 ± 0.85 | 2.404 ± 1.656 | 2.404 ± 0.901 | 6.01 ± 2.451 | 2.404 ± 1.966 | 16.827 ± 5.469 | 8.413 ± 1.849 | 2.404 ± 1.966 | 7.212 ± 1.237 | 3.606 ± 1.056 | 2.404 ± 1.656 | 0.0 ± 0.0 |
| Ser | 13.221 ± 5.579 | 0.0 ± 0.0 | 2.404 ± 1.406 | 4.808 ± 3.932 | 4.808 ± 1.552 | 6.01 ± 2.98 | 2.404 ± 2.445 | 2.404 ± 1.966 | 6.01 ± 3.267 | 7.212 ± 1.237 | 2.404 ± 1.406 | 2.404 ± 0.901 | 4.808 ± 1.552 | 1.202 ± 1.223 | 4.808 ± 0.85 | 8.413 ± 1.849 | 4.808 ± 4.89 | 3.606 ± 1.342 | 0.0 ± 0.0 | 2.404 ± 1.406 | 0.0 ± 0.0 |
| Thr | 0.0 ± 0.0 | 2.404 ± 0.901 | 2.404 ± 0.901 | 6.01 ± 1.833 | 1.202 ± 0.703 | 4.808 ± 0.85 | 1.202 ± 1.223 | 2.404 ± 1.966 | 0.0 ± 0.0 | 7.212 ± 2.335 | 0.0 ± 0.0 | 3.606 ± 1.686 | 8.413 ± 5.251 | 7.212 ± 0.57 | 6.01 ± 3.267 | 3.606 ± 3.5 | 4.808 ± 0.85 | 8.413 ± 4.343 | 3.606 ± 2.029 | 3.606 ± 1.056 | 0.0 ± 0.0 |
| Val | 2.404 ± 1.406 | 0.0 ± 0.0 | 3.606 ± 3.5 | 1.202 ± 1.223 | 3.606 ± 2.108 | 1.202 ± 1.223 | 1.202 ± 0.703 | 2.404 ± 1.406 | 1.202 ± 1.905 | 4.808 ± 3.312 | 1.202 ± 0.703 | 6.01 ± 1.833 | 2.404 ± 1.406 | 0.0 ± 0.0 | 6.01 ± 1.262 | 7.212 ± 1.237 | 3.606 ± 1.342 | 1.202 ± 0.703 | 1.202 ± 0.703 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Trp | 2.404 ± 1.406 | 0.0 ± 0.0 | 2.404 ± 0.901 | 1.202 ± 0.703 | 1.202 ± 1.223 | 1.202 ± 0.703 | 1.202 ± 0.703 | 0.0 ± 0.0 | 1.202 ± 0.703 | 1.202 ± 1.223 | 0.0 ± 0.894 | 1.202 ± 0.703 | 3.606 ± 2.108 | 3.606 ± 1.056 | 2.404 ± 1.406 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.404 ± 1.406 | 1.202 ± 1.223 | 0.0 ± 0.0 |
| Tyr | 1.202 ± 1.905 | 0.0 ± 0.0 | 1.202 ± 0.703 | 0.0 ± 0.0 | 1.202 ± 0.703 | 1.202 ± 1.223 | 1.202 ± 0.703 | 2.404 ± 1.406 | 1.202 ± 0.703 | 2.404 ± 1.656 | 0.0 ± 0.0 | 1.202 ± 1.223 | 1.202 ± 0.703 | 1.202 ± 0.703 | 2.404 ± 0.901 | 2.404 ± 1.406 | 7.212 ± 0.57 | 4.808 ± 2.811 | 2.404 ± 1.406 | 3.606 ± 2.108 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 3 proteins (833 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
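A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions, not the database's procedure: the function names are hypothetical, proteins are resampled with replacement as whole units, and the error is taken as the standard deviation of the resampled estimates; the page does not specify these details.

```python
import random
import statistics
from collections import Counter

def pair_permille(sequences, pair):
    """Frequency of one dipeptide in per mille over overlapping adjacent pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_sd(sequences, pair, n_resamples=100, seed=0):
    """Standard deviation of the dipeptide frequency across bootstrap resamples.

    Whole proteins are drawn with replacement, so each resample has the
    same number of proteins as the original set.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)

# Toy proteins (not the real proteome).
toy = ["ARRA", "GGGG", "ARAR"]
sd_rr = bootstrap_sd(toy, "RR")
```

With only three proteins, as in this proteome, each resample can differ strongly from the others, which is consistent with the relatively large errors shown in the table.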

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski