Amino acid dipeptide frequency for Torque teno midi virus 10

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
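As a rough illustration of how such frequencies can be obtained, the sketch below counts adjacent residue pairs in a set of protein sequences and reports them in per mille. It is not the database's own code: the function name `dipeptide_permille`, the use of one-letter codes, the overlapping-window counting, and the choice of the total pair count as the denominator are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over adjacent residue pairs.

    Assumption: pairs are counted with an overlapping window and never
    span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

# Two toy sequences (hypothetical, not from this proteome).
freqs = dipeptide_permille(["MKKLA", "AKKT"])
print(round(freqs["KK"], 3))
```

In the toy input there are seven pairs in total and KK occurs twice, so its frequency is 2/7 of all pairs.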

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins uniformly at random, each of the 400 would be expected at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility the dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
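The expectation and the red/blue marking can be sketched in a few lines. The helper names `expected_permille` and `classify` are hypothetical, and the simple greater-than/less-than comparison against the uniform expectation is an assumption; the page does not state the exact rule it uses for colouring.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_permille, alphabet_size=20):
    """Label a table value relative to the uniform expectation (assumed rule)."""
    expected = expected_permille(alphabet_size)
    if observed_permille > expected:
        return "over"    # would be marked red
    if observed_permille < expected:
        return "under"   # would be marked blue
    return "expected"

print(expected_permille())   # 20 amino acids, 400 dipeptides
print(classify(16.975))      # AlaAla value from the table
print(classify(0.772))       # AlaPhe value from the table
```

With 21 symbols (including Xaa) the same function gives 1000/441, roughly 2.27‰.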

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
16.975AlaAla: 16.975 ± 6.334
0.0AlaCys: 0.0 ± 0.0
3.086AlaAsp: 3.086 ± 1.156
4.63AlaGlu: 4.63 ± 3.15
0.772AlaPhe: 0.772 ± 0.759
3.858AlaGly: 3.858 ± 0.781
8.488AlaHis: 8.488 ± 1.936
1.543AlaIle: 1.543 ± 0.594
3.858AlaLys: 3.858 ± 1.67
4.63AlaLeu: 4.63 ± 1.596
0.0AlaMet: 0.0 ± 0.0
0.0AlaAsn: 0.0 ± 0.0
1.543AlaPro: 1.543 ± 0.915
3.858AlaGln: 3.858 ± 3.053
0.772AlaArg: 0.772 ± 0.458
4.63AlaSer: 4.63 ± 2.226
6.944AlaThr: 6.944 ± 2.742
0.0AlaVal: 0.0 ± 0.0
0.772AlaTrp: 0.772 ± 0.458
0.772AlaTyr: 0.772 ± 0.458
0.0AlaXaa: 0.0 ± 0.0
Cys
0.772CysAla: 0.772 ± 0.785
1.543CysCys: 1.543 ± 0.915
2.315CysAsp: 2.315 ± 1.575
0.772CysGlu: 0.772 ± 0.458
0.772CysPhe: 0.772 ± 0.458
0.772CysGly: 0.772 ± 0.458
0.0CysHis: 0.0 ± 0.0
0.772CysIle: 0.772 ± 0.458
3.086CysLys: 3.086 ± 1.156
2.315CysLeu: 2.315 ± 1.575
0.772CysMet: 0.772 ± 0.651
0.772CysAsn: 0.772 ± 0.458
0.772CysPro: 0.772 ± 0.785
1.543CysGln: 1.543 ± 0.915
0.772CysArg: 0.772 ± 0.759
0.0CysSer: 0.0 ± 0.0
0.772CysThr: 0.772 ± 0.458
0.772CysVal: 0.772 ± 0.458
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.086AspAla: 3.086 ± 1.156
2.315AspCys: 2.315 ± 1.575
3.858AspAsp: 3.858 ± 0.781
0.0AspGlu: 0.0 ± 0.0
1.543AspPhe: 1.543 ± 0.915
0.772AspGly: 0.772 ± 0.458
0.0AspHis: 0.0 ± 0.0
3.086AspIle: 3.086 ± 1.156
1.543AspLys: 1.543 ± 0.646
3.858AspLeu: 3.858 ± 2.288
0.772AspMet: 0.772 ± 0.458
3.086AspAsn: 3.086 ± 2.071
5.401AspPro: 5.401 ± 0.901
2.315AspGln: 2.315 ± 0.74
3.858AspArg: 3.858 ± 0.781
3.858AspSer: 3.858 ± 0.781
3.086AspThr: 3.086 ± 1.156
0.772AspVal: 0.772 ± 0.458
0.0AspTrp: 0.0 ± 0.0
3.858AspTyr: 3.858 ± 0.781
0.0AspXaa: 0.0 ± 0.0
Glu
1.543GluAla: 1.543 ± 1.032
3.086GluCys: 3.086 ± 2.021
5.401GluAsp: 5.401 ± 2.725
16.975GluGlu: 16.975 ± 7.756
0.772GluPhe: 0.772 ± 0.458
3.858GluGly: 3.858 ± 1.59
3.086GluHis: 3.086 ± 1.156
7.716GluIle: 7.716 ± 2.329
6.173GluLys: 6.173 ± 2.044
4.63GluLeu: 4.63 ± 1.951
0.0GluMet: 0.0 ± 0.0
3.858GluAsn: 3.858 ± 0.781
1.543GluPro: 1.543 ± 0.594
0.772GluGln: 0.772 ± 0.759
1.543GluArg: 1.543 ± 0.915
1.543GluSer: 1.543 ± 0.915
4.63GluThr: 4.63 ± 0.961
0.772GluVal: 0.772 ± 0.458
0.772GluTrp: 0.772 ± 0.458
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
2.315PheAla: 2.315 ± 1.575
0.772PheCys: 0.772 ± 0.759
0.0PheAsp: 0.0 ± 0.0
1.543PheGlu: 1.543 ± 1.518
0.0PhePhe: 0.0 ± 0.0
1.543PheGly: 1.543 ± 0.915
0.772PheHis: 0.772 ± 0.458
0.772PheIle: 0.772 ± 0.458
1.543PheLys: 1.543 ± 0.594
0.772PheLeu: 0.772 ± 0.458
0.0PheMet: 0.0 ± 0.0
2.315PheAsn: 2.315 ± 0.74
3.086PhePro: 3.086 ± 1.156
0.772PheGln: 0.772 ± 0.458
0.772PheArg: 0.772 ± 0.458
0.0PheSer: 0.0 ± 0.0
2.315PheThr: 2.315 ± 0.74
0.772PheVal: 0.772 ± 0.458
0.772PheTrp: 0.772 ± 0.458
8.488PheTyr: 8.488 ± 0.352
0.0PheXaa: 0.0 ± 0.0
Gly
6.944GlyAla: 6.944 ± 2.667
1.543GlyCys: 1.543 ± 0.915
3.858GlyAsp: 3.858 ± 0.781
5.401GlyGlu: 5.401 ± 0.823
1.543GlyPhe: 1.543 ± 0.594
6.173GlyGly: 6.173 ± 0.97
2.315GlyHis: 2.315 ± 1.575
3.858GlyIle: 3.858 ± 0.781
2.315GlyLys: 2.315 ± 1.373
0.0GlyLeu: 0.0 ± 0.0
0.0GlyMet: 0.0 ± 0.0
1.543GlyAsn: 1.543 ± 0.915
4.63GlyPro: 4.63 ± 2.745
2.315GlyGln: 2.315 ± 1.373
4.63GlyArg: 4.63 ± 1.48
0.772GlySer: 0.772 ± 0.458
2.315GlyThr: 2.315 ± 0.798
1.543GlyVal: 1.543 ± 0.915
0.772GlyTrp: 0.772 ± 0.458
0.772GlyTyr: 0.772 ± 0.458
0.0GlyXaa: 0.0 ± 0.0
His
0.772HisAla: 0.772 ± 0.458
2.315HisCys: 2.315 ± 1.575
2.315HisAsp: 2.315 ± 1.575
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
0.772HisGly: 0.772 ± 0.458
0.0HisHis: 0.0 ± 0.0
2.315HisIle: 2.315 ± 1.575
3.858HisLys: 3.858 ± 1.527
6.944HisLeu: 6.944 ± 2.742
0.0HisMet: 0.0 ± 0.0
0.772HisAsn: 0.772 ± 0.458
2.315HisPro: 2.315 ± 1.283
1.543HisGln: 1.543 ± 0.646
0.772HisArg: 0.772 ± 0.785
0.772HisSer: 0.772 ± 0.785
2.315HisThr: 2.315 ± 1.575
0.0HisVal: 0.0 ± 0.0
0.772HisTrp: 0.772 ± 0.458
1.543HisTyr: 1.543 ± 0.646
0.0HisXaa: 0.0 ± 0.0
Ile
3.086IleAla: 3.086 ± 2.071
1.543IleCys: 1.543 ± 0.915
0.772IleAsp: 0.772 ± 0.458
3.086IleGlu: 3.086 ± 2.071
3.086IlePhe: 3.086 ± 1.156
3.858IleGly: 3.858 ± 1.481
3.086IleHis: 3.086 ± 1.156
4.63IleIle: 4.63 ± 0.549
6.173IleLys: 6.173 ± 0.615
5.401IleLeu: 5.401 ± 3.203
0.0IleMet: 0.0 ± 0.0
4.63IleAsn: 4.63 ± 1.246
3.858IlePro: 3.858 ± 1.59
2.315IleGln: 2.315 ± 1.363
2.315IleArg: 2.315 ± 1.173
3.086IleSer: 3.086 ± 1.292
2.315IleThr: 2.315 ± 1.373
2.315IleVal: 2.315 ± 1.373
0.0IleTrp: 0.0 ± 0.0
0.772IleTyr: 0.772 ± 0.458
0.0IleXaa: 0.0 ± 0.0
Lys
4.63LysAla: 4.63 ± 1.024
1.543LysCys: 1.543 ± 0.594
3.858LysAsp: 3.858 ± 0.652
6.944LysGlu: 6.944 ± 2.438
3.086LysPhe: 3.086 ± 1.83
3.858LysGly: 3.858 ± 2.288
3.858LysHis: 3.858 ± 1.378
4.63LysIle: 4.63 ± 1.951
9.259LysLys: 9.259 ± 1.403
11.574LysLeu: 11.574 ± 1.768
0.0LysMet: 0.0 ± 0.0
1.543LysAsn: 1.543 ± 0.594
6.173LysPro: 6.173 ± 2.044
4.63LysGln: 4.63 ± 1.596
5.401LysArg: 5.401 ± 3.489
3.086LysSer: 3.086 ± 1.292
10.031LysThr: 10.031 ± 1.714
0.772LysVal: 0.772 ± 0.458
1.543LysTrp: 1.543 ± 0.915
4.63LysTyr: 4.63 ± 1.18
0.0LysXaa: 0.0 ± 0.0
Leu
5.401LeuAla: 5.401 ± 0.901
0.772LeuCys: 0.772 ± 0.458
3.086LeuAsp: 3.086 ± 1.156
8.488LeuGlu: 8.488 ± 1.841
0.772LeuPhe: 0.772 ± 0.759
2.315LeuGly: 2.315 ± 1.373
2.315LeuHis: 2.315 ± 0.74
2.315LeuIle: 2.315 ± 0.798
8.488LeuLys: 8.488 ± 3.473
11.574LeuLeu: 11.574 ± 1.379
0.772LeuMet: 0.772 ± 0.785
5.401LeuAsn: 5.401 ± 0.901
2.315LeuPro: 2.315 ± 0.74
4.63LeuGln: 4.63 ± 1.951
3.086LeuArg: 3.086 ± 1.078
4.63LeuSer: 4.63 ± 1.246
6.173LeuThr: 6.173 ± 1.331
5.401LeuVal: 5.401 ± 2.349
0.772LeuTrp: 0.772 ± 0.458
2.315LeuTyr: 2.315 ± 1.373
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.772MetCys: 0.772 ± 0.458
0.0MetAsp: 0.0 ± 0.0
0.772MetGlu: 0.772 ± 0.785
0.0MetPhe: 0.0 ± 0.0
0.772MetGly: 0.772 ± 0.458
0.0MetHis: 0.0 ± 0.0
3.858MetIle: 3.858 ± 1.59
0.0MetLys: 0.0 ± 0.0
0.0MetLeu: 0.0 ± 0.0
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
2.315MetPro: 2.315 ± 0.74
2.315MetGln: 2.315 ± 1.373
0.0MetArg: 0.0 ± 0.0
5.401MetSer: 5.401 ± 2.725
0.0MetThr: 0.0 ± 0.0
0.772MetVal: 0.772 ± 0.458
2.315MetTrp: 2.315 ± 1.575
0.772MetTyr: 0.772 ± 0.458
0.0MetXaa: 0.0 ± 0.0
Asn
0.0AsnAla: 0.0 ± 0.0
0.0AsnCys: 0.0 ± 0.0
1.543AsnAsp: 1.543 ± 0.594
1.543AsnGlu: 1.543 ± 0.915
1.543AsnPhe: 1.543 ± 0.594
4.63AsnGly: 4.63 ± 0.549
0.0AsnHis: 0.0 ± 0.0
4.63AsnIle: 4.63 ± 1.246
4.63AsnLys: 4.63 ± 0.549
5.401AsnLeu: 5.401 ± 1.792
0.0AsnMet: 0.0 ± 0.0
1.543AsnAsn: 1.543 ± 0.915
4.63AsnPro: 4.63 ± 1.951
3.858AsnGln: 3.858 ± 1.647
6.173AsnArg: 6.173 ± 2.625
1.543AsnSer: 1.543 ± 0.915
2.315AsnThr: 2.315 ± 0.74
1.543AsnVal: 1.543 ± 0.915
0.0AsnTrp: 0.0 ± 0.0
4.63AsnTyr: 4.63 ± 1.246
0.0AsnXaa: 0.0 ± 0.0
Pro
3.858ProAla: 3.858 ± 1.061
0.0ProCys: 0.0 ± 0.0
3.086ProAsp: 3.086 ± 1.83
3.086ProGlu: 3.086 ± 1.156
6.173ProPhe: 6.173 ± 2.044
4.63ProGly: 4.63 ± 1.944
0.0ProHis: 0.0 ± 0.0
1.543ProIle: 1.543 ± 0.915
6.944ProLys: 6.944 ± 1.68
6.173ProLeu: 6.173 ± 1.237
0.772ProMet: 0.772 ± 0.458
2.315ProAsn: 2.315 ± 0.74
5.401ProPro: 5.401 ± 2.43
3.858ProGln: 3.858 ± 1.85
2.315ProArg: 2.315 ± 0.74
1.543ProSer: 1.543 ± 0.915
4.63ProThr: 4.63 ± 0.853
2.315ProVal: 2.315 ± 1.575
2.315ProTrp: 2.315 ± 0.74
2.315ProTyr: 2.315 ± 1.373
0.0ProXaa: 0.0 ± 0.0
Gln
4.63GlnAla: 4.63 ± 2.567
0.772GlnCys: 0.772 ± 0.785
0.772GlnAsp: 0.772 ± 0.458
2.315GlnGlu: 2.315 ± 1.373
0.772GlnPhe: 0.772 ± 0.759
1.543GlnGly: 1.543 ± 0.915
0.772GlnHis: 0.772 ± 0.759
0.772GlnIle: 0.772 ± 0.785
4.63GlnLys: 4.63 ± 0.961
3.086GlnLeu: 3.086 ± 0.469
3.858GlnMet: 3.858 ± 1.315
3.086GlnAsn: 3.086 ± 0.469
4.63GlnPro: 4.63 ± 1.024
5.401GlnGln: 5.401 ± 0.959
2.315GlnArg: 2.315 ± 0.659
2.315GlnSer: 2.315 ± 1.373
4.63GlnThr: 4.63 ± 0.549
0.772GlnVal: 0.772 ± 0.458
1.543GlnTrp: 1.543 ± 0.915
2.315GlnTyr: 2.315 ± 0.798
0.0GlnXaa: 0.0 ± 0.0
Arg
2.315ArgAla: 2.315 ± 0.74
0.0ArgCys: 0.0 ± 0.0
3.086ArgAsp: 3.086 ± 2.071
0.0ArgGlu: 0.0 ± 0.0
3.086ArgPhe: 3.086 ± 1.156
2.315ArgGly: 2.315 ± 0.74
0.772ArgHis: 0.772 ± 0.458
3.086ArgIle: 3.086 ± 1.83
6.173ArgLys: 6.173 ± 2.584
3.086ArgLeu: 3.086 ± 1.129
1.543ArgMet: 1.543 ± 0.837
7.716ArgAsn: 7.716 ± 1.176
3.858ArgPro: 3.858 ± 1.98
2.315ArgGln: 2.315 ± 0.659
15.432ArgArg: 15.432 ± 8.242
2.315ArgSer: 2.315 ± 0.659
1.543ArgThr: 1.543 ± 0.594
0.0ArgVal: 0.0 ± 0.0
1.543ArgTrp: 1.543 ± 0.594
3.086ArgTyr: 3.086 ± 1.83
0.0ArgXaa: 0.0 ± 0.0
Ser
4.63SerAla: 4.63 ± 0.549
0.772SerCys: 0.772 ± 0.458
1.543SerAsp: 1.543 ± 0.915
3.086SerGlu: 3.086 ± 1.292
1.543SerPhe: 1.543 ± 0.915
5.401SerGly: 5.401 ± 0.823
3.086SerHis: 3.086 ± 2.021
3.086SerIle: 3.086 ± 1.078
5.401SerLys: 5.401 ± 2.617
1.543SerLeu: 1.543 ± 0.915
3.086SerMet: 3.086 ± 1.968
0.772SerAsn: 0.772 ± 0.458
0.772SerPro: 0.772 ± 0.759
0.0SerGln: 0.0 ± 0.0
4.63SerArg: 4.63 ± 1.18
10.802SerSer: 10.802 ± 10.158
5.401SerThr: 5.401 ± 1.999
0.0SerVal: 0.0 ± 0.0
0.0SerTrp: 0.0 ± 0.0
0.772SerTyr: 0.772 ± 0.458
0.0SerXaa: 0.0 ± 0.0
Thr
3.086ThrAla: 3.086 ± 1.156
0.772ThrCys: 0.772 ± 0.458
6.173ThrAsp: 6.173 ± 0.97
5.401ThrGlu: 5.401 ± 0.901
0.772ThrPhe: 0.772 ± 0.458
1.543ThrGly: 1.543 ± 0.915
1.543ThrHis: 1.543 ± 1.571
5.401ThrIle: 5.401 ± 2.387
6.173ThrLys: 6.173 ± 0.7
5.401ThrLeu: 5.401 ± 0.642
1.543ThrMet: 1.543 ± 0.594
5.401ThrAsn: 5.401 ± 2.349
3.858ThrPro: 3.858 ± 1.061
6.173ThrGln: 6.173 ± 1.512
2.315ThrArg: 2.315 ± 0.74
5.401ThrSer: 5.401 ± 3.489
3.858ThrThr: 3.858 ± 1.378
4.63ThrVal: 4.63 ± 1.246
0.772ThrTrp: 0.772 ± 0.458
1.543ThrTyr: 1.543 ± 0.594
0.0ThrXaa: 0.0 ± 0.0
Val
2.315ValAla: 2.315 ± 1.575
0.772ValCys: 0.772 ± 0.458
0.772ValAsp: 0.772 ± 0.458
2.315ValGlu: 2.315 ± 0.74
0.772ValPhe: 0.772 ± 0.458
2.315ValGly: 2.315 ± 1.575
0.0ValHis: 0.0 ± 0.0
0.0ValIle: 0.0 ± 0.0
2.315ValLys: 2.315 ± 1.373
1.543ValLeu: 1.543 ± 0.915
1.543ValMet: 1.543 ± 0.915
1.543ValAsn: 1.543 ± 0.594
1.543ValPro: 1.543 ± 0.915
1.543ValGln: 1.543 ± 0.915
0.772ValArg: 0.772 ± 0.458
1.543ValSer: 1.543 ± 0.915
1.543ValThr: 1.543 ± 0.915
0.0ValVal: 0.0 ± 0.0
0.772ValTrp: 0.772 ± 0.458
0.0ValTyr: 0.0 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.772TrpAsp: 0.772 ± 0.458
0.772TrpGlu: 0.772 ± 0.458
0.772TrpPhe: 0.772 ± 0.458
1.543TrpGly: 1.543 ± 0.915
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.772TrpLys: 0.772 ± 0.458
2.315TrpLeu: 2.315 ± 0.74
3.086TrpMet: 3.086 ± 1.156
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
1.543TrpGln: 1.543 ± 0.915
1.543TrpArg: 1.543 ± 0.594
0.0TrpSer: 0.0 ± 0.0
0.772TrpThr: 0.772 ± 0.458
0.0TrpVal: 0.0 ± 0.0
0.772TrpTrp: 0.772 ± 0.458
1.543TrpTyr: 1.543 ± 0.915
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.772TyrAla: 0.772 ± 0.785
0.0TyrCys: 0.0 ± 0.0
0.772TyrAsp: 0.772 ± 0.458
1.543TyrGlu: 1.543 ± 0.915
1.543TyrPhe: 1.543 ± 0.915
0.772TyrGly: 0.772 ± 0.458
0.0TyrHis: 0.0 ± 0.0
2.315TyrIle: 2.315 ± 1.575
7.716TyrLys: 7.716 ± 1.197
0.772TyrLeu: 0.772 ± 0.785
1.543TyrMet: 1.543 ± 0.915
3.858TyrAsn: 3.858 ± 0.781
4.63TyrPro: 4.63 ± 1.909
0.0TyrGln: 0.0 ± 0.0
3.858TyrArg: 3.858 ± 2.288
3.086TyrSer: 3.086 ± 1.83
6.173TyrThr: 6.173 ± 0.615
0.772TyrVal: 0.772 ± 0.458
0.0TyrTrp: 0.0 ± 0.0
0.772TyrTyr: 0.772 ± 0.458
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1297 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
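A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. This is only an illustration under stated assumptions. The function names, the fixed seed, the overlapping-pair counting, and the use of the population standard deviation as the error measure are not taken from the database.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_sd(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of a dipeptide's per-mille frequency over protein-level resamples."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement, as many as in the original set.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported error is the standard deviation of the estimates.
    return statistics.pstdev(estimates)

# Toy sequences (hypothetical, not from this proteome).
print(bootstrap_sd(["AAKK", "KKAA", "AKAK", "KAKA"], "AA"))
```

With only 4 proteins, as in this proteome, each resample can differ a lot from the original set, which is consistent with the large errors seen for some dipeptides in the table.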

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski