Amino acid dipeptide frequency for Torque teno mini virus 12

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
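As an illustration of the idea, the following minimal Python sketch counts overlapping adjacent residue pairs and reports them in per mille. It is not the code used to build this page: the function name `dipeptide_frequencies`, the use of one-letter codes (with X standing for Xaa) and the choice to count pairs within each protein only are assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else becomes X (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins.

    Assumption: pairs are counted inside each protein only, never across
    the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs: positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not taken from this proteome.
freqs = dipeptide_frequencies(["MKKLA", "KKUA"])
print(sorted(freqs.items()))
```

How the database treats protein boundaries is not stated here. The tabulated values look like multiples of about 1.064‰ (roughly 1/940), so the original pipeline may normalise slightly differently from this sketch.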

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
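The uniform expectation and the over/under comparison can be sketched as follows. This is a hedged example: it assumes the colouring simply compares each value with the uniform expectation (1/400, or 1/441 when Xaa is included), which the text implies but does not spell out, and the function names are invented for the illustration. Values are handled in per mille, the unit used in the table.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / (n_residue_types ** 2)


def classify(observed_per_mille, n_residue_types=20):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(n_residue_types)
    if observed_per_mille > expected:
        return "over"   # would be marked red
    if observed_per_mille < expected:
        return "under"  # would be marked blue
    return "as expected"


print(expected_per_mille(20))  # 1/400 expressed in per mille
print(expected_per_mille(21))  # 1/441 expressed in per mille
print(classify(19.149))        # ArgArg value from the table
print(classify(1.064))         # AlaPhe value from the table
```

With 20 residue types the expectation is 2.5‰ (0.25%); with 21 it drops to about 2.27‰.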

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
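For readers who prefer fractions or percentages, the conversion is a single multiplication. The helper names below are made up for this small sketch.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


# ArgArg is listed as 19.149 per mille in the table.
print(per_mille_to_fraction(19.149))
print(per_mille_to_percent(19.149))
```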

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.128AlaAla: 2.128 ± 1.049
0.0AlaCys: 0.0 ± 0.0
0.0AlaAsp: 0.0 ± 0.0
2.128AlaGlu: 2.128 ± 1.847
1.064AlaPhe: 1.064 ± 0.525
2.128AlaGly: 2.128 ± 1.847
2.128AlaHis: 2.128 ± 1.049
2.128AlaIle: 2.128 ± 1.847
3.191AlaLys: 3.191 ± 1.602
2.128AlaLeu: 2.128 ± 1.049
1.064AlaMet: 1.064 ± 0.525
2.128AlaAsn: 2.128 ± 1.049
4.255AlaPro: 4.255 ± 1.648
2.128AlaGln: 2.128 ± 1.847
2.128AlaArg: 2.128 ± 1.049
3.191AlaSer: 3.191 ± 1.574
2.128AlaThr: 2.128 ± 1.049
1.064AlaVal: 1.064 ± 0.525
0.0AlaTrp: 0.0 ± 0.0
0.0AlaTyr: 0.0 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
2.128CysAsp: 2.128 ± 1.847
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
1.064CysGly: 1.064 ± 0.525
2.128CysHis: 2.128 ± 2.728
0.0CysIle: 0.0 ± 0.0
3.191CysLys: 3.191 ± 1.448
1.064CysLeu: 1.064 ± 2.161
1.064CysMet: 1.064 ± 1.57
1.064CysAsn: 1.064 ± 0.525
1.064CysPro: 1.064 ± 0.525
2.128CysGln: 2.128 ± 1.847
2.128CysArg: 2.128 ± 1.556
1.064CysSer: 1.064 ± 1.816
1.064CysThr: 1.064 ± 0.525
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
0.0AspAla: 0.0 ± 0.0
0.0AspCys: 0.0 ± 0.0
2.128AspAsp: 2.128 ± 4.322
4.255AspGlu: 4.255 ± 1.597
5.319AspPhe: 5.319 ± 2.271
1.064AspGly: 1.064 ± 2.161
1.064AspHis: 1.064 ± 0.525
0.0AspIle: 0.0 ± 0.0
2.128AspLys: 2.128 ± 2.805
2.128AspLeu: 2.128 ± 1.847
0.0AspMet: 0.0 ± 0.0
1.064AspAsn: 1.064 ± 0.525
5.319AspPro: 5.319 ± 2.271
0.0AspGln: 0.0 ± 0.0
2.128AspArg: 2.128 ± 1.049
2.128AspSer: 2.128 ± 1.556
5.319AspThr: 5.319 ± 4.969
4.255AspVal: 4.255 ± 1.597
1.064AspTrp: 1.064 ± 0.525
7.447AspTyr: 7.447 ± 2.321
0.0AspXaa: 0.0 ± 0.0
Glu
2.128GluAla: 2.128 ± 1.847
0.0GluCys: 0.0 ± 0.0
4.255GluAsp: 4.255 ± 3.694
3.191GluGlu: 3.191 ± 1.574
0.0GluPhe: 0.0 ± 0.0
3.191GluGly: 3.191 ± 1.448
1.064GluHis: 1.064 ± 0.525
2.128GluIle: 2.128 ± 1.049
7.447GluLys: 7.447 ± 3.2
5.319GluLeu: 5.319 ± 2.271
0.0GluMet: 0.0 ± 1.711
1.064GluAsn: 1.064 ± 0.525
5.319GluPro: 5.319 ± 1.848
0.0GluGln: 0.0 ± 0.0
3.191GluArg: 3.191 ± 1.645
3.191GluSer: 3.191 ± 5.448
5.319GluThr: 5.319 ± 2.271
0.0GluVal: 0.0 ± 0.0
0.0GluTrp: 0.0 ± 0.0
3.191GluTyr: 3.191 ± 1.574
0.0GluXaa: 0.0 ± 0.0
Phe
3.191PheAla: 3.191 ± 1.645
1.064PheCys: 1.064 ± 1.816
2.128PheAsp: 2.128 ± 1.722
4.255PheGlu: 4.255 ± 1.648
0.0PhePhe: 0.0 ± 0.0
2.128PheGly: 2.128 ± 1.847
1.064PheHis: 1.064 ± 0.525
0.0PheIle: 0.0 ± 0.0
3.191PheLys: 3.191 ± 1.602
1.064PheLeu: 1.064 ± 1.816
1.064PheMet: 1.064 ± 0.525
0.0PheAsn: 0.0 ± 0.0
3.191PhePro: 3.191 ± 1.574
3.191PheGln: 3.191 ± 1.574
1.064PheArg: 1.064 ± 0.525
4.255PheSer: 4.255 ± 1.597
4.255PheThr: 4.255 ± 1.648
1.064PheVal: 1.064 ± 1.816
1.064PheTrp: 1.064 ± 0.525
3.191PheTyr: 3.191 ± 1.602
0.0PheXaa: 0.0 ± 0.0
Gly
2.128GlyAla: 2.128 ± 1.049
2.128GlyCys: 2.128 ± 1.049
4.255GlyAsp: 4.255 ± 4.122
2.128GlyGlu: 2.128 ± 1.847
2.128GlyPhe: 2.128 ± 1.049
3.191GlyGly: 3.191 ± 1.645
0.0GlyHis: 0.0 ± 0.0
0.0GlyIle: 0.0 ± 0.0
1.064GlyLys: 1.064 ± 0.525
5.319GlyLeu: 5.319 ± 1.716
1.064GlyMet: 1.064 ± 1.663
5.319GlyAsn: 5.319 ± 1.761
3.191GlyPro: 3.191 ± 2.911
1.064GlyGln: 1.064 ± 0.525
1.064GlyArg: 1.064 ± 0.525
1.064GlySer: 1.064 ± 0.525
3.191GlyThr: 3.191 ± 1.645
1.064GlyVal: 1.064 ± 2.161
1.064GlyTrp: 1.064 ± 0.525
1.064GlyTyr: 1.064 ± 0.525
0.0GlyXaa: 0.0 ± 0.0
His
1.064HisAla: 1.064 ± 2.161
1.064HisCys: 1.064 ± 1.816
2.128HisAsp: 2.128 ± 1.847
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
3.191HisLys: 3.191 ± 1.602
2.128HisLeu: 2.128 ± 1.556
1.064HisMet: 1.064 ± 0.525
2.128HisAsn: 2.128 ± 1.722
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
4.255HisArg: 4.255 ± 5.148
0.0HisSer: 0.0 ± 0.0
2.128HisThr: 2.128 ± 1.847
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
2.128HisTyr: 2.128 ± 1.049
0.0HisXaa: 0.0 ± 0.0
Ile
1.064IleAla: 1.064 ± 0.525
1.064IleCys: 1.064 ± 0.525
1.064IleAsp: 1.064 ± 0.525
2.128IleGlu: 2.128 ± 1.847
2.128IlePhe: 2.128 ± 1.049
3.191IleGly: 3.191 ± 1.645
1.064IleHis: 1.064 ± 1.816
3.191IleIle: 3.191 ± 1.645
6.383IleLys: 6.383 ± 1.972
5.319IleLeu: 5.319 ± 2.623
0.0IleMet: 0.0 ± 0.0
3.191IleAsn: 3.191 ± 1.645
2.128IlePro: 2.128 ± 1.556
2.128IleGln: 2.128 ± 1.556
0.0IleArg: 0.0 ± 0.0
5.319IleSer: 5.319 ± 2.623
6.383IleThr: 6.383 ± 2.161
1.064IleVal: 1.064 ± 2.161
0.0IleTrp: 0.0 ± 0.0
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
3.191LysAla: 3.191 ± 1.574
0.0LysCys: 0.0 ± 0.0
2.128LysAsp: 2.128 ± 1.049
1.064LysGlu: 1.064 ± 0.525
2.128LysPhe: 2.128 ± 1.847
2.128LysGly: 2.128 ± 1.847
2.128LysHis: 2.128 ± 1.556
6.383LysIle: 6.383 ± 1.972
8.511LysLys: 8.511 ± 2.129
10.638LysLeu: 10.638 ± 4.107
2.128LysMet: 2.128 ± 1.049
3.191LysAsn: 3.191 ± 1.602
8.511LysPro: 8.511 ± 4.844
2.128LysGln: 2.128 ± 1.049
4.255LysArg: 4.255 ± 1.524
4.255LysSer: 4.255 ± 1.783
7.447LysThr: 7.447 ± 2.511
2.128LysVal: 2.128 ± 1.049
4.255LysTrp: 4.255 ± 1.648
4.255LysTyr: 4.255 ± 2.099
0.0LysXaa: 0.0 ± 0.0
Leu
2.128LeuAla: 2.128 ± 1.722
1.064LeuCys: 1.064 ± 2.161
3.191LeuAsp: 3.191 ± 1.574
2.128LeuGlu: 2.128 ± 3.31
5.319LeuPhe: 5.319 ± 1.369
2.128LeuGly: 2.128 ± 1.049
2.128LeuHis: 2.128 ± 1.847
1.064LeuIle: 1.064 ± 0.525
6.383LeuLys: 6.383 ± 1.972
10.638LeuLeu: 10.638 ± 5.246
1.064LeuMet: 1.064 ± 0.525
8.511LeuAsn: 8.511 ± 2.929
6.383LeuPro: 6.383 ± 2.161
12.766LeuGln: 12.766 ± 5.477
3.191LeuArg: 3.191 ± 1.574
6.383LeuSer: 6.383 ± 1.972
9.574LeuThr: 9.574 ± 2.356
2.128LeuVal: 2.128 ± 1.722
1.064LeuTrp: 1.064 ± 0.525
5.319LeuTyr: 5.319 ± 1.761
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.0MetCys: 0.0 ± 0.0
2.128MetAsp: 2.128 ± 1.049
1.064MetGlu: 1.064 ± 0.525
2.128MetPhe: 2.128 ± 2.805
1.064MetGly: 1.064 ± 0.525
1.064MetHis: 1.064 ± 1.979
2.128MetIle: 2.128 ± 1.049
2.128MetLys: 2.128 ± 1.556
0.0MetLeu: 0.0 ± 0.0
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
2.128MetPro: 2.128 ± 1.722
1.064MetGln: 1.064 ± 0.525
2.128MetArg: 2.128 ± 1.049
2.128MetSer: 2.128 ± 1.847
0.0MetThr: 0.0 ± 0.0
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.128AsnAla: 2.128 ± 1.847
2.128AsnCys: 2.128 ± 1.049
4.255AsnAsp: 4.255 ± 1.648
1.064AsnGlu: 1.064 ± 1.816
4.255AsnPhe: 4.255 ± 3.444
1.064AsnGly: 1.064 ± 1.816
0.0AsnHis: 0.0 ± 0.0
2.128AsnIle: 2.128 ± 1.556
5.319AsnLys: 5.319 ± 1.761
4.255AsnLeu: 4.255 ± 2.099
1.064AsnMet: 1.064 ± 0.525
4.255AsnAsn: 4.255 ± 2.099
4.255AsnPro: 4.255 ± 1.524
4.255AsnGln: 4.255 ± 3.279
1.064AsnArg: 1.064 ± 1.979
2.128AsnSer: 2.128 ± 1.556
6.383AsnThr: 6.383 ± 2.104
3.191AsnVal: 3.191 ± 1.602
2.128AsnTrp: 2.128 ± 1.847
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
1.064ProAla: 1.064 ± 1.979
1.064ProCys: 1.064 ± 2.161
3.191ProAsp: 3.191 ± 1.602
5.319ProGlu: 5.319 ± 2.623
4.255ProPhe: 4.255 ± 1.648
3.191ProGly: 3.191 ± 1.574
0.0ProHis: 0.0 ± 0.0
2.128ProIle: 2.128 ± 1.049
4.255ProLys: 4.255 ± 1.648
8.511ProLeu: 8.511 ± 3.609
2.128ProMet: 2.128 ± 1.049
5.319ProAsn: 5.319 ± 1.794
10.638ProPro: 10.638 ± 2.923
5.319ProGln: 5.319 ± 2.623
4.255ProArg: 4.255 ± 3.279
3.191ProSer: 3.191 ± 1.448
4.255ProThr: 4.255 ± 2.099
3.191ProVal: 3.191 ± 1.448
1.064ProTrp: 1.064 ± 1.979
4.255ProTyr: 4.255 ± 2.099
0.0ProXaa: 0.0 ± 0.0
Gln
3.191GlnAla: 3.191 ± 1.574
2.128GlnCys: 2.128 ± 4.322
2.128GlnAsp: 2.128 ± 1.556
1.064GlnGlu: 1.064 ± 2.161
1.064GlnPhe: 1.064 ± 0.525
1.064GlnGly: 1.064 ± 0.525
2.128GlnHis: 2.128 ± 1.556
0.0GlnIle: 0.0 ± 0.0
3.191GlnLys: 3.191 ± 1.448
6.383GlnLeu: 6.383 ± 2.267
1.064GlnMet: 1.064 ± 1.979
5.319GlnAsn: 5.319 ± 2.961
6.383GlnPro: 6.383 ± 3.148
1.064GlnGln: 1.064 ± 0.525
5.319GlnArg: 5.319 ± 1.761
1.064GlnSer: 1.064 ± 0.525
5.319GlnThr: 5.319 ± 1.761
0.0GlnVal: 0.0 ± 0.0
2.128GlnTrp: 2.128 ± 1.556
1.064GlnTyr: 1.064 ± 0.525
0.0GlnXaa: 0.0 ± 0.0
Arg
1.064ArgAla: 1.064 ± 0.525
1.064ArgCys: 1.064 ± 0.525
0.0ArgAsp: 0.0 ± 0.0
0.0ArgGlu: 0.0 ± 0.0
1.064ArgPhe: 1.064 ± 0.525
3.191ArgGly: 3.191 ± 1.574
3.191ArgHis: 3.191 ± 1.645
6.383ArgIle: 6.383 ± 1.664
6.383ArgLys: 6.383 ± 2.897
5.319ArgLeu: 5.319 ± 1.369
0.0ArgMet: 0.0 ± 0.0
2.128ArgAsn: 2.128 ± 1.049
2.128ArgPro: 2.128 ± 1.556
3.191ArgGln: 3.191 ± 5.448
19.149ArgArg: 19.149 ± 5.239
3.191ArgSer: 3.191 ± 1.574
3.191ArgThr: 3.191 ± 1.448
2.128ArgVal: 2.128 ± 1.049
0.0ArgTrp: 0.0 ± 0.0
6.383ArgTyr: 6.383 ± 2.161
0.0ArgXaa: 0.0 ± 0.0
Ser
4.255SerAla: 4.255 ± 2.099
2.128SerCys: 2.128 ± 1.556
5.319SerAsp: 5.319 ± 3.202
7.447SerGlu: 7.447 ± 2.426
1.064SerPhe: 1.064 ± 0.525
1.064SerGly: 1.064 ± 0.525
0.0SerHis: 0.0 ± 0.0
5.319SerIle: 5.319 ± 2.623
4.255SerLys: 4.255 ± 3.641
5.319SerLeu: 5.319 ± 1.369
2.128SerMet: 2.128 ± 0.981
3.191SerAsn: 3.191 ± 3.341
4.255SerPro: 4.255 ± 2.099
0.0SerGln: 0.0 ± 0.0
1.064SerArg: 1.064 ± 1.816
11.702SerSer: 11.702 ± 13.636
3.191SerThr: 3.191 ± 1.574
1.064SerVal: 1.064 ± 0.525
0.0SerTrp: 0.0 ± 0.0
2.128SerTyr: 2.128 ± 1.556
0.0SerXaa: 0.0 ± 0.0
Thr
4.255ThrAla: 4.255 ± 2.099
2.128ThrCys: 2.128 ± 1.556
2.128ThrAsp: 2.128 ± 1.847
7.447ThrGlu: 7.447 ± 3.204
1.064ThrPhe: 1.064 ± 1.816
7.447ThrGly: 7.447 ± 6.169
1.064ThrHis: 1.064 ± 1.816
9.574ThrIle: 9.574 ± 4.935
6.383ThrLys: 6.383 ± 3.148
8.511ThrLeu: 8.511 ± 2.726
1.064ThrMet: 1.064 ± 1.816
4.255ThrAsn: 4.255 ± 1.597
2.128ThrPro: 2.128 ± 1.049
5.319ThrGln: 5.319 ± 2.623
1.064ThrArg: 1.064 ± 0.525
8.511ThrSer: 8.511 ± 6.226
6.383ThrThr: 6.383 ± 1.057
6.383ThrVal: 6.383 ± 3.148
1.064ThrTrp: 1.064 ± 0.525
0.0ThrTyr: 0.0 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
0.0ValAla: 0.0 ± 0.0
1.064ValCys: 1.064 ± 0.525
4.255ValAsp: 4.255 ± 1.597
3.191ValGlu: 3.191 ± 1.645
2.128ValPhe: 2.128 ± 1.049
1.064ValGly: 1.064 ± 0.525
0.0ValHis: 0.0 ± 0.0
3.191ValIle: 3.191 ± 1.448
1.064ValLys: 1.064 ± 0.525
2.128ValLeu: 2.128 ± 1.049
1.064ValMet: 1.064 ± 0.525
2.128ValAsn: 2.128 ± 3.957
1.064ValPro: 1.064 ± 0.525
2.128ValGln: 2.128 ± 1.049
1.064ValArg: 1.064 ± 0.525
0.0ValSer: 0.0 ± 0.0
5.319ValThr: 5.319 ± 1.369
1.064ValVal: 1.064 ± 2.161
0.0ValTrp: 0.0 ± 0.0
2.128ValTyr: 2.128 ± 1.049
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
1.064TrpPhe: 1.064 ± 0.525
2.128TrpGly: 2.128 ± 1.049
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
4.255TrpLeu: 4.255 ± 1.648
1.064TrpMet: 1.064 ± 1.979
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
1.064TrpGln: 1.064 ± 0.525
4.255TrpArg: 4.255 ± 2.099
1.064TrpSer: 1.064 ± 0.525
3.191TrpThr: 3.191 ± 2.244
0.0TrpVal: 0.0 ± 0.0
1.064TrpTrp: 1.064 ± 0.525
1.064TrpTyr: 1.064 ± 0.525
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.128TyrAla: 2.128 ± 1.049
1.064TyrCys: 1.064 ± 1.816
0.0TyrAsp: 0.0 ± 0.0
3.191TyrGlu: 3.191 ± 1.574
4.255TyrPhe: 4.255 ± 2.099
0.0TyrGly: 0.0 ± 0.0
1.064TyrHis: 1.064 ± 1.979
1.064TyrIle: 1.064 ± 0.525
3.191TyrLys: 3.191 ± 1.645
2.128TyrLeu: 2.128 ± 1.049
0.0TyrMet: 0.0 ± 0.0
1.064TyrAsn: 1.064 ± 0.525
4.255TyrPro: 4.255 ± 1.648
2.128TyrGln: 2.128 ± 1.049
6.383TyrArg: 6.383 ± 2.104
1.064TyrSer: 1.064 ± 0.525
2.128TyrThr: 2.128 ± 1.049
4.255TyrVal: 4.255 ± 2.099
4.255TyrTrp: 4.255 ± 2.099
5.319TyrTyr: 5.319 ± 2.271
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (941 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level
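A protein-level bootstrap of this kind can be sketched as below. The exact procedure behind the reported errors is not described on this page, so several details are assumptions: whole proteins are resampled with replacement, the resample has the same number of proteins as the original set, and the error is taken as the population standard deviation over the resamples. All names are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_frequencies(proteins):
    """Pooled dipeptide frequencies in per mille (pairs counted per protein)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_errors(proteins, n_resamples=100, seed=0):
    """Spread of each dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    samples = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        resample = [rng.choice(proteins) for _ in proteins]
        samples.append(pair_frequencies(resample))
    pairs = set().union(*samples)
    # A dipeptide missing from a resample counts as 0 per mille there.
    return {
        pair: statistics.pstdev([s.get(pair, 0.0) for s in samples])
        for pair in pairs
    }


# Hypothetical toy sequences, not taken from this proteome.
errors = bootstrap_errors(["MKKLA", "KKRRA", "AAAA"], seed=1)
print(sorted(errors.items()))
```

With only a handful of proteins, as here (4), each resample can change the pooled composition considerably, which may explain why some tabulated errors are as large as the values themselves.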

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski