Amino acid dipeptide frequency for Torque teno virus 23

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
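As a rough illustration of the calculation, the sketch below counts overlapping pairs of adjacent residues and normalises by the total number of pairs. It assumes one-letter amino acid codes and counts pairs within each protein separately; the function names are hypothetical, and the exact counting convention used by the database may differ.

```python
from collections import Counter


def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs within each protein sequence."""
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every pair of neighbouring residues
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    return counts


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies as fractions of all counted pairs."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


if __name__ == "__main__":
    # Toy sequences, not taken from the proteome described here
    print(dipeptide_frequencies(["AAC", "CA"]))
```

Pairs that never occur are simply absent from the result; a full 21 x 21 table would report them as 0.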

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred with equal probability in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
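The unit conversion and the comparison against the uniform expectation can be sketched as follows. The names are hypothetical, and the simple greater-than/less-than rule is an assumption; the exact threshold the page uses for its red and blue marking is not stated.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille (2.5)
EXPECTED_PER_MILLE = 1000 / 400


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if value_per_mille > expected:
        return "overrepresented"
    if value_per_mille < expected:
        return "underrepresented"
    return "as expected"


if __name__ == "__main__":
    # Two values taken from the table on this page
    for name, value in [("ArgArg", 32.735), ("AlaCys", 1.056)]:
        print(name, per_mille_to_fraction(value), classify(value))
```

With these values, ArgArg lies well above the 2.5‰ expectation and AlaCys below it.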

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (frequency ± bootstrap error), grouped by first residue. Within each group the second residue follows the column order: Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.

Ala
AlaAla: 7.392 ± 11.946
AlaCys: 1.056 ± 0.499
AlaAsp: 3.168 ± 1.498
AlaGlu: 4.224 ± 1.997
AlaPhe: 2.112 ± 2.862
AlaGly: 6.336 ± 12.445
AlaHis: 2.112 ± 6.722
AlaIle: 1.056 ± 0.499
AlaLys: 1.056 ± 0.499
AlaLeu: 6.336 ± 4.724
AlaMet: 1.056 ± 0.499
AlaAsn: 0.0 ± 0.0
AlaPro: 8.448 ± 7.586
AlaGln: 1.056 ± 0.499
AlaArg: 5.28 ± 1.364
AlaSer: 3.168 ± 2.362
AlaThr: 1.056 ± 0.499
AlaVal: 4.224 ± 5.723
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.056 ± 0.499
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 1.056 ± 0.499
CysPhe: 2.112 ± 2.862
CysGly: 3.168 ± 6.223
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 2.112 ± 0.999
CysGln: 0.0 ± 0.0
CysArg: 3.168 ± 1.498
CysSer: 1.056 ± 0.499
CysThr: 0.0 ± 0.0
CysVal: 3.168 ± 1.498
CysTrp: 0.0 ± 0.0
CysTyr: 1.056 ± 0.499
CysXaa: 0.0 ± 0.0

Asp
AspAla: 7.392 ± 11.946
AspCys: 0.0 ± 0.0
AspAsp: 3.168 ± 1.498
AspGlu: 3.168 ± 1.498
AspPhe: 3.168 ± 1.498
AspGly: 1.056 ± 0.499
AspHis: 0.0 ± 0.0
AspIle: 0.0 ± 0.0
AspLys: 4.224 ± 1.997
AspLeu: 5.28 ± 2.497
AspMet: 1.056 ± 0.499
AspAsn: 0.0 ± 0.0
AspPro: 3.168 ± 1.498
AspGln: 1.056 ± 0.499
AspArg: 2.112 ± 0.999
AspSer: 2.112 ± 0.999
AspThr: 6.336 ± 0.864
AspVal: 5.28 ± 2.497
AspTrp: 1.056 ± 0.499
AspTyr: 2.112 ± 0.999
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.168 ± 1.498
GluCys: 0.0 ± 0.0
GluAsp: 5.28 ± 1.364
GluGlu: 6.336 ± 4.724
GluPhe: 0.0 ± 0.0
GluGly: 1.056 ± 0.499
GluHis: 1.056 ± 0.499
GluIle: 1.056 ± 0.499
GluLys: 3.168 ± 1.498
GluLeu: 3.168 ± 2.362
GluMet: 1.056 ± 0.499
GluAsn: 1.056 ± 0.499
GluPro: 4.224 ± 1.863
GluGln: 3.168 ± 1.498
GluArg: 2.112 ± 2.862
GluSer: 2.112 ± 2.862
GluThr: 1.056 ± 0.499
GluVal: 1.056 ± 0.499
GluTrp: 1.056 ± 0.499
GluTyr: 2.112 ± 2.862
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.056 ± 3.361
PheCys: 1.056 ± 0.499
PheAsp: 2.112 ± 0.999
PheGlu: 3.168 ± 2.362
PhePhe: 0.0 ± 0.0
PheGly: 4.224 ± 1.997
PheHis: 2.112 ± 0.999
PheIle: 1.056 ± 0.499
PheLys: 2.112 ± 0.999
PheLeu: 3.168 ± 1.498
PheMet: 0.0 ± 0.0
PheAsn: 1.056 ± 3.361
PhePro: 2.112 ± 0.999
PheGln: 5.28 ± 2.497
PheArg: 1.056 ± 3.361
PheSer: 1.056 ± 0.499
PheThr: 2.112 ± 0.999
PheVal: 3.168 ± 2.362
PheTrp: 0.0 ± 0.0
PheTyr: 3.168 ± 1.498
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 4.224 ± 1.863
GlyCys: 2.112 ± 2.862
GlyAsp: 4.224 ± 5.723
GlyGlu: 2.112 ± 2.862
GlyPhe: 0.0 ± 0.0
GlyGly: 10.56 ± 22.029
GlyHis: 3.168 ± 2.362
GlyIle: 3.168 ± 2.362
GlyLys: 4.224 ± 1.997
GlyLeu: 3.168 ± 1.498
GlyMet: 2.112 ± 0.999
GlyAsn: 2.112 ± 0.999
GlyPro: 6.336 ± 8.585
GlyGln: 3.168 ± 1.498
GlyArg: 7.392 ± 4.225
GlySer: 4.224 ± 1.997
GlyThr: 1.056 ± 0.499
GlyVal: 2.112 ± 0.999
GlyTrp: 0.0 ± 0.0
GlyTyr: 4.224 ± 1.997
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.056 ± 3.361
HisCys: 0.0 ± 0.0
HisAsp: 1.056 ± 0.499
HisGlu: 0.0 ± 0.0
HisPhe: 1.056 ± 3.361
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 1.056 ± 0.499
HisLys: 0.0 ± 0.0
HisLeu: 3.168 ± 2.362
HisMet: 1.056 ± 0.426
HisAsn: 5.28 ± 5.224
HisPro: 3.168 ± 1.498
HisGln: 2.112 ± 0.999
HisArg: 2.112 ± 0.999
HisSer: 2.112 ± 0.999
HisThr: 3.168 ± 1.498
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0

Ile
IleAla: 1.056 ± 0.499
IleCys: 1.056 ± 0.499
IleAsp: 1.056 ± 0.499
IleGlu: 1.056 ± 3.361
IlePhe: 2.112 ± 0.999
IleGly: 1.056 ± 0.499
IleHis: 2.112 ± 0.999
IleIle: 2.112 ± 0.999
IleLys: 3.168 ± 1.498
IleLeu: 2.112 ± 0.999
IleMet: 0.0 ± 0.0
IleAsn: 2.112 ± 0.999
IlePro: 4.224 ± 1.997
IleGln: 1.056 ± 0.499
IleArg: 2.112 ± 2.862
IleSer: 2.112 ± 0.999
IleThr: 2.112 ± 0.999
IleVal: 3.168 ± 1.498
IleTrp: 1.056 ± 0.499
IleTyr: 3.168 ± 1.498
IleXaa: 0.0 ± 0.0

Lys
LysAla: 3.168 ± 1.498
LysCys: 1.056 ± 0.499
LysAsp: 3.168 ± 1.498
LysGlu: 0.0 ± 0.0
LysPhe: 3.168 ± 1.498
LysGly: 3.168 ± 1.498
LysHis: 1.056 ± 0.499
LysIle: 5.28 ± 2.497
LysLys: 3.168 ± 1.498
LysLeu: 4.224 ± 1.997
LysMet: 0.0 ± 0.0
LysAsn: 1.056 ± 0.499
LysPro: 6.336 ± 2.996
LysGln: 2.112 ± 0.999
LysArg: 4.224 ± 1.997
LysSer: 3.168 ± 1.498
LysThr: 4.224 ± 1.997
LysVal: 1.056 ± 0.499
LysTrp: 1.056 ± 0.499
LysTyr: 1.056 ± 0.499
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 1.056 ± 0.499
LeuCys: 2.112 ± 0.999
LeuAsp: 5.28 ± 1.364
LeuGlu: 2.112 ± 0.999
LeuPhe: 5.28 ± 1.364
LeuGly: 5.28 ± 2.497
LeuHis: 1.056 ± 0.499
LeuIle: 1.056 ± 0.499
LeuLys: 5.28 ± 2.497
LeuLeu: 3.168 ± 1.498
LeuMet: 1.056 ± 0.499
LeuAsn: 2.112 ± 2.862
LeuPro: 4.224 ± 1.863
LeuGln: 8.448 ± 3.995
LeuArg: 4.224 ± 1.997
LeuSer: 4.224 ± 1.997
LeuThr: 6.336 ± 0.864
LeuVal: 4.224 ± 1.997
LeuTrp: 1.056 ± 0.499
LeuTyr: 1.056 ± 0.499
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.056 ± 0.499
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 1.056 ± 0.499
MetPhe: 0.0 ± 0.0
MetGly: 1.056 ± 0.499
MetHis: 1.056 ± 0.499
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 1.056 ± 0.499
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 1.056 ± 0.499
MetGln: 1.056 ± 0.499
MetArg: 1.056 ± 0.499
MetSer: 2.112 ± 2.862
MetThr: 0.0 ± 0.0
MetVal: 2.112 ± 0.999
MetTrp: 0.0 ± 0.0
MetTyr: 1.056 ± 0.499
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.168 ± 2.362
AsnCys: 0.0 ± 0.0
AsnAsp: 3.168 ± 1.498
AsnGlu: 2.112 ± 0.999
AsnPhe: 2.112 ± 0.999
AsnGly: 3.168 ± 2.362
AsnHis: 1.056 ± 3.361
AsnIle: 3.168 ± 1.498
AsnLys: 3.168 ± 1.498
AsnLeu: 3.168 ± 1.498
AsnMet: 0.0 ± 0.0
AsnAsn: 3.168 ± 1.498
AsnPro: 5.28 ± 1.364
AsnGln: 1.056 ± 0.499
AsnArg: 1.056 ± 0.499
AsnSer: 2.112 ± 0.999
AsnThr: 1.056 ± 0.499
AsnVal: 1.056 ± 3.361
AsnTrp: 1.056 ± 3.361
AsnTyr: 2.112 ± 0.999
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 9.504 ± 14.807
ProCys: 2.112 ± 0.999
ProAsp: 4.224 ± 1.997
ProGlu: 4.224 ± 5.723
ProPhe: 4.224 ± 1.997
ProGly: 4.224 ± 1.863
ProHis: 1.056 ± 0.499
ProIle: 2.112 ± 0.999
ProLys: 2.112 ± 0.999
ProLeu: 3.168 ± 1.498
ProMet: 4.224 ± 1.997
ProAsn: 2.112 ± 0.999
ProPro: 10.56 ± 14.308
ProGln: 4.224 ± 1.863
ProArg: 8.448 ± 0.135
ProSer: 4.224 ± 1.997
ProThr: 4.224 ± 1.863
ProVal: 1.056 ± 0.499
ProTrp: 2.112 ± 2.862
ProTyr: 2.112 ± 0.999
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 0.0 ± 0.0
GlnCys: 0.0 ± 0.0
GlnAsp: 4.224 ± 1.997
GlnGlu: 0.0 ± 0.0
GlnPhe: 1.056 ± 0.499
GlnGly: 5.28 ± 2.497
GlnHis: 2.112 ± 0.999
GlnIle: 3.168 ± 2.362
GlnLys: 6.336 ± 2.996
GlnLeu: 4.224 ± 1.997
GlnMet: 0.0 ± 1.731
GlnAsn: 0.0 ± 0.0
GlnPro: 1.056 ± 0.499
GlnGln: 7.392 ± 3.495
GlnArg: 3.168 ± 1.498
GlnSer: 1.056 ± 0.499
GlnThr: 3.168 ± 1.498
GlnVal: 4.224 ± 1.997
GlnTrp: 2.112 ± 0.999
GlnTyr: 4.224 ± 1.997
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 5.28 ± 1.364
ArgCys: 0.0 ± 0.0
ArgAsp: 3.168 ± 2.362
ArgGlu: 4.224 ± 1.997
ArgPhe: 1.056 ± 0.499
ArgGly: 6.336 ± 4.724
ArgHis: 0.0 ± 0.0
ArgIle: 1.056 ± 0.499
ArgLys: 3.168 ± 1.498
ArgLeu: 3.168 ± 1.498
ArgMet: 1.056 ± 0.499
ArgAsn: 6.336 ± 0.864
ArgPro: 5.28 ± 2.497
ArgGln: 3.168 ± 1.498
ArgArg: 32.735 ± 11.62
ArgSer: 6.336 ± 4.724
ArgThr: 3.168 ± 1.498
ArgVal: 4.224 ± 1.997
ArgTrp: 7.392 ± 3.495
ArgTyr: 6.336 ± 0.864
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 3.168 ± 1.498
SerCys: 2.112 ± 2.862
SerAsp: 2.112 ± 0.999
SerGlu: 4.224 ± 1.997
SerPhe: 3.168 ± 1.498
SerGly: 5.28 ± 5.224
SerHis: 3.168 ± 2.362
SerIle: 2.112 ± 0.999
SerLys: 1.056 ± 0.499
SerLeu: 5.28 ± 1.364
SerMet: 0.0 ± 0.0
SerAsn: 3.168 ± 1.498
SerPro: 3.168 ± 1.498
SerGln: 2.112 ± 0.999
SerArg: 2.112 ± 0.999
SerSer: 1.056 ± 0.499
SerThr: 7.392 ± 3.495
SerVal: 2.112 ± 0.999
SerTrp: 0.0 ± 0.0
SerTyr: 3.168 ± 1.498
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.224 ± 1.997
ThrCys: 1.056 ± 0.499
ThrAsp: 2.112 ± 0.999
ThrGlu: 0.0 ± 0.0
ThrPhe: 3.168 ± 1.498
ThrGly: 1.056 ± 0.499
ThrHis: 2.112 ± 0.999
ThrIle: 3.168 ± 1.498
ThrLys: 0.0 ± 0.0
ThrLeu: 6.336 ± 2.996
ThrMet: 0.0 ± 0.0
ThrAsn: 7.392 ± 3.495
ThrPro: 2.112 ± 6.722
ThrGln: 3.168 ± 2.362
ThrArg: 7.392 ± 3.495
ThrSer: 1.056 ± 0.499
ThrThr: 5.28 ± 2.497
ThrVal: 4.224 ± 1.997
ThrTrp: 3.168 ± 1.498
ThrTyr: 2.112 ± 0.999
ThrXaa: 0.0 ± 0.0

Val
ValAla: 0.0 ± 0.0
ValCys: 3.168 ± 2.362
ValAsp: 0.0 ± 0.0
ValGlu: 3.168 ± 2.362
ValPhe: 1.056 ± 0.499
ValGly: 2.112 ± 0.999
ValHis: 1.056 ± 0.499
ValIle: 4.224 ± 1.997
ValLys: 3.168 ± 1.498
ValLeu: 5.28 ± 2.497
ValMet: 0.0 ± 0.0
ValAsn: 2.112 ± 2.862
ValPro: 2.112 ± 2.862
ValGln: 2.112 ± 0.999
ValArg: 5.28 ± 2.497
ValSer: 5.28 ± 2.497
ValThr: 4.224 ± 1.997
ValVal: 4.224 ± 1.997
ValTrp: 1.056 ± 0.499
ValTyr: 3.168 ± 1.498
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.056 ± 0.499
TrpCys: 0.0 ± 0.0
TrpAsp: 1.056 ± 0.499
TrpGlu: 0.0 ± 0.0
TrpPhe: 2.112 ± 2.862
TrpGly: 2.112 ± 0.999
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 1.056 ± 0.499
TrpLeu: 0.0 ± 0.0
TrpMet: 0.0 ± 0.0
TrpAsn: 1.056 ± 0.499
TrpPro: 0.0 ± 0.0
TrpGln: 1.056 ± 0.499
TrpArg: 6.336 ± 0.864
TrpSer: 3.168 ± 1.498
TrpThr: 0.0 ± 0.0
TrpVal: 1.056 ± 0.499
TrpTrp: 2.112 ± 0.999
TrpTyr: 3.168 ± 1.498
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 3.168 ± 6.223
TyrCys: 0.0 ± 0.0
TyrAsp: 3.168 ± 1.498
TyrGlu: 1.056 ± 0.499
TyrPhe: 2.112 ± 0.999
TyrGly: 3.168 ± 1.498
TyrHis: 2.112 ± 0.999
TyrIle: 3.168 ± 1.498
TyrLys: 5.28 ± 2.497
TyrLeu: 3.168 ± 1.498
TyrMet: 0.0 ± 0.0
TyrAsn: 2.112 ± 0.999
TyrPro: 5.28 ± 2.497
TyrGln: 1.056 ± 0.499
TyrArg: 2.112 ± 0.999
TyrSer: 4.224 ± 1.997
TyrThr: 3.168 ± 1.498
TyrVal: 1.056 ± 0.499
TyrTrp: 1.056 ± 0.499
TyrTyr: 2.112 ± 0.999
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 2 proteins (948 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
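One way to picture a protein-level bootstrap is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is taken as the error. This is only an illustration under stated assumptions (one-letter codes, sample standard deviation as the error measure, hypothetical function names); the database's actual procedure is not described on this page and may differ.

```python
import random
from collections import Counter
from statistics import stdev


def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide in per mille over all adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample at the protein level: draw as many proteins as in the input
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return stdev(estimates)


if __name__ == "__main__":
    # Toy sequences, not taken from the proteome described here
    toy = ["AAAA", "CCCC"]
    print(dipeptide_per_mille(toy, "AA"), bootstrap_error(toy, "AA"))
```

With only two proteins, as in this proteome, each resample contains one protein twice or both once, which helps explain why some errors in the table exceed the frequencies themselves.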

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski