Amino acid dipepetide frequency for Torque teno equus virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.268AlaAla: 3.268 ± 1.748
1.089AlaCys: 1.089 ± 2.024
3.268AlaAsp: 3.268 ± 1.748
1.089AlaGlu: 1.089 ± 0.583
3.268AlaPhe: 3.268 ± 4.86
2.179AlaGly: 2.179 ± 1.764
2.179AlaHis: 2.179 ± 1.764
2.179AlaIle: 2.179 ± 1.419
4.357AlaLys: 4.357 ± 2.839
3.268AlaLeu: 3.268 ± 2.286
1.089AlaMet: 1.089 ± 2.596
1.089AlaAsn: 1.089 ± 0.583
3.268AlaPro: 3.268 ± 1.676
2.179AlaGln: 2.179 ± 2.29
3.268AlaArg: 3.268 ± 1.748
2.179AlaSer: 2.179 ± 1.764
5.447AlaThr: 5.447 ± 1.876
2.179AlaVal: 2.179 ± 2.29
3.268AlaTrp: 3.268 ± 1.748
1.089AlaTyr: 1.089 ± 0.583
0.0AlaXaa: 0.0 ± 0.0
Cys
1.089CysAla: 1.089 ± 0.583
1.089CysCys: 1.089 ± 2.596
1.089CysAsp: 1.089 ± 0.583
0.0CysGlu: 0.0 ± 0.0
1.089CysPhe: 1.089 ± 2.024
1.089CysGly: 1.089 ± 0.583
1.089CysHis: 1.089 ± 0.583
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
3.268CysLeu: 3.268 ± 3.195
0.0CysMet: 0.0 ± 0.0
1.089CysAsn: 1.089 ± 0.583
2.179CysPro: 2.179 ± 4.048
0.0CysGln: 0.0 ± 0.0
1.089CysArg: 1.089 ± 0.583
1.089CysSer: 1.089 ± 1.775
1.089CysThr: 1.089 ± 2.596
1.089CysVal: 1.089 ± 2.596
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
0.0AspAla: 0.0 ± 0.0
0.0AspCys: 0.0 ± 0.0
3.268AspAsp: 3.268 ± 1.748
2.179AspGlu: 2.179 ± 1.419
0.0AspPhe: 0.0 ± 0.0
2.179AspGly: 2.179 ± 2.29
1.089AspHis: 1.089 ± 0.583
5.447AspIle: 5.447 ± 2.571
2.179AspLys: 2.179 ± 1.165
5.447AspLeu: 5.447 ± 1.531
1.089AspMet: 1.089 ± 0.583
1.089AspAsn: 1.089 ± 0.583
4.357AspPro: 4.357 ± 1.784
2.179AspGln: 2.179 ± 1.419
2.179AspArg: 2.179 ± 1.165
4.357AspSer: 4.357 ± 2.839
4.357AspThr: 4.357 ± 1.334
2.179AspVal: 2.179 ± 1.419
6.536AspTrp: 6.536 ± 2.064
3.268AspTyr: 3.268 ± 1.748
0.0AspXaa: 0.0 ± 0.0
Glu
0.0GluAla: 0.0 ± 0.0
0.0GluCys: 0.0 ± 0.0
5.447GluAsp: 5.447 ± 4.567
4.357GluGlu: 4.357 ± 1.334
1.089GluPhe: 1.089 ± 1.775
2.179GluGly: 2.179 ± 1.419
0.0GluHis: 0.0 ± 0.0
2.179GluIle: 2.179 ± 1.419
4.357GluLys: 4.357 ± 1.334
2.179GluLeu: 2.179 ± 1.165
2.179GluMet: 2.179 ± 1.222
1.089GluAsn: 1.089 ± 1.775
2.179GluPro: 2.179 ± 1.165
0.0GluGln: 0.0 ± 0.0
0.0GluArg: 0.0 ± 0.0
6.536GluSer: 6.536 ± 2.064
6.536GluThr: 6.536 ± 2.064
0.0GluVal: 0.0 ± 0.0
2.179GluTrp: 2.179 ± 1.165
3.268GluTyr: 3.268 ± 1.748
0.0GluXaa: 0.0 ± 0.0
Phe
3.268PheAla: 3.268 ± 1.676
2.179PheCys: 2.179 ± 2.29
2.179PheAsp: 2.179 ± 1.165
0.0PheGlu: 0.0 ± 0.0
0.0PhePhe: 0.0 ± 0.0
2.179PheGly: 2.179 ± 2.29
0.0PheHis: 0.0 ± 0.0
2.179PheIle: 2.179 ± 1.165
1.089PheLys: 1.089 ± 2.024
2.179PheLeu: 2.179 ± 4.048
1.089PheMet: 1.089 ± 0.551
1.089PheAsn: 1.089 ± 0.583
0.0PhePro: 0.0 ± 0.0
2.179PheGln: 2.179 ± 1.165
1.089PheArg: 1.089 ± 0.583
3.268PheSer: 3.268 ± 1.248
2.179PheThr: 2.179 ± 1.419
0.0PheVal: 0.0 ± 0.0
1.089PheTrp: 1.089 ± 0.583
1.089PheTyr: 1.089 ± 2.024
0.0PheXaa: 0.0 ± 0.0
Gly
1.089GlyAla: 1.089 ± 0.583
2.179GlyCys: 2.179 ± 3.609
3.268GlyAsp: 3.268 ± 1.676
1.089GlyGlu: 1.089 ± 1.775
4.357GlyPhe: 4.357 ± 1.784
7.625GlyGly: 7.625 ± 2.82
3.268GlyHis: 3.268 ± 1.676
5.447GlyIle: 5.447 ± 2.913
2.179GlyLys: 2.179 ± 1.165
4.357GlyLeu: 4.357 ± 2.071
0.0GlyMet: 0.0 ± 0.0
2.179GlyAsn: 2.179 ± 1.165
4.357GlyPro: 4.357 ± 2.071
3.268GlyGln: 3.268 ± 1.248
6.536GlyArg: 6.536 ± 3.496
8.715GlySer: 8.715 ± 6.069
3.268GlyThr: 3.268 ± 1.676
4.357GlyVal: 4.357 ± 2.33
2.179GlyTrp: 2.179 ± 1.419
3.268GlyTyr: 3.268 ± 1.748
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
1.089HisCys: 1.089 ± 0.583
2.179HisAsp: 2.179 ± 1.165
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
1.089HisGly: 1.089 ± 0.583
1.089HisHis: 1.089 ± 0.583
1.089HisIle: 1.089 ± 0.583
3.268HisLys: 3.268 ± 1.248
1.089HisLeu: 1.089 ± 2.596
2.179HisMet: 2.179 ± 2.29
0.0HisAsn: 0.0 ± 0.0
3.268HisPro: 3.268 ± 1.676
1.089HisGln: 1.089 ± 0.583
3.268HisArg: 3.268 ± 1.676
0.0HisSer: 0.0 ± 0.0
0.0HisThr: 0.0 ± 0.0
0.0HisVal: 0.0 ± 0.0
1.089HisTrp: 1.089 ± 0.583
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.179IleAla: 2.179 ± 1.165
1.089IleCys: 1.089 ± 0.583
4.357IleAsp: 4.357 ± 1.334
3.268IleGlu: 3.268 ± 1.248
3.268IlePhe: 3.268 ± 1.748
2.179IleGly: 2.179 ± 1.165
1.089IleHis: 1.089 ± 0.583
2.179IleIle: 2.179 ± 1.165
3.268IleLys: 3.268 ± 1.676
1.089IleLeu: 1.089 ± 0.583
0.0IleMet: 0.0 ± 0.0
0.0IleAsn: 0.0 ± 0.0
0.0IlePro: 0.0 ± 0.0
2.179IleGln: 2.179 ± 1.419
2.179IleArg: 2.179 ± 1.165
1.089IleSer: 1.089 ± 0.583
3.268IleThr: 3.268 ± 1.748
2.179IleVal: 2.179 ± 2.29
1.089IleTrp: 1.089 ± 0.583
1.089IleTyr: 1.089 ± 0.583
0.0IleXaa: 0.0 ± 0.0
Lys
6.536LysAla: 6.536 ± 2.064
0.0LysCys: 0.0 ± 0.0
0.0LysAsp: 0.0 ± 0.0
6.536LysGlu: 6.536 ± 2.064
1.089LysPhe: 1.089 ± 0.583
1.089LysGly: 1.089 ± 0.583
2.179LysHis: 2.179 ± 1.165
1.089LysIle: 1.089 ± 0.583
5.447LysLys: 5.447 ± 1.637
6.536LysLeu: 6.536 ± 2.496
0.0LysMet: 0.0 ± 0.0
4.357LysAsn: 4.357 ± 3.694
5.447LysPro: 5.447 ± 2.913
3.268LysGln: 3.268 ± 1.748
11.983LysArg: 11.983 ± 3.187
4.357LysSer: 4.357 ± 1.784
2.179LysThr: 2.179 ± 1.165
1.089LysVal: 1.089 ± 1.775
3.268LysTrp: 3.268 ± 1.248
1.089LysTyr: 1.089 ± 0.583
0.0LysXaa: 0.0 ± 0.0
Leu
5.447LeuAla: 5.447 ± 5.823
1.089LeuCys: 1.089 ± 2.024
2.179LeuAsp: 2.179 ± 1.764
2.179LeuGlu: 2.179 ± 1.165
2.179LeuPhe: 2.179 ± 1.764
7.625LeuGly: 7.625 ± 4.151
1.089LeuHis: 1.089 ± 0.583
1.089LeuIle: 1.089 ± 1.775
2.179LeuLys: 2.179 ± 1.419
3.268LeuLeu: 3.268 ± 2.286
2.179LeuMet: 2.179 ± 1.419
3.268LeuAsn: 3.268 ± 1.748
1.089LeuPro: 1.089 ± 0.583
5.447LeuGln: 5.447 ± 6.697
4.357LeuArg: 4.357 ± 5.766
2.179LeuSer: 2.179 ± 3.609
6.536LeuThr: 6.536 ± 2.414
7.625LeuVal: 7.625 ± 2.577
2.179LeuTrp: 2.179 ± 1.165
3.268LeuTyr: 3.268 ± 1.748
0.0LeuXaa: 0.0 ± 0.0
Met
2.179MetAla: 2.179 ± 1.165
0.0MetCys: 0.0 ± 0.0
1.089MetAsp: 1.089 ± 2.596
1.089MetGlu: 1.089 ± 0.583
1.089MetPhe: 1.089 ± 0.583
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
1.089MetLys: 1.089 ± 0.583
1.089MetLeu: 1.089 ± 0.583
0.0MetMet: 0.0 ± 0.0
1.089MetAsn: 1.089 ± 0.583
1.089MetPro: 1.089 ± 2.596
1.089MetGln: 1.089 ± 0.583
1.089MetArg: 1.089 ± 0.583
3.268MetSer: 3.268 ± 3.195
2.179MetThr: 2.179 ± 3.224
2.179MetVal: 2.179 ± 2.29
0.0MetTrp: 0.0 ± 0.0
2.179MetTyr: 2.179 ± 1.419
0.0MetXaa: 0.0 ± 0.0
Asn
3.268AsnAla: 3.268 ± 1.748
0.0AsnCys: 0.0 ± 0.0
1.089AsnAsp: 1.089 ± 0.583
1.089AsnGlu: 1.089 ± 0.583
1.089AsnPhe: 1.089 ± 1.775
3.268AsnGly: 3.268 ± 2.724
0.0AsnHis: 0.0 ± 0.0
0.0AsnIle: 0.0 ± 0.0
6.536AsnLys: 6.536 ± 2.064
1.089AsnLeu: 1.089 ± 2.024
0.0AsnMet: 0.0 ± 0.0
2.179AsnAsn: 2.179 ± 1.165
3.268AsnPro: 3.268 ± 1.748
2.179AsnGln: 2.179 ± 1.419
4.357AsnArg: 4.357 ± 2.839
3.268AsnSer: 3.268 ± 1.248
1.089AsnThr: 1.089 ± 0.583
1.089AsnVal: 1.089 ± 1.775
0.0AsnTrp: 0.0 ± 0.0
2.179AsnTyr: 2.179 ± 1.165
0.0AsnXaa: 0.0 ± 0.0
Pro
3.268ProAla: 3.268 ± 1.676
2.179ProCys: 2.179 ± 1.419
1.089ProAsp: 1.089 ± 0.583
2.179ProGlu: 2.179 ± 1.165
1.089ProPhe: 1.089 ± 0.583
4.357ProGly: 4.357 ± 2.33
1.089ProHis: 1.089 ± 2.596
0.0ProIle: 0.0 ± 0.0
6.536ProLys: 6.536 ± 2.442
5.447ProLeu: 5.447 ± 3.035
0.0ProMet: 0.0 ± 0.0
2.179ProAsn: 2.179 ± 1.419
6.536ProPro: 6.536 ± 1.387
0.0ProGln: 0.0 ± 0.0
3.268ProArg: 3.268 ± 1.676
6.536ProSer: 6.536 ± 3.351
3.268ProThr: 3.268 ± 1.676
5.447ProVal: 5.447 ± 1.876
0.0ProTrp: 0.0 ± 0.0
1.089ProTyr: 1.089 ± 0.583
0.0ProXaa: 0.0 ± 0.0
Gln
1.089GlnAla: 1.089 ± 1.775
0.0GlnCys: 0.0 ± 0.0
1.089GlnAsp: 1.089 ± 1.775
2.179GlnGlu: 2.179 ± 1.165
0.0GlnPhe: 0.0 ± 0.0
3.268GlnGly: 3.268 ± 1.748
1.089GlnHis: 1.089 ± 0.583
1.089GlnIle: 1.089 ± 0.583
2.179GlnLys: 2.179 ± 1.419
1.089GlnLeu: 1.089 ± 0.583
1.089GlnMet: 1.089 ± 1.987
3.268GlnAsn: 3.268 ± 2.104
4.357GlnPro: 4.357 ± 2.839
2.179GlnGln: 2.179 ± 1.165
3.268GlnArg: 3.268 ± 3.161
4.357GlnSer: 4.357 ± 2.265
4.357GlnThr: 4.357 ± 2.839
5.447GlnVal: 5.447 ± 2.609
2.179GlnTrp: 2.179 ± 1.165
1.089GlnTyr: 1.089 ± 0.583
0.0GlnXaa: 0.0 ± 0.0
Arg
2.179ArgAla: 2.179 ± 2.773
1.089ArgCys: 1.089 ± 0.583
1.089ArgAsp: 1.089 ± 0.583
1.089ArgGlu: 1.089 ± 1.775
3.268ArgPhe: 3.268 ± 1.676
8.715ArgGly: 8.715 ± 3.386
2.179ArgHis: 2.179 ± 1.419
3.268ArgIle: 3.268 ± 1.748
9.804ArgLys: 9.804 ± 2.929
4.357ArgLeu: 4.357 ± 1.857
1.089ArgMet: 1.089 ± 0.583
2.179ArgAsn: 2.179 ± 1.419
4.357ArgPro: 4.357 ± 1.857
4.357ArgGln: 4.357 ± 1.334
26.144ArgArg: 26.144 ± 12.378
5.447ArgSer: 5.447 ± 3.391
0.0ArgThr: 0.0 ± 0.0
3.268ArgVal: 3.268 ± 1.748
2.179ArgTrp: 2.179 ± 1.165
3.268ArgTyr: 3.268 ± 1.748
0.0ArgXaa: 0.0 ± 0.0
Ser
7.625SerAla: 7.625 ± 10.233
1.089SerCys: 1.089 ± 2.024
6.536SerAsp: 6.536 ± 1.387
9.804SerGlu: 9.804 ± 3.744
1.089SerPhe: 1.089 ± 2.024
7.625SerGly: 7.625 ± 5.14
2.179SerHis: 2.179 ± 2.29
2.179SerIle: 2.179 ± 1.165
3.268SerLys: 3.268 ± 1.748
5.447SerLeu: 5.447 ± 5.049
1.089SerMet: 1.089 ± 2.596
1.089SerAsn: 1.089 ± 0.583
4.357SerPro: 4.357 ± 2.265
3.268SerGln: 3.268 ± 2.724
1.089SerArg: 1.089 ± 0.583
6.536SerSer: 6.536 ± 10.223
9.804SerThr: 9.804 ± 3.744
4.357SerVal: 4.357 ± 2.841
2.179SerTrp: 2.179 ± 1.419
3.268SerTyr: 3.268 ± 2.104
0.0SerXaa: 0.0 ± 0.0
Thr
3.268ThrAla: 3.268 ± 2.286
1.089ThrCys: 1.089 ± 2.596
3.268ThrAsp: 3.268 ± 1.248
5.447ThrGlu: 5.447 ± 4.567
1.089ThrPhe: 1.089 ± 0.583
5.447ThrGly: 5.447 ± 1.637
2.179ThrHis: 2.179 ± 1.165
3.268ThrIle: 3.268 ± 1.748
3.268ThrLys: 3.268 ± 1.748
8.715ThrLeu: 8.715 ± 2.918
3.268ThrMet: 3.268 ± 1.748
3.268ThrAsn: 3.268 ± 1.248
1.089ThrPro: 1.089 ± 2.024
6.536ThrGln: 6.536 ± 3.008
2.179ThrArg: 2.179 ± 1.764
11.983ThrSer: 11.983 ± 3.069
7.625ThrThr: 7.625 ± 2.517
2.179ThrVal: 2.179 ± 1.165
3.268ThrTrp: 3.268 ± 1.748
2.179ThrTyr: 2.179 ± 1.165
0.0ThrXaa: 0.0 ± 0.0
Val
2.179ValAla: 2.179 ± 1.165
1.089ValCys: 1.089 ± 0.583
1.089ValAsp: 1.089 ± 0.583
1.089ValGlu: 1.089 ± 0.583
1.089ValPhe: 1.089 ± 0.583
3.268ValGly: 3.268 ± 2.104
0.0ValHis: 0.0 ± 0.0
3.268ValIle: 3.268 ± 1.748
3.268ValLys: 3.268 ± 1.248
2.179ValLeu: 2.179 ± 3.224
2.179ValMet: 2.179 ± 2.29
2.179ValAsn: 2.179 ± 3.55
1.089ValPro: 1.089 ± 0.583
2.179ValGln: 2.179 ± 1.165
5.447ValArg: 5.447 ± 1.637
4.357ValSer: 4.357 ± 7.448
7.625ValThr: 7.625 ± 4.425
2.179ValVal: 2.179 ± 2.29
0.0ValTrp: 0.0 ± 0.0
3.268ValTyr: 3.268 ± 2.104
0.0ValXaa: 0.0 ± 0.0
Trp
1.089TrpAla: 1.089 ± 0.583
0.0TrpCys: 0.0 ± 0.0
3.268TrpAsp: 3.268 ± 1.248
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
5.447TrpGly: 5.447 ± 2.913
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
1.089TrpLys: 1.089 ± 1.775
2.179TrpLeu: 2.179 ± 1.165
1.089TrpMet: 1.089 ± 0.583
3.268TrpAsn: 3.268 ± 1.748
2.179TrpPro: 2.179 ± 1.165
1.089TrpGln: 1.089 ± 0.583
2.179TrpArg: 2.179 ± 1.165
3.268TrpSer: 3.268 ± 1.248
4.357TrpThr: 4.357 ± 1.334
1.089TrpVal: 1.089 ± 0.583
2.179TrpTrp: 2.179 ± 1.165
3.268TrpTyr: 3.268 ± 1.748
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.179TyrAla: 2.179 ± 1.165
1.089TyrCys: 1.089 ± 0.583
6.536TyrAsp: 6.536 ± 2.46
1.089TyrGlu: 1.089 ± 0.583
2.179TyrPhe: 2.179 ± 1.165
2.179TyrGly: 2.179 ± 1.165
0.0TyrHis: 0.0 ± 0.0
1.089TyrIle: 1.089 ± 0.583
2.179TyrLys: 2.179 ± 1.165
2.179TyrLeu: 2.179 ± 1.165
1.089TyrMet: 1.089 ± 1.442
1.089TyrAsn: 1.089 ± 0.583
1.089TyrPro: 1.089 ± 0.583
0.0TyrGln: 0.0 ± 0.0
4.357TyrArg: 4.357 ± 1.334
1.089TyrSer: 1.089 ± 2.596
5.447TyrThr: 5.447 ± 2.913
1.089TyrVal: 1.089 ± 0.583
2.179TyrTrp: 2.179 ± 1.165
1.089TyrTyr: 1.089 ± 0.583
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (919 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski