Amino acid dipepetide frequency for Torque teno mini virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.37AlaAla: 2.37 ± 2.374
0.0AlaCys: 0.0 ± 0.0
2.37AlaAsp: 2.37 ± 2.374
1.185AlaGlu: 1.185 ± 2.77
1.185AlaPhe: 1.185 ± 0.586
0.0AlaGly: 0.0 ± 0.0
2.37AlaHis: 2.37 ± 1.171
0.0AlaIle: 0.0 ± 0.0
2.37AlaLys: 2.37 ± 2.374
2.37AlaLeu: 2.37 ± 1.574
1.185AlaMet: 1.185 ± 0.586
4.739AlaAsn: 4.739 ± 1.337
1.185AlaPro: 1.185 ± 0.586
1.185AlaGln: 1.185 ± 0.586
1.185AlaArg: 1.185 ± 1.962
1.185AlaSer: 1.185 ± 0.586
5.924AlaThr: 5.924 ± 1.904
0.0AlaVal: 0.0 ± 0.0
0.0AlaTrp: 0.0 ± 0.0
1.185AlaTyr: 1.185 ± 0.586
0.0AlaXaa: 0.0 ± 0.0
Cys
1.185CysAla: 1.185 ± 0.586
0.0CysCys: 0.0 ± 0.0
2.37CysAsp: 2.37 ± 2.374
1.185CysGlu: 1.185 ± 0.586
0.0CysPhe: 0.0 ± 0.0
2.37CysGly: 2.37 ± 2.374
0.0CysHis: 0.0 ± 0.0
1.185CysIle: 1.185 ± 2.77
3.555CysLys: 3.555 ± 1.338
1.185CysLeu: 1.185 ± 2.77
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
3.555CysPro: 3.555 ± 1.338
0.0CysGln: 0.0 ± 0.0
1.185CysArg: 1.185 ± 0.586
1.185CysSer: 1.185 ± 1.962
1.185CysThr: 1.185 ± 1.962
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.37AspAla: 2.37 ± 3.547
1.185AspCys: 1.185 ± 0.586
3.555AspAsp: 3.555 ± 8.311
3.555AspGlu: 3.555 ± 1.338
4.739AspPhe: 4.739 ± 2.342
2.37AspGly: 2.37 ± 2.374
1.185AspHis: 1.185 ± 0.586
3.555AspIle: 3.555 ± 2.971
1.185AspLys: 1.185 ± 0.586
4.739AspLeu: 4.739 ± 1.9
0.0AspMet: 0.0 ± 0.0
1.185AspAsn: 1.185 ± 0.586
8.294AspPro: 8.294 ± 3.929
5.924AspGln: 5.924 ± 4.414
1.185AspArg: 1.185 ± 0.586
5.924AspSer: 5.924 ± 1.573
4.739AspThr: 4.739 ± 2.342
0.0AspVal: 0.0 ± 0.0
0.0AspTrp: 0.0 ± 0.0
2.37AspTyr: 2.37 ± 2.374
0.0AspXaa: 0.0 ± 0.0
Glu
1.185GluAla: 1.185 ± 0.586
0.0GluCys: 0.0 ± 0.0
4.739GluAsp: 4.739 ± 4.441
4.739GluGlu: 4.739 ± 1.9
0.0GluPhe: 0.0 ± 0.0
2.37GluGly: 2.37 ± 1.574
1.185GluHis: 1.185 ± 0.586
1.185GluIle: 1.185 ± 0.586
2.37GluLys: 2.37 ± 2.374
4.739GluLeu: 4.739 ± 5.525
0.0GluMet: 0.0 ± 0.0
1.185GluAsn: 1.185 ± 0.586
2.37GluPro: 2.37 ± 1.171
2.37GluGln: 2.37 ± 1.171
1.185GluArg: 1.185 ± 0.586
3.555GluSer: 3.555 ± 3.509
3.555GluThr: 3.555 ± 1.757
0.0GluVal: 0.0 ± 0.0
1.185GluTrp: 1.185 ± 0.586
1.185GluTyr: 1.185 ± 0.586
0.0GluXaa: 0.0 ± 0.0
Phe
2.37PheAla: 2.37 ± 2.374
1.185PheCys: 1.185 ± 0.586
3.555PheAsp: 3.555 ± 2.069
0.0PheGlu: 0.0 ± 0.0
1.185PhePhe: 1.185 ± 0.586
2.37PheGly: 2.37 ± 2.374
2.37PheHis: 2.37 ± 1.171
4.739PheIle: 4.739 ± 1.337
4.739PheLys: 4.739 ± 2.342
1.185PheLeu: 1.185 ± 2.77
0.0PheMet: 0.0 ± 0.0
2.37PheAsn: 2.37 ± 1.171
5.924PhePro: 5.924 ± 2.928
1.185PheGln: 1.185 ± 0.586
1.185PheArg: 1.185 ± 0.586
1.185PheSer: 1.185 ± 0.586
1.185PheThr: 1.185 ± 0.586
0.0PheVal: 0.0 ± 0.0
3.555PheTrp: 3.555 ± 1.757
3.555PheTyr: 3.555 ± 1.757
0.0PheXaa: 0.0 ± 0.0
Gly
2.37GlyAla: 2.37 ± 2.374
4.739GlyCys: 4.739 ± 4.747
3.555GlyAsp: 3.555 ± 6.055
2.37GlyGlu: 2.37 ± 1.171
2.37GlyPhe: 2.37 ± 2.374
4.739GlyGly: 4.739 ± 2.342
1.185GlyHis: 1.185 ± 0.586
1.185GlyIle: 1.185 ± 2.77
0.0GlyLys: 0.0 ± 0.0
1.185GlyLeu: 1.185 ± 0.586
0.0GlyMet: 0.0 ± 1.963
5.924GlyAsn: 5.924 ± 2.928
3.555GlyPro: 3.555 ± 2.069
0.0GlyGln: 0.0 ± 0.0
1.185GlyArg: 1.185 ± 0.586
1.185GlySer: 1.185 ± 0.586
4.739GlyThr: 4.739 ± 1.9
0.0GlyVal: 0.0 ± 0.0
2.37GlyTrp: 2.37 ± 1.171
2.37GlyTyr: 2.37 ± 1.171
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
4.739HisAsp: 4.739 ± 1.9
1.185HisGlu: 1.185 ± 1.962
2.37HisPhe: 2.37 ± 2.374
1.185HisGly: 1.185 ± 0.586
1.185HisHis: 1.185 ± 1.962
2.37HisIle: 2.37 ± 1.171
1.185HisLys: 1.185 ± 1.962
3.555HisLeu: 3.555 ± 3.509
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
3.555HisPro: 3.555 ± 1.757
0.0HisGln: 0.0 ± 0.0
1.185HisArg: 1.185 ± 1.962
3.555HisSer: 3.555 ± 1.757
2.37HisThr: 2.37 ± 1.171
0.0HisVal: 0.0 ± 0.0
1.185HisTrp: 1.185 ± 0.586
1.185HisTyr: 1.185 ± 0.586
0.0HisXaa: 0.0 ± 0.0
Ile
0.0IleAla: 0.0 ± 0.0
2.37IleCys: 2.37 ± 2.374
3.555IleAsp: 3.555 ± 2.069
1.185IleGlu: 1.185 ± 0.586
5.924IlePhe: 5.924 ± 2.928
4.739IleGly: 4.739 ± 1.9
2.37IleHis: 2.37 ± 1.574
2.37IleIle: 2.37 ± 1.171
4.739IleLys: 4.739 ± 2.342
1.185IleLeu: 1.185 ± 0.586
0.0IleMet: 0.0 ± 0.0
1.185IleAsn: 1.185 ± 0.586
2.37IlePro: 2.37 ± 1.171
4.739IleGln: 4.739 ± 1.337
2.37IleArg: 2.37 ± 1.171
5.924IleSer: 5.924 ± 2.863
5.924IleThr: 5.924 ± 1.904
4.739IleVal: 4.739 ± 2.342
0.0IleTrp: 0.0 ± 0.0
3.555IleTyr: 3.555 ± 1.757
0.0IleXaa: 0.0 ± 0.0
Lys
2.37LysAla: 2.37 ± 1.171
1.185LysCys: 1.185 ± 2.77
4.739LysAsp: 4.739 ± 2.342
1.185LysGlu: 1.185 ± 1.962
4.739LysPhe: 4.739 ± 2.342
1.185LysGly: 1.185 ± 0.586
3.555LysHis: 3.555 ± 2.971
4.739LysIle: 4.739 ± 2.342
8.294LysLys: 8.294 ± 2.429
5.924LysLeu: 5.924 ± 2.928
1.185LysMet: 1.185 ± 0.553
1.185LysAsn: 1.185 ± 0.586
5.924LysPro: 5.924 ± 1.836
3.555LysGln: 3.555 ± 3.509
9.479LysArg: 9.479 ± 4.568
7.109LysSer: 7.109 ± 1.293
9.479LysThr: 9.479 ± 6.297
1.185LysVal: 1.185 ± 2.77
1.185LysTrp: 1.185 ± 0.586
2.37LysTyr: 2.37 ± 2.374
0.0LysXaa: 0.0 ± 0.0
Leu
0.0LeuAla: 0.0 ± 0.0
1.185LeuCys: 1.185 ± 0.586
2.37LeuAsp: 2.37 ± 1.171
3.555LeuGlu: 3.555 ± 5.126
3.555LeuPhe: 3.555 ± 2.971
2.37LeuGly: 2.37 ± 1.171
2.37LeuHis: 2.37 ± 1.171
4.739LeuIle: 4.739 ± 1.337
8.294LeuLys: 8.294 ± 5.099
8.294LeuLeu: 8.294 ± 0.811
3.555LeuMet: 3.555 ± 1.757
8.294LeuAsn: 8.294 ± 2.391
4.739LeuPro: 4.739 ± 2.342
7.109LeuGln: 7.109 ± 3.292
4.739LeuArg: 4.739 ± 1.337
0.0LeuSer: 0.0 ± 0.0
5.924LeuThr: 5.924 ± 1.836
3.555LeuVal: 3.555 ± 2.069
3.555LeuTrp: 3.555 ± 2.069
2.37LeuTyr: 2.37 ± 1.574
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
2.37MetLeu: 2.37 ± 1.171
0.0MetMet: 0.0 ± 0.0
2.37MetAsn: 2.37 ± 1.171
1.185MetPro: 1.185 ± 0.586
3.555MetGln: 3.555 ± 1.338
0.0MetArg: 0.0 ± 0.0
1.185MetSer: 1.185 ± 2.77
1.185MetThr: 1.185 ± 0.586
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
1.185MetTyr: 1.185 ± 0.586
0.0MetXaa: 0.0 ± 0.0
Asn
3.555AsnAla: 3.555 ± 1.757
0.0AsnCys: 0.0 ± 0.0
2.37AsnAsp: 2.37 ± 1.171
1.185AsnGlu: 1.185 ± 1.962
2.37AsnPhe: 2.37 ± 1.171
2.37AsnGly: 2.37 ± 2.374
0.0AsnHis: 0.0 ± 0.0
4.739AsnIle: 4.739 ± 2.342
3.555AsnLys: 3.555 ± 1.757
3.555AsnLeu: 3.555 ± 1.757
1.185AsnMet: 1.185 ± 1.328
3.555AsnAsn: 3.555 ± 3.509
8.294AsnPro: 8.294 ± 2.429
3.555AsnGln: 3.555 ± 1.338
1.185AsnArg: 1.185 ± 0.586
1.185AsnSer: 1.185 ± 1.962
2.37AsnThr: 2.37 ± 1.171
0.0AsnVal: 0.0 ± 0.0
2.37AsnTrp: 2.37 ± 1.171
3.555AsnTyr: 3.555 ± 1.757
0.0AsnXaa: 0.0 ± 0.0
Pro
5.924ProAla: 5.924 ± 1.836
2.37ProCys: 2.37 ± 1.574
3.555ProAsp: 3.555 ± 1.757
2.37ProGlu: 2.37 ± 1.171
1.185ProPhe: 1.185 ± 0.586
2.37ProGly: 2.37 ± 2.374
1.185ProHis: 1.185 ± 0.586
4.739ProIle: 4.739 ± 2.342
9.479ProLys: 9.479 ± 2.791
7.109ProLeu: 7.109 ± 3.513
2.37ProMet: 2.37 ± 1.171
3.555ProAsn: 3.555 ± 1.338
5.924ProPro: 5.924 ± 2.928
1.185ProGln: 1.185 ± 0.586
4.739ProArg: 4.739 ± 2.342
5.924ProSer: 5.924 ± 1.573
3.555ProThr: 3.555 ± 2.971
3.555ProVal: 3.555 ± 1.338
1.185ProTrp: 1.185 ± 0.586
1.185ProTyr: 1.185 ± 0.586
0.0ProXaa: 0.0 ± 0.0
Gln
0.0GlnAla: 0.0 ± 0.0
3.555GlnCys: 3.555 ± 5.887
2.37GlnAsp: 2.37 ± 1.171
1.185GlnGlu: 1.185 ± 0.586
2.37GlnPhe: 2.37 ± 1.171
2.37GlnGly: 2.37 ± 2.374
0.0GlnHis: 0.0 ± 0.0
4.739GlnIle: 4.739 ± 1.337
3.555GlnLys: 3.555 ± 2.971
5.924GlnLeu: 5.924 ± 1.573
0.0GlnMet: 0.0 ± 0.0
3.555GlnAsn: 3.555 ± 1.757
1.185GlnPro: 1.185 ± 0.586
7.109GlnGln: 7.109 ± 2.676
0.0GlnArg: 0.0 ± 0.0
2.37GlnSer: 2.37 ± 1.171
4.739GlnThr: 4.739 ± 1.337
4.739GlnVal: 4.739 ± 2.342
2.37GlnTrp: 2.37 ± 2.374
3.555GlnTyr: 3.555 ± 1.338
0.0GlnXaa: 0.0 ± 0.0
Arg
1.185ArgAla: 1.185 ± 1.962
0.0ArgCys: 0.0 ± 0.0
1.185ArgAsp: 1.185 ± 0.586
0.0ArgGlu: 0.0 ± 0.0
1.185ArgPhe: 1.185 ± 0.586
1.185ArgGly: 1.185 ± 0.586
3.555ArgHis: 3.555 ± 1.338
2.37ArgIle: 2.37 ± 1.171
7.109ArgLys: 7.109 ± 4.723
8.294ArgLeu: 8.294 ± 0.811
0.0ArgMet: 0.0 ± 0.0
0.0ArgAsn: 0.0 ± 0.0
4.739ArgPro: 4.739 ± 2.342
3.555ArgGln: 3.555 ± 1.757
17.773ArgArg: 17.773 ± 6.858
0.0ArgSer: 0.0 ± 0.0
3.555ArgThr: 3.555 ± 1.338
0.0ArgVal: 0.0 ± 0.0
2.37ArgTrp: 2.37 ± 1.171
3.555ArgTyr: 3.555 ± 1.757
0.0ArgXaa: 0.0 ± 0.0
Ser
1.185SerAla: 1.185 ± 0.586
0.0SerCys: 0.0 ± 0.0
3.555SerAsp: 3.555 ± 1.338
4.739SerGlu: 4.739 ± 3.148
1.185SerPhe: 1.185 ± 0.586
5.924SerGly: 5.924 ± 1.904
2.37SerHis: 2.37 ± 2.374
5.924SerIle: 5.924 ± 2.928
4.739SerLys: 4.739 ± 1.337
5.924SerLeu: 5.924 ± 1.836
1.185SerMet: 1.185 ± 0.586
1.185SerAsn: 1.185 ± 1.962
3.555SerPro: 3.555 ± 3.509
3.555SerGln: 3.555 ± 1.338
0.0SerArg: 0.0 ± 0.0
15.403SerSer: 15.403 ± 18.35
8.294SerThr: 8.294 ± 4.422
0.0SerVal: 0.0 ± 0.0
1.185SerTrp: 1.185 ± 0.586
2.37SerTyr: 2.37 ± 1.574
0.0SerXaa: 0.0 ± 0.0
Thr
4.739ThrAla: 4.739 ± 2.342
2.37ThrCys: 2.37 ± 1.171
7.109ThrAsp: 7.109 ± 2.081
5.924ThrGlu: 5.924 ± 2.928
3.555ThrPhe: 3.555 ± 2.069
3.555ThrGly: 3.555 ± 2.069
3.555ThrHis: 3.555 ± 3.509
3.555ThrIle: 3.555 ± 1.757
7.109ThrLys: 7.109 ± 2.081
4.739ThrLeu: 4.739 ± 5.464
0.0ThrMet: 0.0 ± 0.0
5.924ThrAsn: 5.924 ± 2.863
4.739ThrPro: 4.739 ± 1.337
2.37ThrGln: 2.37 ± 1.171
2.37ThrArg: 2.37 ± 1.574
10.664ThrSer: 10.664 ± 4.052
9.479ThrThr: 9.479 ± 3.581
1.185ThrVal: 1.185 ± 1.962
1.185ThrTrp: 1.185 ± 0.586
2.37ThrTyr: 2.37 ± 1.171
0.0ThrXaa: 0.0 ± 0.0
Val
0.0ValAla: 0.0 ± 0.0
0.0ValCys: 0.0 ± 0.0
0.0ValAsp: 0.0 ± 0.0
2.37ValGlu: 2.37 ± 2.374
2.37ValPhe: 2.37 ± 1.171
0.0ValGly: 0.0 ± 0.0
0.0ValHis: 0.0 ± 0.0
3.555ValIle: 3.555 ± 1.757
1.185ValLys: 1.185 ± 1.962
1.185ValLeu: 1.185 ± 2.77
0.0ValMet: 0.0 ± 0.0
1.185ValAsn: 1.185 ± 0.586
2.37ValPro: 2.37 ± 1.171
1.185ValGln: 1.185 ± 0.586
2.37ValArg: 2.37 ± 1.171
0.0ValSer: 0.0 ± 0.0
3.555ValThr: 3.555 ± 1.338
2.37ValVal: 2.37 ± 1.171
0.0ValTrp: 0.0 ± 0.0
0.0ValTyr: 0.0 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
1.185TrpAla: 1.185 ± 0.586
0.0TrpCys: 0.0 ± 0.0
1.185TrpAsp: 1.185 ± 0.586
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
2.37TrpGly: 2.37 ± 1.171
1.185TrpHis: 1.185 ± 0.586
0.0TrpIle: 0.0 ± 0.0
1.185TrpLys: 1.185 ± 2.77
4.739TrpLeu: 4.739 ± 1.9
1.185TrpMet: 1.185 ± 0.586
1.185TrpAsn: 1.185 ± 0.586
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.185TrpArg: 1.185 ± 0.586
2.37TrpSer: 2.37 ± 1.171
2.37TrpThr: 2.37 ± 1.171
1.185TrpVal: 1.185 ± 0.586
1.185TrpTrp: 1.185 ± 0.586
3.555TrpTyr: 3.555 ± 1.757
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.0TyrAla: 0.0 ± 0.0
0.0TyrCys: 0.0 ± 0.0
1.185TyrAsp: 1.185 ± 0.586
1.185TyrGlu: 1.185 ± 0.586
3.555TyrPhe: 3.555 ± 1.757
2.37TyrGly: 2.37 ± 2.374
1.185TyrHis: 1.185 ± 0.586
3.555TyrIle: 3.555 ± 2.069
5.924TyrLys: 5.924 ± 2.863
2.37TyrLeu: 2.37 ± 1.171
0.0TyrMet: 0.0 ± 0.0
2.37TyrAsn: 2.37 ± 1.171
0.0TyrPro: 0.0 ± 0.0
3.555TyrGln: 3.555 ± 1.757
7.109TyrArg: 7.109 ± 3.513
2.37TyrSer: 2.37 ± 1.574
2.37TyrThr: 2.37 ± 1.171
1.185TyrVal: 1.185 ± 0.586
1.185TyrTrp: 1.185 ± 0.586
5.924TyrTyr: 5.924 ± 2.928
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (845 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski