Amino acid dipeptide frequency for Simian torque teno virus 34

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
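As a minimal sketch of the idea, the snippet below counts ordered pairs of adjacent residues and reports them in per mille. It is only an illustration: the function name, the toy sequences, the use of one-letter codes (so "RR" stands for ArgArg), and the choice to count pairs within each protein separately are assumptions, not the procedure documented for this database.

```python
from collections import Counter

def dipeptide_frequencies(sequences):
    """Return per-mille frequencies of ordered adjacent residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: pairs are counted inside each protein only,
        # never across the boundary between two proteins.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not the proteome of this virus).
toy = ["RRPR", "AR"]
freqs = dipeptide_frequencies(toy)
```

With these toy sequences there are four pairs in total (RR, RP, PR, AR), so each one comes out at 250 ‰.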

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
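The uniform baseline and the unit conversion can be sketched as follows. The four observed values are copied from the table below. The `classify` helper and its strict greater/less comparison against 2.5 ‰ are assumptions made for illustration; the exact rule used to colour the cells is not stated here.

```python
N_STANDARD = 20
# Uniform expectation over the 400 standard pairs: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / (N_STANDARD ** 2)

def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation (hypothetical rule)."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"

# Values in per mille, taken from the table in this document.
observed = {"ArgArg": 25.183, "ProPro": 12.998, "AlaIle": 2.437, "AlaMet": 0.0}

labels = {pair: classify(v) for pair, v in observed.items()}
# Per mille to plain fraction: multiply by 10^-3.
fractions = {pair: v * 1e-3 for pair, v in observed.items()}
```

Under this rule ArgArg and ProPro count as overrepresented, while AlaIle and AlaMet count as underrepresented.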

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
8.123AlaAla: 8.123 ± 6.807
0.812AlaCys: 0.812 ± 0.488
3.249AlaAsp: 3.249 ± 0.997
5.686AlaGlu: 5.686 ± 4.694
0.812AlaPhe: 0.812 ± 1.216
6.499AlaGly: 6.499 ± 5.948
1.625AlaHis: 1.625 ± 0.863
2.437AlaIle: 2.437 ± 0.948
1.625AlaLys: 1.625 ± 1.007
1.625AlaLeu: 1.625 ± 2.066
0.0AlaMet: 0.0 ± 0.0
0.812AlaAsn: 0.812 ± 1.216
8.936AlaPro: 8.936 ± 2.379
0.0AlaGln: 0.0 ± 0.0
4.874AlaArg: 4.874 ± 1.934
4.874AlaSer: 4.874 ± 2.968
3.249AlaThr: 3.249 ± 1.287
4.062AlaVal: 4.062 ± 3.872
0.0AlaTrp: 0.0 ± 0.0
0.812AlaTyr: 0.812 ± 1.332
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.812CysCys: 0.812 ± 0.488
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.812CysPhe: 0.812 ± 1.033
1.625CysGly: 1.625 ± 2.066
0.812CysHis: 0.812 ± 0.488
0.0CysIle: 0.0 ± 0.0
0.812CysLys: 0.812 ± 0.488
3.249CysLeu: 3.249 ± 1.951
0.812CysMet: 0.812 ± 1.085
1.625CysAsn: 1.625 ± 0.976
1.625CysPro: 1.625 ± 1.118
0.0CysGln: 0.0 ± 0.0
0.812CysArg: 0.812 ± 1.332
0.812CysSer: 0.812 ± 0.488
0.812CysThr: 0.812 ± 0.488
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.812CysTyr: 0.812 ± 1.332
0.0CysXaa: 0.0 ± 0.0
Asp
2.437AspAla: 2.437 ± 1.265
0.0AspCys: 0.0 ± 0.0
1.625AspAsp: 1.625 ± 0.863
1.625AspGlu: 1.625 ± 0.976
1.625AspPhe: 1.625 ± 1.118
1.625AspGly: 1.625 ± 2.066
0.812AspHis: 0.812 ± 0.488
2.437AspIle: 2.437 ± 0.948
0.0AspLys: 0.0 ± 0.0
4.062AspLeu: 4.062 ± 1.575
0.812AspMet: 0.812 ± 1.033
0.812AspAsn: 0.812 ± 0.488
7.311AspPro: 7.311 ± 3.079
0.0AspGln: 0.0 ± 0.0
2.437AspArg: 2.437 ± 2.41
3.249AspSer: 3.249 ± 3.382
3.249AspThr: 3.249 ± 1.951
1.625AspVal: 1.625 ± 1.007
2.437AspTrp: 2.437 ± 1.177
2.437AspTyr: 2.437 ± 0.948
0.0AspXaa: 0.0 ± 0.0
Glu
4.062GluAla: 4.062 ± 4.244
0.812GluCys: 0.812 ± 0.488
4.874GluAsp: 4.874 ± 2.62
5.686GluGlu: 5.686 ± 2.382
0.812GluPhe: 0.812 ± 1.033
1.625GluGly: 1.625 ± 1.118
0.812GluHis: 0.812 ± 0.488
1.625GluIle: 1.625 ± 0.863
4.062GluLys: 4.062 ± 1.962
4.062GluLeu: 4.062 ± 1.592
1.625GluMet: 1.625 ± 0.975
0.812GluAsn: 0.812 ± 1.332
3.249GluPro: 3.249 ± 1.726
0.812GluGln: 0.812 ± 1.332
4.062GluArg: 4.062 ± 2.921
3.249GluSer: 3.249 ± 0.916
6.499GluThr: 6.499 ± 2.846
3.249GluVal: 3.249 ± 1.277
0.812GluTrp: 0.812 ± 1.216
0.812GluTyr: 0.812 ± 0.488
0.0GluXaa: 0.0 ± 0.0
Phe
1.625PheAla: 1.625 ± 0.976
1.625PheCys: 1.625 ± 0.863
1.625PheAsp: 1.625 ± 1.905
0.0PheGlu: 0.0 ± 0.0
0.0PhePhe: 0.0 ± 0.0
2.437PheGly: 2.437 ± 1.014
1.625PheHis: 1.625 ± 0.863
0.0PheIle: 0.0 ± 0.0
0.0PheLys: 0.0 ± 0.0
1.625PheLeu: 1.625 ± 1.639
1.625PheMet: 1.625 ± 0.976
0.812PheAsn: 0.812 ± 0.488
0.812PhePro: 0.812 ± 0.488
1.625PheGln: 1.625 ± 0.976
4.062PheArg: 4.062 ± 1.592
5.686PheSer: 5.686 ± 1.418
2.437PheThr: 2.437 ± 1.265
0.812PheVal: 0.812 ± 1.332
0.0PheTrp: 0.0 ± 0.0
0.812PheTyr: 0.812 ± 0.488
0.0PheXaa: 0.0 ± 0.0
Gly
4.062GlyAla: 4.062 ± 2.951
0.812GlyCys: 0.812 ± 1.033
5.686GlyAsp: 5.686 ± 2.65
1.625GlyGlu: 1.625 ± 2.066
3.249GlyPhe: 3.249 ± 1.277
9.748GlyGly: 9.748 ± 3.856
1.625GlyHis: 1.625 ± 1.639
3.249GlyIle: 3.249 ± 1.232
1.625GlyLys: 1.625 ± 0.976
4.874GlyLeu: 4.874 ± 1.896
0.812GlyMet: 0.812 ± 0.793
3.249GlyAsn: 3.249 ± 1.951
9.748GlyPro: 9.748 ± 6.169
3.249GlyGln: 3.249 ± 3.303
3.249GlyArg: 3.249 ± 1.236
4.062GlySer: 4.062 ± 1.592
0.812GlyThr: 0.812 ± 1.216
1.625GlyVal: 1.625 ± 0.976
1.625GlyTrp: 1.625 ± 0.863
1.625GlyTyr: 1.625 ± 0.976
0.0GlyXaa: 0.0 ± 0.0
His
0.812HisAla: 0.812 ± 1.033
0.0HisCys: 0.0 ± 0.0
1.625HisAsp: 1.625 ± 0.976
1.625HisGlu: 1.625 ± 0.976
3.249HisPhe: 3.249 ± 0.997
0.812HisGly: 0.812 ± 0.488
0.812HisHis: 0.812 ± 1.332
0.0HisIle: 0.0 ± 0.0
0.812HisLys: 0.812 ± 0.488
4.062HisLeu: 4.062 ± 1.623
1.625HisMet: 1.625 ± 0.976
0.812HisAsn: 0.812 ± 0.488
3.249HisPro: 3.249 ± 1.232
1.625HisGln: 1.625 ± 1.118
4.062HisArg: 4.062 ± 2.683
0.812HisSer: 0.812 ± 0.488
1.625HisThr: 1.625 ± 1.007
0.0HisVal: 0.0 ± 0.0
0.812HisTrp: 0.812 ± 0.488
1.625HisTyr: 1.625 ± 1.118
0.0HisXaa: 0.0 ± 0.0
Ile
4.062IleAla: 4.062 ± 1.746
0.0IleCys: 0.0 ± 0.0
0.0IleAsp: 0.0 ± 0.0
2.437IleGlu: 2.437 ± 1.463
0.0IlePhe: 0.0 ± 0.0
1.625IleGly: 1.625 ± 0.863
0.812IleHis: 0.812 ± 1.216
0.0IleIle: 0.0 ± 0.0
1.625IleLys: 1.625 ± 0.976
3.249IleLeu: 3.249 ± 1.232
0.812IleMet: 0.812 ± 1.332
0.812IleAsn: 0.812 ± 0.488
1.625IlePro: 1.625 ± 0.976
1.625IleGln: 1.625 ± 0.976
2.437IleArg: 2.437 ± 0.948
2.437IleSer: 2.437 ± 2.179
1.625IleThr: 1.625 ± 0.976
0.812IleVal: 0.812 ± 0.488
1.625IleTrp: 1.625 ± 1.007
3.249IleTyr: 3.249 ± 1.236
0.0IleXaa: 0.0 ± 0.0
Lys
0.812LysAla: 0.812 ± 0.488
0.0LysCys: 0.0 ± 0.0
1.625LysAsp: 1.625 ± 0.976
3.249LysGlu: 3.249 ± 1.232
0.812LysPhe: 0.812 ± 0.488
1.625LysGly: 1.625 ± 0.976
1.625LysHis: 1.625 ± 1.007
0.812LysIle: 0.812 ± 1.216
1.625LysLys: 1.625 ± 1.118
1.625LysLeu: 1.625 ± 1.118
0.812LysMet: 0.812 ± 0.488
4.062LysAsn: 4.062 ± 2.439
2.437LysPro: 2.437 ± 1.463
3.249LysGln: 3.249 ± 1.951
6.499LysArg: 6.499 ± 2.573
3.249LysSer: 3.249 ± 2.552
3.249LysThr: 3.249 ± 0.916
2.437LysVal: 2.437 ± 1.463
0.0LysTrp: 0.0 ± 0.0
3.249LysTyr: 3.249 ± 1.232
0.0LysXaa: 0.0 ± 0.0
Leu
6.499LeuAla: 6.499 ± 4.515
2.437LeuCys: 2.437 ± 0.948
1.625LeuAsp: 1.625 ± 0.976
3.249LeuGlu: 3.249 ± 0.916
2.437LeuPhe: 2.437 ± 1.096
3.249LeuGly: 3.249 ± 0.997
0.812LeuHis: 0.812 ± 0.488
1.625LeuIle: 1.625 ± 0.976
4.874LeuLys: 4.874 ± 1.98
6.499LeuLeu: 6.499 ± 1.617
0.812LeuMet: 0.812 ± 0.488
2.437LeuAsn: 2.437 ± 1.014
7.311LeuPro: 7.311 ± 4.241
5.686LeuGln: 5.686 ± 2.416
7.311LeuArg: 7.311 ± 2.247
4.062LeuSer: 4.062 ± 2.16
4.062LeuThr: 4.062 ± 2.439
2.437LeuVal: 2.437 ± 1.463
0.812LeuTrp: 0.812 ± 0.488
5.686LeuTyr: 5.686 ± 3.415
0.0LeuXaa: 0.0 ± 0.0
Met
2.437MetAla: 2.437 ± 1.014
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
1.625MetGlu: 1.625 ± 0.863
0.812MetPhe: 0.812 ± 1.332
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
1.625MetIle: 1.625 ± 0.976
1.625MetLys: 1.625 ± 1.118
1.625MetLeu: 1.625 ± 1.007
0.0MetMet: 0.0 ± 0.0
0.812MetAsn: 0.812 ± 0.488
0.812MetPro: 0.812 ± 0.488
0.812MetGln: 0.812 ± 0.488
1.625MetArg: 1.625 ± 0.976
1.625MetSer: 1.625 ± 0.863
0.0MetThr: 0.0 ± 0.0
1.625MetVal: 1.625 ± 0.976
0.812MetTrp: 0.812 ± 0.488
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.437AsnAla: 2.437 ± 1.096
0.812AsnCys: 0.812 ± 0.488
0.812AsnAsp: 0.812 ± 0.488
0.812AsnGlu: 0.812 ± 0.488
0.812AsnPhe: 0.812 ± 0.488
4.062AsnGly: 4.062 ± 0.928
1.625AsnHis: 1.625 ± 0.976
0.812AsnIle: 0.812 ± 0.488
0.812AsnLys: 0.812 ± 0.488
3.249AsnLeu: 3.249 ± 1.236
0.812AsnMet: 0.812 ± 1.216
4.062AsnAsn: 4.062 ± 1.575
2.437AsnPro: 2.437 ± 1.463
3.249AsnGln: 3.249 ± 1.973
1.625AsnArg: 1.625 ± 2.431
0.812AsnSer: 0.812 ± 1.216
2.437AsnThr: 2.437 ± 1.014
1.625AsnVal: 1.625 ± 0.976
1.625AsnTrp: 1.625 ± 0.976
3.249AsnTyr: 3.249 ± 1.951
0.0AsnXaa: 0.0 ± 0.0
Pro
7.311ProAla: 7.311 ± 4.952
3.249ProCys: 3.249 ± 1.277
0.812ProAsp: 0.812 ± 0.488
7.311ProGlu: 7.311 ± 4.57
3.249ProPhe: 3.249 ± 1.951
8.123ProGly: 8.123 ± 2.921
2.437ProHis: 2.437 ± 2.41
1.625ProIle: 1.625 ± 1.007
4.062ProLys: 4.062 ± 1.592
6.499ProLeu: 6.499 ± 2.193
2.437ProMet: 2.437 ± 1.463
1.625ProAsn: 1.625 ± 1.552
12.998ProPro: 12.998 ± 8.812
8.936ProGln: 8.936 ± 4.04
7.311ProArg: 7.311 ± 3.711
5.686ProSer: 5.686 ± 2.848
6.499ProThr: 6.499 ± 3.23
1.625ProVal: 1.625 ± 0.976
3.249ProTrp: 3.249 ± 1.236
3.249ProTyr: 3.249 ± 1.277
0.0ProXaa: 0.0 ± 0.0
Gln
2.437GlnAla: 2.437 ± 1.096
0.0GlnCys: 0.0 ± 0.0
0.812GlnAsp: 0.812 ± 0.488
4.874GlnGlu: 4.874 ± 1.082
0.812GlnPhe: 0.812 ± 0.488
2.437GlnGly: 2.437 ± 3.099
3.249GlnHis: 3.249 ± 1.951
3.249GlnIle: 3.249 ± 3.013
3.249GlnLys: 3.249 ± 1.277
1.625GlnLeu: 1.625 ± 0.976
0.0GlnMet: 0.0 ± 0.0
3.249GlnAsn: 3.249 ± 2.232
4.062GlnPro: 4.062 ± 1.709
6.499GlnGln: 6.499 ± 2.967
4.062GlnArg: 4.062 ± 1.575
4.874GlnSer: 4.874 ± 0.691
4.874GlnThr: 4.874 ± 1.259
4.062GlnVal: 4.062 ± 2.153
2.437GlnTrp: 2.437 ± 0.948
1.625GlnTyr: 1.625 ± 0.976
0.0GlnXaa: 0.0 ± 0.0
Arg
4.062ArgAla: 4.062 ± 2.711
0.812ArgCys: 0.812 ± 0.488
4.874ArgAsp: 4.874 ± 3.354
6.499ArgGlu: 6.499 ± 2.252
1.625ArgPhe: 1.625 ± 0.863
7.311ArgGly: 7.311 ± 2.843
2.437ArgHis: 2.437 ± 1.463
0.0ArgIle: 0.0 ± 0.0
3.249ArgLys: 3.249 ± 2.552
8.123ArgLeu: 8.123 ± 3.767
0.812ArgMet: 0.812 ± 0.419
4.062ArgAsn: 4.062 ± 1.709
9.748ArgPro: 9.748 ± 3.723
6.499ArgGln: 6.499 ± 1.411
25.183ArgArg: 25.183 ± 6.953
4.062ArgSer: 4.062 ± 2.711
4.874ArgThr: 4.874 ± 1.259
3.249ArgVal: 3.249 ± 1.951
1.625ArgTrp: 1.625 ± 1.007
3.249ArgTyr: 3.249 ± 1.232
0.0ArgXaa: 0.0 ± 0.0
Ser
0.812SerAla: 0.812 ± 0.488
0.812SerCys: 0.812 ± 1.332
4.874SerAsp: 4.874 ± 3.096
2.437SerGlu: 2.437 ± 2.179
4.874SerPhe: 4.874 ± 1.492
4.874SerGly: 4.874 ± 1.761
2.437SerHis: 2.437 ± 0.948
3.249SerIle: 3.249 ± 1.236
4.874SerLys: 4.874 ± 1.082
4.874SerLeu: 4.874 ± 1.721
0.812SerMet: 0.812 ± 0.488
2.437SerAsn: 2.437 ± 1.463
7.311SerPro: 7.311 ± 7.281
4.062SerGln: 4.062 ± 0.928
4.062SerArg: 4.062 ± 3.396
12.185SerSer: 12.185 ± 10.894
5.686SerThr: 5.686 ± 2.807
3.249SerVal: 3.249 ± 0.916
0.0SerTrp: 0.0 ± 0.0
2.437SerTyr: 2.437 ± 1.014
0.0SerXaa: 0.0 ± 0.0
Thr
0.812ThrAla: 0.812 ± 1.332
0.812ThrCys: 0.812 ± 1.332
2.437ThrAsp: 2.437 ± 1.096
2.437ThrGlu: 2.437 ± 1.096
1.625ThrPhe: 1.625 ± 1.007
0.812ThrGly: 0.812 ± 0.488
4.874ThrHis: 4.874 ± 1.896
4.062ThrIle: 4.062 ± 2.439
4.062ThrLys: 4.062 ± 2.439
4.874ThrLeu: 4.874 ± 1.259
0.812ThrMet: 0.812 ± 0.488
3.249ThrAsn: 3.249 ± 1.232
8.123ThrPro: 8.123 ± 4.309
4.062ThrGln: 4.062 ± 1.575
5.686ThrArg: 5.686 ± 4.555
4.062ThrSer: 4.062 ± 4.591
3.249ThrThr: 3.249 ± 3.382
4.062ThrVal: 4.062 ± 2.439
1.625ThrTrp: 1.625 ± 0.976
1.625ThrTyr: 1.625 ± 0.976
0.0ThrXaa: 0.0 ± 0.0
Val
2.437ValAla: 2.437 ± 1.014
0.0ValCys: 0.0 ± 0.0
1.625ValAsp: 1.625 ± 0.976
0.812ValGlu: 0.812 ± 1.332
0.812ValPhe: 0.812 ± 0.488
5.686ValGly: 5.686 ± 1.423
0.812ValHis: 0.812 ± 0.488
0.0ValIle: 0.0 ± 0.0
0.812ValLys: 0.812 ± 0.488
2.437ValLeu: 2.437 ± 1.463
0.0ValMet: 0.0 ± 0.0
0.0ValAsn: 0.0 ± 0.0
4.874ValPro: 4.874 ± 1.259
1.625ValGln: 1.625 ± 1.118
5.686ValArg: 5.686 ± 3.415
4.874ValSer: 4.874 ± 1.934
3.249ValThr: 3.249 ± 2.015
0.812ValVal: 0.812 ± 0.488
1.625ValTrp: 1.625 ± 0.863
0.812ValTyr: 0.812 ± 0.488
0.0ValXaa: 0.0 ± 0.0
Trp
3.249TrpAla: 3.249 ± 3.736
1.625TrpCys: 1.625 ± 1.007
0.812TrpAsp: 0.812 ± 0.488
1.625TrpGlu: 1.625 ± 0.863
0.0TrpPhe: 0.0 ± 0.0
0.812TrpGly: 0.812 ± 0.488
0.0TrpHis: 0.0 ± 0.0
1.625TrpIle: 1.625 ± 0.976
0.0TrpLys: 0.0 ± 0.0
0.812TrpLeu: 0.812 ± 0.488
0.812TrpMet: 0.812 ± 0.488
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
1.625TrpGln: 1.625 ± 0.976
1.625TrpArg: 1.625 ± 0.863
2.437TrpSer: 2.437 ± 0.948
0.0TrpThr: 0.0 ± 0.0
1.625TrpVal: 1.625 ± 0.976
1.625TrpTrp: 1.625 ± 0.863
2.437TrpTyr: 2.437 ± 0.948
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.625TyrAla: 1.625 ± 0.976
0.0TyrCys: 0.0 ± 0.0
1.625TyrAsp: 1.625 ± 0.863
0.812TyrGlu: 0.812 ± 0.488
0.812TyrPhe: 0.812 ± 0.488
1.625TyrGly: 1.625 ± 0.976
0.812TyrHis: 0.812 ± 0.488
2.437TyrIle: 2.437 ± 1.463
3.249TyrLys: 3.249 ± 2.015
4.874TyrLeu: 4.874 ± 3.354
0.812TyrMet: 0.812 ± 0.488
2.437TyrAsn: 2.437 ± 1.463
1.625TyrPro: 1.625 ± 1.118
3.249TyrGln: 3.249 ± 1.236
5.686TyrArg: 5.686 ± 2.504
3.249TyrSer: 3.249 ± 1.951
4.874TyrThr: 4.874 ± 1.98
0.0TyrVal: 0.0 ± 0.0
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1232 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
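A protein-level bootstrap can be sketched as below: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each replicate, and the spread of those estimates is taken as the error. This is an assumption-laden illustration, not the database's actual code: the function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the reported error are all hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_frequency(sequences, pair):
    """Per-mille frequency of one ordered residue pair (one-letter codes)."""
    counts = Counter()
    for seq in sequences:
        for a, b in zip(seq, seq[1:]):
            counts[a + b] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Standard deviation of the pair frequency over resampled protein sets."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)

# Toy input (hypothetical sequences, not the proteome of this virus).
toy = ["RRPRRA", "ARGGS", "MPPRR", "SSTRR"]
err = bootstrap_error(toy, "RR")
```

With only a handful of proteins, as is the case here, each replicate can differ a lot from the original set, which is consistent with the large errors relative to the values in the table.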

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski