Amino acid dipepetide frequency for Trichodysplasia spinulosa-associated polyomavirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.695AlaAla: 5.695 ± 2.655
0.475AlaCys: 0.475 ± 0.36
0.949AlaAsp: 0.949 ± 0.72
2.848AlaGlu: 2.848 ± 0.725
0.475AlaPhe: 0.475 ± 0.456
1.898AlaGly: 1.898 ± 0.932
2.848AlaHis: 2.848 ± 1.355
1.898AlaIle: 1.898 ± 0.727
0.949AlaLys: 0.949 ± 0.72
6.17AlaLeu: 6.17 ± 1.439
0.475AlaMet: 0.475 ± 0.483
1.898AlaAsn: 1.898 ± 0.948
3.322AlaPro: 3.322 ± 1.268
0.0AlaGln: 0.0 ± 0.0
5.695AlaArg: 5.695 ± 1.751
3.797AlaSer: 3.797 ± 2.274
2.848AlaThr: 2.848 ± 0.856
4.746AlaVal: 4.746 ± 1.848
2.373AlaTrp: 2.373 ± 1.187
0.475AlaTyr: 0.475 ± 0.36
0.0AlaXaa: 0.0 ± 0.0
Cys
2.848CysAla: 2.848 ± 1.084
0.949CysCys: 0.949 ± 0.63
0.475CysAsp: 0.475 ± 0.456
0.0CysGlu: 0.0 ± 0.0
0.949CysPhe: 0.949 ± 0.66
0.475CysGly: 0.475 ± 0.483
0.475CysHis: 0.475 ± 0.36
0.949CysIle: 0.949 ± 0.66
3.797CysLys: 3.797 ± 1.065
4.271CysLeu: 4.271 ± 1.551
0.475CysMet: 0.475 ± 0.36
0.475CysAsn: 0.475 ± 0.456
3.322CysPro: 3.322 ± 1.083
0.475CysGln: 0.475 ± 0.36
0.475CysArg: 0.475 ± 0.483
0.949CysSer: 0.949 ± 0.72
1.424CysThr: 1.424 ± 1.08
1.424CysVal: 1.424 ± 0.578
0.475CysTrp: 0.475 ± 0.456
3.797CysTyr: 3.797 ± 1.747
0.0CysXaa: 0.0 ± 0.0
Asp
1.424AspAla: 1.424 ± 1.254
0.0AspCys: 0.0 ± 0.0
0.949AspAsp: 0.949 ± 0.512
4.746AspGlu: 4.746 ± 0.571
2.373AspPhe: 2.373 ± 0.952
4.271AspGly: 4.271 ± 1.5
1.424AspHis: 1.424 ± 0.677
3.797AspIle: 3.797 ± 1.552
6.17AspLys: 6.17 ± 1.955
6.645AspLeu: 6.645 ± 2.183
1.424AspMet: 1.424 ± 0.901
1.898AspAsn: 1.898 ± 1.025
4.271AspPro: 4.271 ± 1.184
0.949AspGln: 0.949 ± 0.512
0.949AspArg: 0.949 ± 0.805
3.322AspSer: 3.322 ± 1.143
0.475AspThr: 0.475 ± 0.36
1.898AspVal: 1.898 ± 1.439
0.949AspTrp: 0.949 ± 0.805
1.898AspTyr: 1.898 ± 0.577
0.0AspXaa: 0.0 ± 0.0
Glu
3.797GluAla: 3.797 ± 2.262
3.322GluCys: 3.322 ± 1.013
4.271GluAsp: 4.271 ± 0.667
10.441GluGlu: 10.441 ± 2.418
3.322GluPhe: 3.322 ± 1.586
5.221GluGly: 5.221 ± 0.699
0.0GluHis: 0.0 ± 0.0
1.424GluIle: 1.424 ± 0.737
4.746GluLys: 4.746 ± 0.785
5.221GluLeu: 5.221 ± 0.423
0.0GluMet: 0.0 ± 0.0
5.221GluAsn: 5.221 ± 0.455
1.424GluPro: 1.424 ± 0.759
0.0GluGln: 0.0 ± 0.0
2.373GluArg: 2.373 ± 0.926
6.645GluSer: 6.645 ± 1.523
1.898GluThr: 1.898 ± 0.727
5.695GluVal: 5.695 ± 1.738
0.475GluTrp: 0.475 ± 0.36
2.373GluTyr: 2.373 ± 0.952
0.0GluXaa: 0.0 ± 0.0
Phe
5.695PheAla: 5.695 ± 1.157
1.424PheCys: 1.424 ± 0.578
0.949PheAsp: 0.949 ± 0.463
2.373PheGlu: 2.373 ± 1.799
0.949PhePhe: 0.949 ± 0.689
3.797PheGly: 3.797 ± 1.138
1.424PheHis: 1.424 ± 0.678
0.0PheIle: 0.0 ± 0.0
1.898PheLys: 1.898 ± 1.439
5.695PheLeu: 5.695 ± 1.748
0.475PheMet: 0.475 ± 0.413
2.848PheAsn: 2.848 ± 0.84
5.695PhePro: 5.695 ± 0.955
2.848PheGln: 2.848 ± 0.911
0.475PheArg: 0.475 ± 0.36
4.271PheSer: 4.271 ± 1.261
1.898PheThr: 1.898 ± 1.025
0.475PheVal: 0.475 ± 0.36
0.0PheTrp: 0.0 ± 0.0
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
1.898GlyAla: 1.898 ± 1.194
1.424GlyCys: 1.424 ± 0.759
3.797GlyAsp: 3.797 ± 1.003
4.746GlyGlu: 4.746 ± 2.032
2.848GlyPhe: 2.848 ± 0.731
4.746GlyGly: 4.746 ± 0.769
1.898GlyHis: 1.898 ± 0.727
4.271GlyIle: 4.271 ± 1.238
4.746GlyLys: 4.746 ± 1.46
7.119GlyLeu: 7.119 ± 3.307
0.949GlyMet: 0.949 ± 0.639
2.373GlyAsn: 2.373 ± 0.805
4.271GlyPro: 4.271 ± 1.009
3.322GlyGln: 3.322 ± 1.895
1.424GlyArg: 1.424 ± 0.677
2.373GlySer: 2.373 ± 0.604
3.322GlyThr: 3.322 ± 1.819
3.797GlyVal: 3.797 ± 0.906
0.0GlyTrp: 0.0 ± 0.0
1.898GlyTyr: 1.898 ± 1.194
0.0GlyXaa: 0.0 ± 0.0
His
1.424HisAla: 1.424 ± 1.254
1.898HisCys: 1.898 ± 0.7
0.475HisAsp: 0.475 ± 0.36
1.424HisGlu: 1.424 ± 0.677
0.949HisPhe: 0.949 ± 0.805
0.0HisGly: 0.0 ± 0.0
1.424HisHis: 1.424 ± 0.578
0.0HisIle: 0.0 ± 0.0
1.898HisLys: 1.898 ± 0.7
2.848HisLeu: 2.848 ± 1.045
0.949HisMet: 0.949 ± 0.912
0.475HisAsn: 0.475 ± 0.36
1.898HisPro: 1.898 ± 0.7
3.797HisGln: 3.797 ± 1.735
0.475HisArg: 0.475 ± 0.36
0.949HisSer: 0.949 ± 0.72
1.424HisThr: 1.424 ± 0.677
1.424HisVal: 1.424 ± 0.578
1.898HisTrp: 1.898 ± 1.053
2.373HisTyr: 2.373 ± 0.573
0.0HisXaa: 0.0 ± 0.0
Ile
0.949IleAla: 0.949 ± 0.72
0.949IleCys: 0.949 ± 0.463
1.424IleAsp: 1.424 ± 0.759
4.271IleGlu: 4.271 ± 1.507
2.373IlePhe: 2.373 ± 1.309
2.848IleGly: 2.848 ± 1.302
0.949IleHis: 0.949 ± 0.805
0.475IleIle: 0.475 ± 0.544
4.271IleLys: 4.271 ± 1.33
4.271IleLeu: 4.271 ± 1.227
0.949IleMet: 0.949 ± 0.512
2.373IleAsn: 2.373 ± 0.604
3.797IlePro: 3.797 ± 0.404
0.475IleGln: 0.475 ± 0.483
1.424IleArg: 1.424 ± 1.254
4.271IleSer: 4.271 ± 2.945
1.898IleThr: 1.898 ± 0.577
2.848IleVal: 2.848 ± 0.809
0.0IleTrp: 0.0 ± 0.0
3.322IleTyr: 3.322 ± 0.884
0.0IleXaa: 0.0 ± 0.0
Lys
3.322LysAla: 3.322 ± 1.274
3.797LysCys: 3.797 ± 1.714
2.373LysAsp: 2.373 ± 1.393
3.322LysGlu: 3.322 ± 1.233
2.373LysPhe: 2.373 ± 0.604
3.797LysGly: 3.797 ± 1.575
2.848LysHis: 2.848 ± 0.84
4.271LysIle: 4.271 ± 1.122
7.594LysLys: 7.594 ± 2.215
7.594LysLeu: 7.594 ± 3.028
2.848LysMet: 2.848 ± 1.515
2.373LysAsn: 2.373 ± 0.91
2.373LysPro: 2.373 ± 1.244
2.373LysGln: 2.373 ± 0.604
3.322LysArg: 3.322 ± 1.083
0.949LysSer: 0.949 ± 0.72
3.797LysThr: 3.797 ± 1.996
0.949LysVal: 0.949 ± 0.512
0.0LysTrp: 0.0 ± 0.0
3.797LysTyr: 3.797 ± 1.401
0.0LysXaa: 0.0 ± 0.0
Leu
0.949LeuAla: 0.949 ± 0.689
1.898LeuCys: 1.898 ± 1.072
5.695LeuAsp: 5.695 ± 1.185
5.221LeuGlu: 5.221 ± 0.938
5.695LeuPhe: 5.695 ± 0.992
2.848LeuGly: 2.848 ± 1.457
1.424LeuHis: 1.424 ± 0.677
5.695LeuIle: 5.695 ± 0.844
4.746LeuLys: 4.746 ± 1.793
8.543LeuLeu: 8.543 ± 3.695
9.018LeuMet: 9.018 ± 1.954
7.119LeuAsn: 7.119 ± 1.516
6.17LeuPro: 6.17 ± 1.645
9.018LeuGln: 9.018 ± 1.192
4.271LeuArg: 4.271 ± 0.477
5.695LeuSer: 5.695 ± 1.587
6.17LeuThr: 6.17 ± 1.029
5.221LeuVal: 5.221 ± 1.262
2.373LeuTrp: 2.373 ± 1.187
6.17LeuTyr: 6.17 ± 1.746
0.0LeuXaa: 0.0 ± 0.0
Met
4.271MetAla: 4.271 ± 1.092
0.475MetCys: 0.475 ± 0.36
3.322MetAsp: 3.322 ± 1.013
0.949MetGlu: 0.949 ± 0.689
1.898MetPhe: 1.898 ± 0.685
2.373MetGly: 2.373 ± 0.875
1.424MetHis: 1.424 ± 0.852
1.424MetIle: 1.424 ± 0.894
1.898MetLys: 1.898 ± 0.532
2.848MetLeu: 2.848 ± 0.925
0.475MetMet: 0.475 ± 0.483
2.373MetAsn: 2.373 ± 0.91
0.475MetPro: 0.475 ± 0.456
0.949MetGln: 0.949 ± 0.512
0.949MetArg: 0.949 ± 0.805
2.848MetSer: 2.848 ± 1.155
1.424MetThr: 1.424 ± 0.495
0.949MetVal: 0.949 ± 0.512
0.475MetTrp: 0.475 ± 0.456
0.475MetTyr: 0.475 ± 0.456
0.0MetXaa: 0.0 ± 0.0
Asn
2.373AsnAla: 2.373 ± 0.952
0.949AsnCys: 0.949 ± 0.72
1.898AsnAsp: 1.898 ± 1.025
2.373AsnGlu: 2.373 ± 1.244
3.797AsnPhe: 3.797 ± 0.81
1.898AsnGly: 1.898 ± 0.899
2.373AsnHis: 2.373 ± 0.829
2.848AsnIle: 2.848 ± 1.07
1.424AsnLys: 1.424 ± 1.08
8.068AsnLeu: 8.068 ± 2.246
0.0AsnMet: 0.0 ± 0.0
0.475AsnAsn: 0.475 ± 0.456
1.898AsnPro: 1.898 ± 0.899
2.848AsnGln: 2.848 ± 1.155
0.949AsnArg: 0.949 ± 0.463
2.373AsnSer: 2.373 ± 0.573
1.898AsnThr: 1.898 ± 1.823
3.797AsnVal: 3.797 ± 0.404
1.424AsnTrp: 1.424 ± 0.578
1.424AsnTyr: 1.424 ± 0.901
0.0AsnXaa: 0.0 ± 0.0
Pro
2.373ProAla: 2.373 ± 1.128
0.949ProCys: 0.949 ± 0.463
5.695ProAsp: 5.695 ± 0.854
2.848ProGlu: 2.848 ± 1.258
2.848ProPhe: 2.848 ± 1.051
6.645ProGly: 6.645 ± 1.876
0.949ProHis: 0.949 ± 0.63
1.424ProIle: 1.424 ± 0.759
3.322ProLys: 3.322 ± 1.745
4.271ProLeu: 4.271 ± 1.114
1.424ProMet: 1.424 ± 0.988
0.475ProAsn: 0.475 ± 0.36
6.645ProPro: 6.645 ± 1.853
3.322ProGln: 3.322 ± 1.884
2.848ProArg: 2.848 ± 1.278
4.271ProSer: 4.271 ± 1.11
5.221ProThr: 5.221 ± 1.131
2.848ProVal: 2.848 ± 2.225
0.949ProTrp: 0.949 ± 0.805
0.949ProTyr: 0.949 ± 0.512
0.0ProXaa: 0.0 ± 0.0
Gln
1.898GlnAla: 1.898 ± 0.679
0.949GlnCys: 0.949 ± 0.66
3.322GlnAsp: 3.322 ± 0.85
5.221GlnGlu: 5.221 ± 1.321
1.898GlnPhe: 1.898 ± 1.025
2.373GlnGly: 2.373 ± 0.805
1.424GlnHis: 1.424 ± 0.578
4.271GlnIle: 4.271 ± 0.968
4.746GlnLys: 4.746 ± 1.2
2.373GlnLeu: 2.373 ± 0.829
1.898GlnMet: 1.898 ± 0.681
0.475GlnAsn: 0.475 ± 0.456
1.424GlnPro: 1.424 ± 0.551
0.475GlnGln: 0.475 ± 0.36
1.898GlnArg: 1.898 ± 0.679
1.898GlnSer: 1.898 ± 0.899
1.898GlnThr: 1.898 ± 1.082
3.322GlnVal: 3.322 ± 0.376
0.0GlnTrp: 0.0 ± 0.0
0.949GlnTyr: 0.949 ± 0.512
0.0GlnXaa: 0.0 ± 0.0
Arg
0.949ArgAla: 0.949 ± 0.463
0.475ArgCys: 0.475 ± 0.36
3.322ArgAsp: 3.322 ± 1.268
4.746ArgGlu: 4.746 ± 2.692
0.949ArgPhe: 0.949 ± 0.72
1.898ArgGly: 1.898 ± 0.577
2.848ArgHis: 2.848 ± 0.725
1.424ArgIle: 1.424 ± 0.551
2.373ArgLys: 2.373 ± 1.393
4.746ArgLeu: 4.746 ± 0.958
3.322ArgMet: 3.322 ± 1.451
1.424ArgAsn: 1.424 ± 0.502
1.898ArgPro: 1.898 ± 0.577
1.424ArgGln: 1.424 ± 0.578
4.746ArgArg: 4.746 ± 1.916
1.424ArgSer: 1.424 ± 1.08
1.898ArgThr: 1.898 ± 1.26
2.848ArgVal: 2.848 ± 0.615
0.949ArgTrp: 0.949 ± 0.805
3.797ArgTyr: 3.797 ± 2.044
0.0ArgXaa: 0.0 ± 0.0
Ser
4.746SerAla: 4.746 ± 1.912
2.373SerCys: 2.373 ± 0.604
1.424SerAsp: 1.424 ± 0.759
3.322SerGlu: 3.322 ± 1.133
3.797SerPhe: 3.797 ± 1.926
2.373SerGly: 2.373 ± 0.851
0.949SerHis: 0.949 ± 0.805
2.848SerIle: 2.848 ± 0.706
2.373SerLys: 2.373 ± 1.244
9.018SerLeu: 9.018 ± 2.643
1.424SerMet: 1.424 ± 0.874
2.373SerAsn: 2.373 ± 0.926
2.373SerPro: 2.373 ± 0.926
2.848SerGln: 2.848 ± 0.802
7.119SerArg: 7.119 ± 1.965
5.221SerSer: 5.221 ± 0.685
3.797SerThr: 3.797 ± 1.695
1.898SerVal: 1.898 ± 1.072
1.898SerTrp: 1.898 ± 0.7
1.424SerTyr: 1.424 ± 0.773
0.0SerXaa: 0.0 ± 0.0
Thr
0.949ThrAla: 0.949 ± 0.512
2.373ThrCys: 2.373 ± 1.203
2.373ThrAsp: 2.373 ± 0.573
1.898ThrGlu: 1.898 ± 0.839
2.373ThrPhe: 2.373 ± 1.444
3.322ThrGly: 3.322 ± 1.865
0.475ThrHis: 0.475 ± 0.36
2.373ThrIle: 2.373 ± 0.846
2.373ThrLys: 2.373 ± 1.393
3.322ThrLeu: 3.322 ± 0.715
2.373ThrMet: 2.373 ± 0.846
3.322ThrAsn: 3.322 ± 1.046
5.221ThrPro: 5.221 ± 0.984
3.797ThrGln: 3.797 ± 1.651
2.373ThrArg: 2.373 ± 1.393
1.898ThrSer: 1.898 ± 0.532
3.322ThrThr: 3.322 ± 0.557
3.797ThrVal: 3.797 ± 1.667
0.0ThrTrp: 0.0 ± 0.0
1.424ThrTyr: 1.424 ± 0.773
0.0ThrXaa: 0.0 ± 0.0
Val
1.424ValAla: 1.424 ± 0.502
0.949ValCys: 0.949 ± 0.805
1.424ValAsp: 1.424 ± 0.852
4.746ValGlu: 4.746 ± 1.521
0.949ValPhe: 0.949 ± 0.463
4.746ValGly: 4.746 ± 2.215
0.475ValHis: 0.475 ± 0.456
4.271ValIle: 4.271 ± 0.98
1.898ValLys: 1.898 ± 1.072
5.221ValLeu: 5.221 ± 2.451
0.949ValMet: 0.949 ± 0.463
4.746ValAsn: 4.746 ± 0.92
2.373ValPro: 2.373 ± 1.244
1.424ValGln: 1.424 ± 0.773
1.424ValArg: 1.424 ± 1.367
5.221ValSer: 5.221 ± 1.494
3.797ValThr: 3.797 ± 1.996
4.271ValVal: 4.271 ± 1.494
3.322ValTrp: 3.322 ± 1.529
1.898ValTyr: 1.898 ± 0.532
0.0ValXaa: 0.0 ± 0.0
Trp
2.373TrpAla: 2.373 ± 0.829
0.475TrpCys: 0.475 ± 0.456
2.373TrpAsp: 2.373 ± 1.444
1.898TrpGlu: 1.898 ± 0.532
0.949TrpPhe: 0.949 ± 0.66
1.424TrpGly: 1.424 ± 1.083
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.475TrpLys: 0.475 ± 0.36
0.0TrpLeu: 0.0 ± 0.0
0.949TrpMet: 0.949 ± 0.805
0.475TrpAsn: 0.475 ± 0.36
0.949TrpPro: 0.949 ± 0.66
1.424TrpGln: 1.424 ± 0.578
0.949TrpArg: 0.949 ± 0.805
0.949TrpSer: 0.949 ± 0.66
0.0TrpThr: 0.0 ± 0.0
1.424TrpVal: 1.424 ± 0.894
0.475TrpTrp: 0.475 ± 0.36
1.898TrpTyr: 1.898 ± 0.7
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.475TyrAla: 0.475 ± 0.544
2.373TyrCys: 2.373 ± 1.187
3.322TyrAsp: 3.322 ± 0.696
0.475TyrGlu: 0.475 ± 0.36
1.424TyrPhe: 1.424 ± 0.678
4.271TyrGly: 4.271 ± 0.69
2.373TyrHis: 2.373 ± 0.91
0.475TyrIle: 0.475 ± 0.456
2.848TyrLys: 2.848 ± 1.084
5.695TyrLeu: 5.695 ± 0.844
1.424TyrMet: 1.424 ± 0.762
1.898TyrAsn: 1.898 ± 0.7
0.475TyrPro: 0.475 ± 0.456
0.949TyrGln: 0.949 ± 0.805
3.322TyrArg: 3.322 ± 1.316
4.271TyrSer: 4.271 ± 1.306
0.949TyrThr: 0.949 ± 0.512
1.898TyrVal: 1.898 ± 0.577
1.424TyrTrp: 1.424 ± 0.578
3.322TyrTyr: 3.322 ± 2.295
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2108 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski