Amino acid dipepetide frequency for Tataguine virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.545AlaAla: 2.545 ± 0.362
1.272AlaCys: 1.272 ± 0.816
3.053AlaAsp: 3.053 ± 0.639
4.58AlaGlu: 4.58 ± 3.051
2.545AlaPhe: 2.545 ± 0.92
2.036AlaGly: 2.036 ± 0.643
0.509AlaHis: 0.509 ± 0.321
3.308AlaIle: 3.308 ± 0.535
2.799AlaLys: 2.799 ± 0.431
3.562AlaLeu: 3.562 ± 0.91
1.527AlaMet: 1.527 ± 0.604
3.053AlaAsn: 3.053 ± 0.91
0.509AlaPro: 0.509 ± 0.321
1.018AlaGln: 1.018 ± 0.642
2.799AlaArg: 2.799 ± 0.578
1.781AlaSer: 1.781 ± 0.343
1.527AlaThr: 1.527 ± 1.054
1.272AlaVal: 1.272 ± 0.791
0.254AlaTrp: 0.254 ± 0.16
1.272AlaTyr: 1.272 ± 1.375
0.0AlaXaa: 0.0 ± 0.0
Cys
0.763CysAla: 0.763 ± 0.16
0.509CysCys: 0.509 ± 0.124
1.018CysAsp: 1.018 ± 0.249
1.018CysGlu: 1.018 ± 0.579
2.036CysPhe: 2.036 ± 1.53
3.053CysGly: 3.053 ± 2.482
0.509CysHis: 0.509 ± 0.477
1.018CysIle: 1.018 ± 0.579
3.053CysLys: 3.053 ± 2.482
2.545CysLeu: 2.545 ± 0.603
0.509CysMet: 0.509 ± 0.321
1.018CysAsn: 1.018 ± 0.249
1.527CysPro: 1.527 ± 1.054
1.272CysGln: 1.272 ± 0.447
2.036CysArg: 2.036 ± 0.497
2.545CysSer: 2.545 ± 0.658
1.272CysThr: 1.272 ± 1.192
1.018CysVal: 1.018 ± 0.953
0.0CysTrp: 0.0 ± 0.0
1.018CysTyr: 1.018 ± 0.579
0.0CysXaa: 0.0 ± 0.0
Asp
2.545AspAla: 2.545 ± 0.621
2.036AspCys: 2.036 ± 0.497
2.545AspAsp: 2.545 ± 0.658
2.036AspGlu: 2.036 ± 0.59
5.089AspPhe: 5.089 ± 2.397
1.272AspGly: 1.272 ± 0.791
1.018AspHis: 1.018 ± 0.579
6.87AspIle: 6.87 ± 0.738
1.527AspLys: 1.527 ± 0.373
5.852AspLeu: 5.852 ± 1.684
1.781AspMet: 1.781 ± 0.523
3.562AspAsn: 3.562 ± 0.687
2.29AspPro: 2.29 ± 0.553
1.272AspGln: 1.272 ± 0.579
1.272AspArg: 1.272 ± 0.447
3.817AspSer: 3.817 ± 0.197
2.799AspThr: 2.799 ± 1.4
2.545AspVal: 2.545 ± 0.658
0.763AspTrp: 0.763 ± 0.647
3.308AspTyr: 3.308 ± 1.035
0.0AspXaa: 0.0 ± 0.0
Glu
2.29GluAla: 2.29 ± 0.459
1.527GluCys: 1.527 ± 0.689
2.29GluAsp: 2.29 ± 0.886
5.089GluGlu: 5.089 ± 0.949
3.562GluPhe: 3.562 ± 1.352
3.562GluGly: 3.562 ± 0.687
1.781GluHis: 1.781 ± 0.579
3.562GluIle: 3.562 ± 0.434
4.326GluLys: 4.326 ± 1.33
5.852GluLeu: 5.852 ± 1.069
1.018GluMet: 1.018 ± 0.407
2.545GluAsn: 2.545 ± 1.267
2.545GluPro: 2.545 ± 0.362
1.781GluGln: 1.781 ± 0.762
4.835GluArg: 4.835 ± 1.962
5.344GluSer: 5.344 ± 0.858
3.053GluThr: 3.053 ± 0.91
3.817GluVal: 3.817 ± 0.933
0.763GluTrp: 0.763 ± 0.16
3.562GluTyr: 3.562 ± 0.697
0.0GluXaa: 0.0 ± 0.0
Phe
1.527PheAla: 1.527 ± 0.604
1.781PheCys: 1.781 ± 0.343
1.527PheAsp: 1.527 ± 1.289
3.562PheGlu: 3.562 ± 1.188
3.308PhePhe: 3.308 ± 1.729
1.272PheGly: 1.272 ± 0.579
1.018PheHis: 1.018 ± 0.579
2.545PheIle: 2.545 ± 0.603
4.071PheLys: 4.071 ± 0.801
6.616PheLeu: 6.616 ± 2.697
1.272PheMet: 1.272 ± 0.46
4.835PheAsn: 4.835 ± 1.077
1.018PhePro: 1.018 ± 0.682
0.763PheGln: 0.763 ± 0.16
3.308PheArg: 3.308 ± 1.276
2.545PheSer: 2.545 ± 1.267
3.817PheThr: 3.817 ± 0.961
3.308PheVal: 3.308 ± 0.76
0.254PheTrp: 0.254 ± 0.16
1.527PheTyr: 1.527 ± 0.32
0.0PheXaa: 0.0 ± 0.0
Gly
1.272GlyAla: 1.272 ± 0.46
2.036GlyCys: 2.036 ± 0.803
2.799GlyAsp: 2.799 ± 0.263
3.053GlyGlu: 3.053 ± 0.639
1.527GlyPhe: 1.527 ± 0.948
1.527GlyGly: 1.527 ± 0.948
0.763GlyHis: 0.763 ± 0.715
2.29GlyIle: 2.29 ± 0.356
2.036GlyLys: 2.036 ± 2.02
5.344GlyLeu: 5.344 ± 2.244
0.509GlyMet: 0.509 ± 0.477
1.781GlyAsn: 1.781 ± 0.343
2.29GlyPro: 2.29 ± 0.459
1.781GlyGln: 1.781 ± 0.464
2.799GlyArg: 2.799 ± 0.578
2.545GlySer: 2.545 ± 1.633
3.308GlyThr: 3.308 ± 2.665
3.053GlyVal: 3.053 ± 0.239
1.018GlyTrp: 1.018 ± 0.579
2.036GlyTyr: 2.036 ± 0.803
0.0GlyXaa: 0.0 ± 0.0
His
1.272HisAla: 1.272 ± 0.46
0.509HisCys: 0.509 ± 0.477
0.763HisAsp: 0.763 ± 0.16
0.254HisGlu: 0.254 ± 0.16
0.509HisPhe: 0.509 ± 0.321
1.018HisGly: 1.018 ± 0.295
1.018HisHis: 1.018 ± 0.295
1.781HisIle: 1.781 ± 0.794
1.527HisLys: 1.527 ± 0.609
2.545HisLeu: 2.545 ± 0.895
0.254HisMet: 0.254 ± 0.16
2.036HisAsn: 2.036 ± 0.643
1.272HisPro: 1.272 ± 0.791
0.763HisGln: 0.763 ± 0.345
1.018HisArg: 1.018 ± 0.682
1.272HisSer: 1.272 ± 0.237
1.781HisThr: 1.781 ± 0.579
0.763HisVal: 0.763 ± 0.345
0.254HisTrp: 0.254 ± 0.16
1.018HisTyr: 1.018 ± 0.249
0.0HisXaa: 0.0 ± 0.0
Ile
4.326IleAla: 4.326 ± 0.491
1.781IleCys: 1.781 ± 0.923
5.089IleAsp: 5.089 ± 1.013
5.344IleGlu: 5.344 ± 0.394
2.799IlePhe: 2.799 ± 0.501
3.562IleGly: 3.562 ± 1.825
2.036IleHis: 2.036 ± 0.385
6.361IleIle: 6.361 ± 1.436
6.616IleLys: 6.616 ± 2.135
8.397IleLeu: 8.397 ± 2.221
2.799IleMet: 2.799 ± 1.148
3.308IleAsn: 3.308 ± 1.035
3.053IlePro: 3.053 ± 0.639
2.036IleGln: 2.036 ± 0.385
3.817IleArg: 3.817 ± 1.152
7.125IleSer: 7.125 ± 0.598
5.598IleThr: 5.598 ± 1.096
2.545IleVal: 2.545 ± 0.603
0.763IleTrp: 0.763 ± 0.481
2.036IleTyr: 2.036 ± 0.385
0.0IleXaa: 0.0 ± 0.0
Lys
3.308LysAla: 3.308 ± 0.205
2.036LysCys: 2.036 ± 1.53
4.58LysAsp: 4.58 ± 0.773
6.616LysGlu: 6.616 ± 0.841
3.308LysPhe: 3.308 ± 0.205
2.29LysGly: 2.29 ± 0.741
2.036LysHis: 2.036 ± 0.485
6.107LysIle: 6.107 ± 1.154
4.58LysLys: 4.58 ± 1.568
9.669LysLeu: 9.669 ± 2.279
2.29LysMet: 2.29 ± 1.299
3.817LysAsn: 3.817 ± 1.152
2.036LysPro: 2.036 ± 1.186
2.799LysGln: 2.799 ± 0.54
2.036LysArg: 2.036 ± 0.59
6.361LysSer: 6.361 ± 1.979
7.125LysThr: 7.125 ± 1.506
3.053LysVal: 3.053 ± 0.239
0.763LysTrp: 0.763 ± 0.697
2.545LysTyr: 2.545 ± 0.362
0.0LysXaa: 0.0 ± 0.0
Leu
4.071LeuAla: 4.071 ± 1.584
4.58LeuCys: 4.58 ± 2.452
4.58LeuAsp: 4.58 ± 1.187
6.87LeuGlu: 6.87 ± 1.919
4.835LeuPhe: 4.835 ± 1.33
4.835LeuGly: 4.835 ± 1.915
2.545LeuHis: 2.545 ± 0.363
7.634LeuIle: 7.634 ± 1.151
10.687LeuLys: 10.687 ± 0.935
9.924LeuLeu: 9.924 ± 0.934
2.29LeuMet: 2.29 ± 1.678
6.107LeuAsn: 6.107 ± 0.742
3.562LeuPro: 3.562 ± 0.687
2.29LeuGln: 2.29 ± 0.459
2.29LeuArg: 2.29 ± 0.356
9.924LeuSer: 9.924 ± 1.134
7.888LeuThr: 7.888 ± 2.116
4.326LeuVal: 4.326 ± 1.782
1.018LeuTrp: 1.018 ± 0.295
4.326LeuTyr: 4.326 ± 0.855
0.0LeuXaa: 0.0 ± 0.0
Met
0.509MetAla: 0.509 ± 0.734
1.018MetCys: 1.018 ± 0.295
1.272MetAsp: 1.272 ± 0.802
1.272MetGlu: 1.272 ± 0.237
0.763MetPhe: 0.763 ± 0.16
1.018MetGly: 1.018 ± 0.593
0.254MetHis: 0.254 ± 0.734
3.053MetIle: 3.053 ± 0.746
2.799MetLys: 2.799 ± 0.263
3.308MetLeu: 3.308 ± 0.311
1.272MetMet: 1.272 ± 0.447
2.036MetAsn: 2.036 ± 0.418
0.763MetPro: 0.763 ± 0.481
0.509MetGln: 0.509 ± 0.321
1.527MetArg: 1.527 ± 0.373
3.053MetSer: 3.053 ± 0.639
1.272MetThr: 1.272 ± 0.447
0.763MetVal: 0.763 ± 0.647
0.0MetTrp: 0.0 ± 0.0
2.799MetTyr: 2.799 ± 1.44
0.0MetXaa: 0.0 ± 0.0
Asn
2.545AsnAla: 2.545 ± 0.658
1.781AsnCys: 1.781 ± 1.292
4.326AsnAsp: 4.326 ± 0.491
2.29AsnGlu: 2.29 ± 0.741
2.799AsnPhe: 2.799 ± 0.74
1.272AsnGly: 1.272 ± 0.46
1.018AsnHis: 1.018 ± 0.295
3.308AsnIle: 3.308 ± 0.205
4.326AsnLys: 4.326 ± 0.855
5.344AsnLeu: 5.344 ± 0.842
2.29AsnMet: 2.29 ± 0.459
2.545AsnAsn: 2.545 ± 0.895
3.308AsnPro: 3.308 ± 1.75
2.545AsnGln: 2.545 ± 1.604
1.781AsnArg: 1.781 ± 0.447
5.089AsnSer: 5.089 ± 0.588
3.817AsnThr: 3.817 ± 0.575
2.036AsnVal: 2.036 ± 0.643
0.254AsnTrp: 0.254 ± 0.16
3.053AsnTyr: 3.053 ± 0.639
0.0AsnXaa: 0.0 ± 0.0
Pro
2.29ProAla: 2.29 ± 1.161
0.0ProCys: 0.0 ± 0.0
3.053ProAsp: 3.053 ± 1.791
3.053ProGlu: 3.053 ± 0.577
1.527ProPhe: 1.527 ± 0.651
2.29ProGly: 2.29 ± 0.356
0.254ProHis: 0.254 ± 0.734
4.071ProIle: 4.071 ± 0.835
1.018ProLys: 1.018 ± 0.593
3.562ProLeu: 3.562 ± 0.91
1.018ProMet: 1.018 ± 0.579
1.018ProAsn: 1.018 ± 0.295
0.763ProPro: 0.763 ± 0.16
1.018ProGln: 1.018 ± 0.642
2.29ProArg: 2.29 ± 0.7
2.036ProSer: 2.036 ± 0.59
3.308ProThr: 3.308 ± 0.944
2.29ProVal: 2.29 ± 0.48
0.509ProTrp: 0.509 ± 0.124
1.018ProTyr: 1.018 ± 0.295
0.0ProXaa: 0.0 ± 0.0
Gln
1.272GlnAla: 1.272 ± 0.579
0.509GlnCys: 0.509 ± 0.124
1.527GlnAsp: 1.527 ± 0.604
1.527GlnGlu: 1.527 ± 0.604
1.272GlnPhe: 1.272 ± 0.237
1.272GlnGly: 1.272 ± 0.579
0.0GlnHis: 0.0 ± 0.0
3.053GlnIle: 3.053 ± 0.886
2.545GlnLys: 2.545 ± 1.387
3.053GlnLeu: 3.053 ± 0.886
0.763GlnMet: 0.763 ± 0.808
1.781GlnAsn: 1.781 ± 1.272
0.254GlnPro: 0.254 ± 0.238
0.763GlnGln: 0.763 ± 0.16
1.272GlnArg: 1.272 ± 0.802
2.29GlnSer: 2.29 ± 1.08
2.799GlnThr: 2.799 ± 0.54
1.272GlnVal: 1.272 ± 0.46
0.254GlnTrp: 0.254 ± 0.16
1.781GlnTyr: 1.781 ± 0.579
0.0GlnXaa: 0.0 ± 0.0
Arg
1.527ArgAla: 1.527 ± 0.497
1.272ArgCys: 1.272 ± 0.46
3.053ArgAsp: 3.053 ± 0.577
2.29ArgGlu: 2.29 ± 0.48
1.527ArgPhe: 1.527 ± 0.604
0.509ArgGly: 0.509 ± 0.477
1.527ArgHis: 1.527 ± 0.604
4.071ArgIle: 4.071 ± 0.97
3.817ArgLys: 3.817 ± 0.712
4.58ArgLeu: 4.58 ± 1.806
1.272ArgMet: 1.272 ± 0.575
2.29ArgAsn: 2.29 ± 0.741
1.527ArgPro: 1.527 ± 0.651
1.272ArgGln: 1.272 ± 0.579
1.527ArgArg: 1.527 ± 0.963
2.545ArgSer: 2.545 ± 0.895
3.562ArgThr: 3.562 ± 0.697
2.036ArgVal: 2.036 ± 0.418
0.509ArgTrp: 0.509 ± 0.321
1.272ArgTyr: 1.272 ± 0.447
0.0ArgXaa: 0.0 ± 0.0
Ser
1.527SerAla: 1.527 ± 0.32
1.781SerCys: 1.781 ± 1.292
4.326SerAsp: 4.326 ± 1.33
5.852SerGlu: 5.852 ± 1.069
4.326SerPhe: 4.326 ± 0.67
3.053SerGly: 3.053 ± 1.452
1.527SerHis: 1.527 ± 0.373
7.634SerIle: 7.634 ± 1.599
6.87SerLys: 6.87 ± 1.182
7.634SerLeu: 7.634 ± 1.423
3.308SerMet: 3.308 ± 1.035
4.835SerAsn: 4.835 ± 1.529
2.036SerPro: 2.036 ± 0.59
2.799SerGln: 2.799 ± 0.501
2.545SerArg: 2.545 ± 0.895
5.089SerSer: 5.089 ± 1.036
5.852SerThr: 5.852 ± 1.143
3.817SerVal: 3.817 ± 0.712
0.509SerTrp: 0.509 ± 0.124
3.562SerTyr: 3.562 ± 0.434
0.0SerXaa: 0.0 ± 0.0
Thr
3.562ThrAla: 3.562 ± 1.288
1.272ThrCys: 1.272 ± 1.192
3.562ThrAsp: 3.562 ± 0.121
3.562ThrGlu: 3.562 ± 0.929
4.071ThrPhe: 4.071 ± 1.95
5.344ThrGly: 5.344 ± 2.388
1.272ThrHis: 1.272 ± 0.579
6.107ThrIle: 6.107 ± 1.199
4.326ThrLys: 4.326 ± 2.001
5.089ThrLeu: 5.089 ± 0.588
0.763ThrMet: 0.763 ± 0.481
2.545ThrAsn: 2.545 ± 0.603
3.562ThrPro: 3.562 ± 0.687
1.018ThrGln: 1.018 ± 0.593
1.527ThrArg: 1.527 ± 0.32
7.888ThrSer: 7.888 ± 1.551
3.562ThrThr: 3.562 ± 0.375
5.598ThrVal: 5.598 ± 1.395
0.509ThrTrp: 0.509 ± 0.734
2.799ThrTyr: 2.799 ± 0.578
0.0ThrXaa: 0.0 ± 0.0
Val
3.053ValAla: 3.053 ± 0.239
1.527ValCys: 1.527 ± 0.32
2.036ValAsp: 2.036 ± 0.418
2.799ValGlu: 2.799 ± 1.61
1.527ValPhe: 1.527 ± 0.32
2.036ValGly: 2.036 ± 0.385
1.781ValHis: 1.781 ± 0.343
1.781ValIle: 1.781 ± 0.447
4.835ValLys: 4.835 ± 3.247
5.089ValLeu: 5.089 ± 1.522
1.781ValMet: 1.781 ± 1.123
2.036ValAsn: 2.036 ± 0.485
2.036ValPro: 2.036 ± 0.975
1.272ValGln: 1.272 ± 0.46
1.781ValArg: 1.781 ± 0.464
4.58ValSer: 4.58 ± 0.267
3.053ValThr: 3.053 ± 1.791
2.29ValVal: 2.29 ± 0.816
0.0ValTrp: 0.0 ± 0.0
2.29ValTyr: 2.29 ± 0.459
0.0ValXaa: 0.0 ± 0.0
Trp
0.509TrpAla: 0.509 ± 0.124
0.0TrpCys: 0.0 ± 0.0
0.509TrpAsp: 0.509 ± 0.321
1.018TrpGlu: 1.018 ± 0.295
0.763TrpPhe: 0.763 ± 0.16
0.254TrpGly: 0.254 ± 0.238
0.0TrpHis: 0.0 ± 0.0
0.254TrpIle: 0.254 ± 0.238
0.509TrpLys: 0.509 ± 0.124
1.527TrpLeu: 1.527 ± 0.604
0.509TrpMet: 0.509 ± 0.734
0.509TrpAsn: 0.509 ± 0.321
0.0TrpPro: 0.0 ± 0.0
0.509TrpGln: 0.509 ± 0.697
0.0TrpArg: 0.0 ± 0.0
1.018TrpSer: 1.018 ± 0.295
0.254TrpThr: 0.254 ± 0.734
0.509TrpVal: 0.509 ± 0.321
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.509TyrAla: 0.509 ± 0.124
0.509TyrCys: 0.509 ± 0.124
2.29TyrAsp: 2.29 ± 0.553
1.272TyrGlu: 1.272 ± 0.237
1.781TyrPhe: 1.781 ± 0.762
2.29TyrGly: 2.29 ± 0.816
0.763TyrHis: 0.763 ± 0.345
4.58TyrIle: 4.58 ± 0.855
5.089TyrLys: 5.089 ± 1.243
5.089TyrLeu: 5.089 ± 0.269
1.781TyrMet: 1.781 ± 0.334
4.071TyrAsn: 4.071 ± 0.77
2.036TyrPro: 2.036 ± 1.275
1.781TyrGln: 1.781 ± 1.123
1.527TyrArg: 1.527 ± 0.373
2.036TyrSer: 2.036 ± 0.803
2.29TyrThr: 2.29 ± 0.497
1.272TyrVal: 1.272 ± 0.237
0.254TyrTrp: 0.254 ± 0.16
0.254TyrTyr: 0.254 ± 0.238
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3931 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski