Amino acid dipepetide frequency for African pouched rat arterivirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.563AlaAla: 11.563 ± 1.803
2.313AlaCys: 2.313 ± 0.58
2.857AlaAsp: 2.857 ± 0.729
4.897AlaGlu: 4.897 ± 0.899
3.265AlaPhe: 3.265 ± 0.821
5.441AlaGly: 5.441 ± 0.856
3.129AlaHis: 3.129 ± 0.9
3.945AlaIle: 3.945 ± 1.429
4.353AlaLys: 4.353 ± 1.578
11.835AlaLeu: 11.835 ± 1.553
1.905AlaMet: 1.905 ± 0.471
3.673AlaAsn: 3.673 ± 0.989
4.897AlaPro: 4.897 ± 1.199
2.177AlaGln: 2.177 ± 0.605
3.673AlaArg: 3.673 ± 0.789
6.122AlaSer: 6.122 ± 1.732
4.353AlaThr: 4.353 ± 1.562
10.747AlaVal: 10.747 ± 2.092
1.088AlaTrp: 1.088 ± 0.28
1.768AlaTyr: 1.768 ± 0.548
0.0AlaXaa: 0.0 ± 0.0
Cys
1.768CysAla: 1.768 ± 0.61
1.224CysCys: 1.224 ± 0.593
2.585CysAsp: 2.585 ± 0.729
0.408CysGlu: 0.408 ± 0.133
1.36CysPhe: 1.36 ± 1.031
4.217CysGly: 4.217 ± 0.546
0.952CysHis: 0.952 ± 0.258
1.632CysIle: 1.632 ± 0.388
1.496CysLys: 1.496 ± 0.294
3.809CysLeu: 3.809 ± 0.65
0.272CysMet: 0.272 ± 0.203
0.816CysAsn: 0.816 ± 0.459
1.905CysPro: 1.905 ± 0.311
1.088CysGln: 1.088 ± 0.365
1.496CysArg: 1.496 ± 0.366
2.857CysSer: 2.857 ± 0.757
2.993CysThr: 2.993 ± 0.505
1.905CysVal: 1.905 ± 1.065
0.68CysTrp: 0.68 ± 0.301
1.496CysTyr: 1.496 ± 0.358
0.0CysXaa: 0.0 ± 0.0
Asp
2.585AspAla: 2.585 ± 0.526
1.224AspCys: 1.224 ± 0.376
1.632AspAsp: 1.632 ± 0.42
2.857AspGlu: 2.857 ± 0.523
2.177AspPhe: 2.177 ± 0.513
4.353AspGly: 4.353 ± 1.085
1.36AspHis: 1.36 ± 0.316
1.768AspIle: 1.768 ± 0.315
1.632AspLys: 1.632 ± 0.416
5.305AspLeu: 5.305 ± 0.861
0.816AspMet: 0.816 ± 0.525
1.088AspAsn: 1.088 ± 0.285
1.905AspPro: 1.905 ± 0.44
2.041AspGln: 2.041 ± 0.555
2.177AspArg: 2.177 ± 0.799
3.537AspSer: 3.537 ± 0.82
1.224AspThr: 1.224 ± 0.621
3.809AspVal: 3.809 ± 1.176
0.272AspTrp: 0.272 ± 0.093
1.36AspTyr: 1.36 ± 0.44
0.0AspXaa: 0.0 ± 0.0
Glu
3.129GluAla: 3.129 ± 0.602
0.816GluCys: 0.816 ± 0.265
2.313GluAsp: 2.313 ± 0.469
2.857GluGlu: 2.857 ± 0.54
1.905GluPhe: 1.905 ± 0.491
4.625GluGly: 4.625 ± 1.036
1.224GluHis: 1.224 ± 1.185
1.768GluIle: 1.768 ± 0.567
0.952GluLys: 0.952 ± 0.31
3.401GluLeu: 3.401 ± 0.719
1.088GluMet: 1.088 ± 0.306
0.952GluAsn: 0.952 ± 1.089
1.632GluPro: 1.632 ± 0.344
2.177GluGln: 2.177 ± 0.477
2.041GluArg: 2.041 ± 0.416
3.129GluSer: 3.129 ± 0.57
2.041GluThr: 2.041 ± 0.435
2.993GluVal: 2.993 ± 0.395
0.136GluTrp: 0.136 ± 0.354
1.496GluTyr: 1.496 ± 0.401
0.0GluXaa: 0.0 ± 0.0
Phe
4.353PheAla: 4.353 ± 0.881
1.632PheCys: 1.632 ± 0.302
1.905PheAsp: 1.905 ± 0.36
2.313PheGlu: 2.313 ± 0.854
1.224PhePhe: 1.224 ± 0.678
2.313PheGly: 2.313 ± 0.734
0.544PheHis: 0.544 ± 0.365
2.041PheIle: 2.041 ± 0.763
0.68PheLys: 0.68 ± 0.293
3.945PheLeu: 3.945 ± 1.26
0.952PheMet: 0.952 ± 0.678
1.088PheAsn: 1.088 ± 0.298
1.905PhePro: 1.905 ± 0.312
2.041PheGln: 2.041 ± 0.882
1.905PheArg: 1.905 ± 0.806
1.768PheSer: 1.768 ± 1.458
2.857PheThr: 2.857 ± 0.32
4.217PheVal: 4.217 ± 0.734
0.136PheTrp: 0.136 ± 0.091
1.224PheTyr: 1.224 ± 0.341
0.0PheXaa: 0.0 ± 0.0
Gly
7.482GlyAla: 7.482 ± 1.081
1.768GlyCys: 1.768 ± 0.328
3.537GlyAsp: 3.537 ± 0.541
3.129GlyGlu: 3.129 ± 0.745
2.857GlyPhe: 2.857 ± 1.119
4.761GlyGly: 4.761 ± 0.917
1.905GlyHis: 1.905 ± 0.422
3.945GlyIle: 3.945 ± 0.443
4.217GlyLys: 4.217 ± 0.853
4.081GlyLeu: 4.081 ± 0.905
1.496GlyMet: 1.496 ± 0.406
3.265GlyAsn: 3.265 ± 0.636
3.945GlyPro: 3.945 ± 0.875
1.496GlyGln: 1.496 ± 0.52
3.129GlyArg: 3.129 ± 0.618
6.258GlySer: 6.258 ± 0.75
2.585GlyThr: 2.585 ± 0.408
8.842GlyVal: 8.842 ± 1.717
1.224GlyTrp: 1.224 ± 0.311
3.401GlyTyr: 3.401 ± 0.51
0.0GlyXaa: 0.0 ± 0.0
His
2.449HisAla: 2.449 ± 0.699
0.952HisCys: 0.952 ± 0.296
1.905HisAsp: 1.905 ± 0.456
1.088HisGlu: 1.088 ± 0.438
0.816HisPhe: 0.816 ± 0.686
3.673HisGly: 3.673 ± 0.504
1.496HisHis: 1.496 ± 0.722
1.088HisIle: 1.088 ± 0.29
1.224HisLys: 1.224 ± 0.319
2.721HisLeu: 2.721 ± 0.883
0.136HisMet: 0.136 ± 0.091
1.36HisAsn: 1.36 ± 0.532
1.768HisPro: 1.768 ± 0.982
1.088HisGln: 1.088 ± 0.708
1.496HisArg: 1.496 ± 0.522
1.36HisSer: 1.36 ± 0.497
1.632HisThr: 1.632 ± 0.275
2.313HisVal: 2.313 ± 0.564
0.952HisTrp: 0.952 ± 0.364
0.272HisTyr: 0.272 ± 0.182
0.0HisXaa: 0.0 ± 0.0
Ile
2.041IleAla: 2.041 ± 0.829
2.585IleCys: 2.585 ± 0.44
0.544IleAsp: 0.544 ± 0.232
1.224IleGlu: 1.224 ± 0.674
1.224IlePhe: 1.224 ± 0.378
3.401IleGly: 3.401 ± 0.662
0.952IleHis: 0.952 ± 0.325
1.36IleIle: 1.36 ± 0.946
1.768IleLys: 1.768 ± 0.337
3.401IleLeu: 3.401 ± 1.479
0.408IleMet: 0.408 ± 0.673
0.68IleAsn: 0.68 ± 0.21
2.993IlePro: 2.993 ± 0.489
1.496IleGln: 1.496 ± 0.392
2.177IleArg: 2.177 ± 0.383
3.401IleSer: 3.401 ± 0.911
2.449IleThr: 2.449 ± 0.493
3.673IleVal: 3.673 ± 0.978
1.224IleTrp: 1.224 ± 0.273
1.088IleTyr: 1.088 ± 0.298
0.0IleXaa: 0.0 ± 0.0
Lys
4.897LysAla: 4.897 ± 0.756
0.544LysCys: 0.544 ± 0.185
1.088LysAsp: 1.088 ± 0.25
1.088LysGlu: 1.088 ± 0.28
1.496LysPhe: 1.496 ± 0.366
2.721LysGly: 2.721 ± 0.766
1.496LysHis: 1.496 ± 0.294
1.496LysIle: 1.496 ± 0.47
0.544LysLys: 0.544 ± 0.278
5.033LysLeu: 5.033 ± 0.627
1.224LysMet: 1.224 ± 0.25
1.088LysAsn: 1.088 ± 0.264
1.905LysPro: 1.905 ± 0.503
1.36LysGln: 1.36 ± 0.409
1.905LysArg: 1.905 ± 0.661
2.313LysSer: 2.313 ± 0.359
2.857LysThr: 2.857 ± 0.776
2.721LysVal: 2.721 ± 0.538
0.952LysTrp: 0.952 ± 0.24
2.313LysTyr: 2.313 ± 0.998
0.0LysXaa: 0.0 ± 0.0
Leu
9.386LeuAla: 9.386 ± 0.73
3.537LeuCys: 3.537 ± 1.313
3.809LeuAsp: 3.809 ± 0.517
3.401LeuGlu: 3.401 ± 1.346
5.033LeuPhe: 5.033 ± 1.59
7.346LeuGly: 7.346 ± 1.337
2.993LeuHis: 2.993 ± 0.911
2.449LeuIle: 2.449 ± 0.352
3.537LeuLys: 3.537 ± 0.688
11.291LeuLeu: 11.291 ± 0.879
1.632LeuMet: 1.632 ± 0.701
3.129LeuAsn: 3.129 ± 0.796
6.394LeuPro: 6.394 ± 0.848
3.537LeuGln: 3.537 ± 0.502
5.577LeuArg: 5.577 ± 0.584
8.706LeuSer: 8.706 ± 0.665
7.754LeuThr: 7.754 ± 0.613
10.747LeuVal: 10.747 ± 1.206
1.088LeuTrp: 1.088 ± 0.339
2.313LeuTyr: 2.313 ± 1.2
0.0LeuXaa: 0.0 ± 0.0
Met
2.041MetAla: 2.041 ± 0.392
0.952MetCys: 0.952 ± 1.022
1.088MetAsp: 1.088 ± 0.365
0.272MetGlu: 0.272 ± 0.182
0.136MetPhe: 0.136 ± 0.091
0.816MetGly: 0.816 ± 0.514
0.68MetHis: 0.68 ± 0.293
1.088MetIle: 1.088 ± 0.779
0.272MetLys: 0.272 ± 0.27
2.449MetLeu: 2.449 ± 1.142
0.272MetMet: 0.272 ± 0.27
0.816MetAsn: 0.816 ± 0.204
0.952MetPro: 0.952 ± 0.26
0.68MetGln: 0.68 ± 0.438
1.905MetArg: 1.905 ± 0.414
1.224MetSer: 1.224 ± 0.423
0.136MetThr: 0.136 ± 0.091
1.768MetVal: 1.768 ± 0.472
0.272MetTrp: 0.272 ± 0.182
0.408MetTyr: 0.408 ± 0.421
0.0MetXaa: 0.0 ± 0.0
Asn
2.449AsnAla: 2.449 ± 0.796
1.496AsnCys: 1.496 ± 0.415
1.088AsnAsp: 1.088 ± 0.339
0.952AsnGlu: 0.952 ± 0.296
1.088AsnPhe: 1.088 ± 0.63
2.313AsnGly: 2.313 ± 0.423
0.544AsnHis: 0.544 ± 0.208
1.36AsnIle: 1.36 ± 1.242
1.496AsnLys: 1.496 ± 0.498
4.081AsnLeu: 4.081 ± 0.885
0.408AsnMet: 0.408 ± 0.133
2.313AsnAsn: 2.313 ± 0.304
2.041AsnPro: 2.041 ± 0.895
1.36AsnGln: 1.36 ± 0.463
1.496AsnArg: 1.496 ± 0.476
1.905AsnSer: 1.905 ± 0.742
1.768AsnThr: 1.768 ± 0.836
3.537AsnVal: 3.537 ± 0.708
0.408AsnTrp: 0.408 ± 0.236
1.36AsnTyr: 1.36 ± 0.272
0.0AsnXaa: 0.0 ± 0.0
Pro
5.577ProAla: 5.577 ± 1.152
2.313ProCys: 2.313 ± 0.594
2.585ProAsp: 2.585 ± 0.621
3.401ProGlu: 3.401 ± 0.544
2.993ProPhe: 2.993 ± 0.771
4.761ProGly: 4.761 ± 0.519
1.632ProHis: 1.632 ± 0.501
2.449ProIle: 2.449 ± 0.673
3.401ProLys: 3.401 ± 0.539
5.714ProLeu: 5.714 ± 0.659
0.816ProMet: 0.816 ± 0.414
1.224ProAsn: 1.224 ± 0.499
5.169ProPro: 5.169 ± 1.116
0.952ProGln: 0.952 ± 0.613
4.625ProArg: 4.625 ± 0.893
4.489ProSer: 4.489 ± 0.655
3.401ProThr: 3.401 ± 0.847
4.897ProVal: 4.897 ± 0.697
0.816ProTrp: 0.816 ± 0.268
1.496ProTyr: 1.496 ± 0.447
0.0ProXaa: 0.0 ± 0.0
Gln
3.537GlnAla: 3.537 ± 0.654
0.816GlnCys: 0.816 ± 0.265
1.088GlnAsp: 1.088 ± 0.311
0.952GlnGlu: 0.952 ± 0.258
0.68GlnPhe: 0.68 ± 0.549
3.537GlnGly: 3.537 ± 0.617
1.088GlnHis: 1.088 ± 0.276
0.68GlnIle: 0.68 ± 0.278
2.041GlnLys: 2.041 ± 0.478
2.721GlnLeu: 2.721 ± 0.592
0.272GlnMet: 0.272 ± 0.568
1.088GlnAsn: 1.088 ± 0.359
2.857GlnPro: 2.857 ± 1.07
0.0GlnGln: 0.0 ± 0.0
1.632GlnArg: 1.632 ± 0.351
1.088GlnSer: 1.088 ± 0.28
1.496GlnThr: 1.496 ± 0.47
2.313GlnVal: 2.313 ± 0.359
0.544GlnTrp: 0.544 ± 0.407
0.816GlnTyr: 0.816 ± 0.967
0.0GlnXaa: 0.0 ± 0.0
Arg
5.986ArgAla: 5.986 ± 1.024
2.041ArgCys: 2.041 ± 0.435
2.177ArgAsp: 2.177 ± 0.457
1.224ArgGlu: 1.224 ± 0.297
1.36ArgPhe: 1.36 ± 0.301
2.585ArgGly: 2.585 ± 0.631
1.496ArgHis: 1.496 ± 0.488
2.041ArgIle: 2.041 ± 0.568
1.224ArgLys: 1.224 ± 0.3
6.394ArgLeu: 6.394 ± 1.136
1.496ArgMet: 1.496 ± 0.365
1.496ArgAsn: 1.496 ± 0.308
2.585ArgPro: 2.585 ± 0.713
0.816ArgGln: 0.816 ± 0.555
2.993ArgArg: 2.993 ± 1.209
3.945ArgSer: 3.945 ± 0.569
3.673ArgThr: 3.673 ± 0.601
4.489ArgVal: 4.489 ± 1.037
1.36ArgTrp: 1.36 ± 0.666
2.313ArgTyr: 2.313 ± 1.224
0.0ArgXaa: 0.0 ± 0.0
Ser
6.122SerAla: 6.122 ± 1.403
4.081SerCys: 4.081 ± 0.544
2.993SerAsp: 2.993 ± 0.795
3.129SerGlu: 3.129 ± 0.883
3.265SerPhe: 3.265 ± 0.932
4.625SerGly: 4.625 ± 0.987
1.768SerHis: 1.768 ± 0.447
1.905SerIle: 1.905 ± 1.0
3.809SerLys: 3.809 ± 0.908
4.897SerLeu: 4.897 ± 1.539
1.632SerMet: 1.632 ± 0.722
1.632SerAsn: 1.632 ± 0.37
5.305SerPro: 5.305 ± 0.854
1.632SerGln: 1.632 ± 0.374
3.537SerArg: 3.537 ± 0.693
5.441SerSer: 5.441 ± 1.328
6.122SerThr: 6.122 ± 0.942
5.714SerVal: 5.714 ± 0.947
1.632SerTrp: 1.632 ± 0.503
2.177SerTyr: 2.177 ± 0.616
0.0SerXaa: 0.0 ± 0.0
Thr
6.258ThrAla: 6.258 ± 0.511
2.313ThrCys: 2.313 ± 0.523
2.177ThrAsp: 2.177 ± 0.534
2.313ThrGlu: 2.313 ± 0.634
2.177ThrPhe: 2.177 ± 0.451
4.217ThrGly: 4.217 ± 0.935
2.177ThrHis: 2.177 ± 0.728
2.585ThrIle: 2.585 ± 0.633
2.721ThrLys: 2.721 ± 0.467
6.666ThrLeu: 6.666 ± 1.561
1.088ThrMet: 1.088 ± 0.382
1.224ThrAsn: 1.224 ± 0.344
5.85ThrPro: 5.85 ± 0.74
1.496ThrGln: 1.496 ± 0.544
2.313ThrArg: 2.313 ± 0.521
3.265ThrSer: 3.265 ± 1.313
2.041ThrThr: 2.041 ± 0.549
4.081ThrVal: 4.081 ± 0.823
0.136ThrTrp: 0.136 ± 0.267
2.857ThrTyr: 2.857 ± 0.708
0.0ThrXaa: 0.0 ± 0.0
Val
7.618ValAla: 7.618 ± 0.923
2.585ValCys: 2.585 ± 0.503
5.986ValAsp: 5.986 ± 1.41
3.945ValGlu: 3.945 ± 0.669
4.353ValPhe: 4.353 ± 0.384
4.897ValGly: 4.897 ± 0.709
2.585ValHis: 2.585 ± 0.801
2.993ValIle: 2.993 ± 0.704
2.449ValLys: 2.449 ± 0.386
10.067ValLeu: 10.067 ± 0.91
1.632ValMet: 1.632 ± 0.449
4.081ValAsn: 4.081 ± 1.633
6.122ValPro: 6.122 ± 1.156
1.632ValGln: 1.632 ± 0.401
5.169ValArg: 5.169 ± 0.648
7.482ValSer: 7.482 ± 0.994
5.714ValThr: 5.714 ± 1.091
9.386ValVal: 9.386 ± 1.533
0.408ValTrp: 0.408 ± 0.334
2.585ValTyr: 2.585 ± 0.389
0.0ValXaa: 0.0 ± 0.0
Trp
1.768TrpAla: 1.768 ± 0.428
0.544TrpCys: 0.544 ± 0.416
0.408TrpAsp: 0.408 ± 0.274
0.272TrpGlu: 0.272 ± 0.093
0.816TrpPhe: 0.816 ± 0.278
0.816TrpGly: 0.816 ± 0.204
0.272TrpHis: 0.272 ± 0.093
0.408TrpIle: 0.408 ± 0.236
0.68TrpLys: 0.68 ± 0.21
1.632TrpLeu: 1.632 ± 0.49
0.0TrpMet: 0.0 ± 0.0
0.68TrpAsn: 0.68 ± 0.21
0.952TrpPro: 0.952 ± 0.247
0.952TrpGln: 0.952 ± 0.265
0.816TrpArg: 0.816 ± 0.235
1.224TrpSer: 1.224 ± 0.431
0.68TrpThr: 0.68 ± 0.246
0.68TrpVal: 0.68 ± 0.771
0.272TrpTrp: 0.272 ± 0.494
0.816TrpTyr: 0.816 ± 0.356
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.993TyrAla: 2.993 ± 0.534
1.36TyrCys: 1.36 ± 0.342
1.36TyrAsp: 1.36 ± 1.099
0.952TyrGlu: 0.952 ± 0.387
0.816TyrPhe: 0.816 ± 0.223
1.905TyrGly: 1.905 ± 0.391
1.496TyrHis: 1.496 ± 0.554
1.088TyrIle: 1.088 ± 1.034
0.544TyrLys: 0.544 ± 0.208
4.217TyrLeu: 4.217 ± 0.861
0.544TyrMet: 0.544 ± 0.396
1.768TyrAsn: 1.768 ± 0.328
1.632TyrPro: 1.632 ± 0.429
1.224TyrGln: 1.224 ± 0.452
1.632TyrArg: 1.632 ± 0.539
1.905TyrSer: 1.905 ± 0.474
2.177TyrThr: 2.177 ± 1.043
2.857TyrVal: 2.857 ± 0.42
1.088TyrTrp: 1.088 ± 0.359
1.632TyrTyr: 1.632 ± 0.452
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (7352 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski