Amino acid dipepetide frequency for Peste-des-petits-ruminants virus (PPRV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.088AlaAla: 3.088 ± 0.817
1.737AlaCys: 1.737 ± 0.372
2.895AlaAsp: 2.895 ± 0.964
3.281AlaGlu: 3.281 ± 1.094
1.93AlaPhe: 1.93 ± 0.398
3.86AlaGly: 3.86 ± 0.962
0.965AlaHis: 0.965 ± 0.466
4.632AlaIle: 4.632 ± 1.182
3.088AlaLys: 3.088 ± 0.651
4.825AlaLeu: 4.825 ± 1.162
0.965AlaMet: 0.965 ± 0.425
2.316AlaAsn: 2.316 ± 0.412
1.544AlaPro: 1.544 ± 0.566
2.895AlaGln: 2.895 ± 0.58
2.895AlaArg: 2.895 ± 0.469
5.404AlaSer: 5.404 ± 1.115
3.088AlaThr: 3.088 ± 0.477
4.439AlaVal: 4.439 ± 0.741
0.386AlaTrp: 0.386 ± 0.219
2.123AlaTyr: 2.123 ± 0.292
0.0AlaXaa: 0.0 ± 0.0
Cys
0.772CysAla: 0.772 ± 0.385
0.579CysCys: 0.579 ± 0.386
0.579CysAsp: 0.579 ± 0.266
1.158CysGlu: 1.158 ± 0.318
0.965CysPhe: 0.965 ± 0.238
0.772CysGly: 0.772 ± 0.507
0.579CysHis: 0.579 ± 0.258
0.965CysIle: 0.965 ± 0.453
0.579CysLys: 0.579 ± 0.344
1.544CysLeu: 1.544 ± 0.667
0.193CysMet: 0.193 ± 0.212
0.772CysAsn: 0.772 ± 0.397
1.351CysPro: 1.351 ± 0.421
0.965CysGln: 0.965 ± 0.453
0.772CysArg: 0.772 ± 0.415
1.351CysSer: 1.351 ± 0.326
0.772CysThr: 0.772 ± 0.244
1.158CysVal: 1.158 ± 0.57
0.0CysTrp: 0.0 ± 0.0
1.544CysTyr: 1.544 ± 0.454
0.0CysXaa: 0.0 ± 0.0
Asp
2.123AspAla: 2.123 ± 0.36
0.772AspCys: 0.772 ± 0.354
4.632AspAsp: 4.632 ± 1.509
2.702AspGlu: 2.702 ± 0.743
1.737AspPhe: 1.737 ± 0.528
1.93AspGly: 1.93 ± 0.527
2.123AspHis: 2.123 ± 0.846
4.053AspIle: 4.053 ± 0.718
3.281AspLys: 3.281 ± 0.672
7.528AspLeu: 7.528 ± 1.084
0.965AspMet: 0.965 ± 0.288
3.474AspAsn: 3.474 ± 0.522
4.632AspPro: 4.632 ± 0.867
1.544AspGln: 1.544 ± 0.288
3.088AspArg: 3.088 ± 0.679
4.632AspSer: 4.632 ± 1.478
2.316AspThr: 2.316 ± 0.332
5.404AspVal: 5.404 ± 1.221
0.386AspTrp: 0.386 ± 0.238
1.544AspTyr: 1.544 ± 0.339
0.0AspXaa: 0.0 ± 0.0
Glu
3.667GluAla: 3.667 ± 0.756
1.737GluCys: 1.737 ± 0.553
2.895GluAsp: 2.895 ± 0.82
3.474GluGlu: 3.474 ± 0.794
2.123GluPhe: 2.123 ± 0.716
5.211GluGly: 5.211 ± 1.813
1.351GluHis: 1.351 ± 0.513
5.597GluIle: 5.597 ± 1.255
3.667GluLys: 3.667 ± 0.736
6.176GluLeu: 6.176 ± 0.435
0.965GluMet: 0.965 ± 0.403
2.702GluAsn: 2.702 ± 0.65
1.351GluPro: 1.351 ± 0.413
0.579GluGln: 0.579 ± 0.258
2.509GluArg: 2.509 ± 1.282
4.825GluSer: 4.825 ± 1.019
4.439GluThr: 4.439 ± 1.405
4.825GluVal: 4.825 ± 0.834
0.772GluTrp: 0.772 ± 0.342
1.544GluTyr: 1.544 ± 0.425
0.0GluXaa: 0.0 ± 0.0
Phe
1.737PheAla: 1.737 ± 0.416
0.772PheCys: 0.772 ± 0.399
1.544PheAsp: 1.544 ± 0.283
1.544PheGlu: 1.544 ± 0.549
1.158PhePhe: 1.158 ± 0.583
1.93PheGly: 1.93 ± 0.571
0.579PheHis: 0.579 ± 0.249
2.895PheIle: 2.895 ± 0.894
1.737PheLys: 1.737 ± 0.436
2.895PheLeu: 2.895 ± 0.672
0.772PheMet: 0.772 ± 0.462
0.965PheAsn: 0.965 ± 0.37
0.965PhePro: 0.965 ± 0.641
0.579PheGln: 0.579 ± 0.282
3.281PheArg: 3.281 ± 0.894
2.509PheSer: 2.509 ± 0.737
1.544PheThr: 1.544 ± 0.431
2.702PheVal: 2.702 ± 0.94
0.772PheTrp: 0.772 ± 0.507
0.386PheTyr: 0.386 ± 0.283
0.0PheXaa: 0.0 ± 0.0
Gly
1.93GlyAla: 1.93 ± 0.663
1.158GlyCys: 1.158 ± 0.583
3.667GlyAsp: 3.667 ± 0.61
4.825GlyGlu: 4.825 ± 0.943
1.93GlyPhe: 1.93 ± 0.735
2.895GlyGly: 2.895 ± 0.514
1.158GlyHis: 1.158 ± 0.257
3.667GlyIle: 3.667 ± 0.809
1.93GlyLys: 1.93 ± 0.447
9.651GlyLeu: 9.651 ± 0.767
1.544GlyMet: 1.544 ± 0.688
3.088GlyAsn: 3.088 ± 1.367
2.509GlyPro: 2.509 ± 0.795
1.93GlyGln: 1.93 ± 0.819
4.439GlyArg: 4.439 ± 1.415
6.369GlySer: 6.369 ± 1.48
5.404GlyThr: 5.404 ± 1.293
5.597GlyVal: 5.597 ± 1.176
0.386GlyTrp: 0.386 ± 0.202
1.93GlyTyr: 1.93 ± 0.425
0.0GlyXaa: 0.0 ± 0.0
His
1.351HisAla: 1.351 ± 0.33
0.579HisCys: 0.579 ± 0.381
1.544HisAsp: 1.544 ± 0.655
1.544HisGlu: 1.544 ± 0.548
0.579HisPhe: 0.579 ± 0.258
1.351HisGly: 1.351 ± 0.46
0.579HisHis: 0.579 ± 0.279
1.544HisIle: 1.544 ± 0.552
1.351HisLys: 1.351 ± 0.792
2.123HisLeu: 2.123 ± 0.789
0.965HisMet: 0.965 ± 0.466
0.965HisAsn: 0.965 ± 0.443
2.123HisPro: 2.123 ± 0.846
1.351HisGln: 1.351 ± 0.421
2.123HisArg: 2.123 ± 0.619
0.772HisSer: 0.772 ± 0.354
0.579HisThr: 0.579 ± 0.24
1.93HisVal: 1.93 ± 0.625
0.193HisTrp: 0.193 ± 0.213
0.772HisTyr: 0.772 ± 0.507
0.0HisXaa: 0.0 ± 0.0
Ile
5.404IleAla: 5.404 ± 0.482
1.158IleCys: 1.158 ± 0.321
3.667IleAsp: 3.667 ± 0.247
5.79IleGlu: 5.79 ± 0.677
1.351IlePhe: 1.351 ± 0.38
4.246IleGly: 4.246 ± 0.782
1.737IleHis: 1.737 ± 0.459
4.246IleIle: 4.246 ± 0.8
5.211IleLys: 5.211 ± 0.747
8.107IleLeu: 8.107 ± 0.991
0.772IleMet: 0.772 ± 0.263
4.053IleAsn: 4.053 ± 0.395
2.702IlePro: 2.702 ± 0.831
3.474IleGln: 3.474 ± 0.816
4.053IleArg: 4.053 ± 0.491
5.983IleSer: 5.983 ± 1.192
5.018IleThr: 5.018 ± 1.066
2.702IleVal: 2.702 ± 0.561
0.0IleTrp: 0.0 ± 0.0
2.123IleTyr: 2.123 ± 0.688
0.0IleXaa: 0.0 ± 0.0
Lys
2.316LysAla: 2.316 ± 0.588
0.579LysCys: 0.579 ± 0.386
4.053LysAsp: 4.053 ± 0.791
3.474LysGlu: 3.474 ± 0.995
1.158LysPhe: 1.158 ± 0.469
4.439LysGly: 4.439 ± 0.637
1.158LysHis: 1.158 ± 0.273
3.86LysIle: 3.86 ± 0.935
3.281LysLys: 3.281 ± 0.687
4.825LysLeu: 4.825 ± 0.683
1.93LysMet: 1.93 ± 0.667
1.93LysAsn: 1.93 ± 0.409
1.544LysPro: 1.544 ± 0.513
1.737LysGln: 1.737 ± 0.762
2.895LysArg: 2.895 ± 0.453
5.79LysSer: 5.79 ± 1.261
2.123LysThr: 2.123 ± 1.13
2.702LysVal: 2.702 ± 0.566
0.0LysTrp: 0.0 ± 0.0
1.93LysTyr: 1.93 ± 0.572
0.0LysXaa: 0.0 ± 0.0
Leu
8.3LeuAla: 8.3 ± 0.733
1.351LeuCys: 1.351 ± 0.352
6.755LeuAsp: 6.755 ± 0.571
5.597LeuGlu: 5.597 ± 1.122
4.246LeuPhe: 4.246 ± 0.611
6.369LeuGly: 6.369 ± 1.003
2.895LeuHis: 2.895 ± 0.772
5.79LeuIle: 5.79 ± 1.452
7.141LeuLys: 7.141 ± 1.203
8.879LeuLeu: 8.879 ± 1.029
2.702LeuMet: 2.702 ± 0.458
4.053LeuAsn: 4.053 ± 0.817
2.316LeuPro: 2.316 ± 0.756
2.123LeuGln: 2.123 ± 0.457
6.369LeuArg: 6.369 ± 0.834
9.265LeuSer: 9.265 ± 1.163
8.107LeuThr: 8.107 ± 1.452
6.369LeuVal: 6.369 ± 1.113
0.772LeuTrp: 0.772 ± 0.302
3.667LeuTyr: 3.667 ± 0.587
0.0LeuXaa: 0.0 ± 0.0
Met
1.351MetAla: 1.351 ± 0.41
0.0MetCys: 0.0 ± 0.0
0.965MetAsp: 0.965 ± 0.328
0.386MetGlu: 0.386 ± 0.3
0.579MetPhe: 0.579 ± 0.291
1.737MetGly: 1.737 ± 0.439
0.193MetHis: 0.193 ± 0.127
2.702MetIle: 2.702 ± 0.658
0.965MetLys: 0.965 ± 0.331
1.93MetLeu: 1.93 ± 0.525
0.386MetMet: 0.386 ± 0.238
1.158MetAsn: 1.158 ± 0.251
1.158MetPro: 1.158 ± 0.251
0.193MetGln: 0.193 ± 0.208
0.579MetArg: 0.579 ± 0.381
2.895MetSer: 2.895 ± 0.548
2.316MetThr: 2.316 ± 0.873
1.158MetVal: 1.158 ± 0.655
0.386MetTrp: 0.386 ± 0.238
1.351MetTyr: 1.351 ± 0.525
0.0MetXaa: 0.0 ± 0.0
Asn
2.316AsnAla: 2.316 ± 0.582
0.772AsnCys: 0.772 ± 0.385
2.123AsnAsp: 2.123 ± 0.859
2.123AsnGlu: 2.123 ± 0.425
1.158AsnPhe: 1.158 ± 0.307
2.702AsnGly: 2.702 ± 0.93
1.158AsnHis: 1.158 ± 0.368
3.474AsnIle: 3.474 ± 0.699
2.509AsnLys: 2.509 ± 0.575
3.667AsnLeu: 3.667 ± 0.565
1.158AsnMet: 1.158 ± 0.565
1.158AsnAsn: 1.158 ± 0.272
3.474AsnPro: 3.474 ± 0.751
1.737AsnGln: 1.737 ± 0.377
1.544AsnArg: 1.544 ± 0.429
2.702AsnSer: 2.702 ± 0.731
2.509AsnThr: 2.509 ± 0.47
0.772AsnVal: 0.772 ± 0.25
1.351AsnTrp: 1.351 ± 0.256
1.93AsnTyr: 1.93 ± 0.436
0.0AsnXaa: 0.0 ± 0.0
Pro
2.316ProAla: 2.316 ± 0.623
0.0ProCys: 0.0 ± 0.0
5.018ProAsp: 5.018 ± 1.903
2.509ProGlu: 2.509 ± 0.54
0.579ProPhe: 0.579 ± 0.24
2.895ProGly: 2.895 ± 0.756
0.772ProHis: 0.772 ± 0.598
3.281ProIle: 3.281 ± 0.467
2.123ProLys: 2.123 ± 0.564
4.439ProLeu: 4.439 ± 1.411
1.351ProMet: 1.351 ± 0.354
1.544ProAsn: 1.544 ± 0.451
3.088ProPro: 3.088 ± 0.925
1.351ProGln: 1.351 ± 0.905
4.053ProArg: 4.053 ± 1.122
4.825ProSer: 4.825 ± 1.034
3.088ProThr: 3.088 ± 1.223
2.316ProVal: 2.316 ± 0.446
0.772ProTrp: 0.772 ± 0.397
2.123ProTyr: 2.123 ± 0.864
0.0ProXaa: 0.0 ± 0.0
Gln
2.895GlnAla: 2.895 ± 1.269
0.579GlnCys: 0.579 ± 0.381
1.737GlnAsp: 1.737 ± 0.479
2.509GlnGlu: 2.509 ± 1.327
0.579GlnPhe: 0.579 ± 0.392
1.93GlnGly: 1.93 ± 0.342
0.386GlnHis: 0.386 ± 0.202
2.123GlnIle: 2.123 ± 0.706
0.965GlnLys: 0.965 ± 0.285
1.93GlnLeu: 1.93 ± 0.664
0.965GlnMet: 0.965 ± 0.503
1.737GlnAsn: 1.737 ± 0.525
1.544GlnPro: 1.544 ± 0.485
0.965GlnGln: 0.965 ± 0.373
2.702GlnArg: 2.702 ± 0.517
3.088GlnSer: 3.088 ± 0.462
1.544GlnThr: 1.544 ± 0.438
2.509GlnVal: 2.509 ± 0.504
0.386GlnTrp: 0.386 ± 0.254
0.193GlnTyr: 0.193 ± 0.127
0.0GlnXaa: 0.0 ± 0.0
Arg
3.281ArgAla: 3.281 ± 0.879
1.351ArgCys: 1.351 ± 0.268
3.281ArgAsp: 3.281 ± 0.546
4.053ArgGlu: 4.053 ± 0.759
1.93ArgPhe: 1.93 ± 0.363
6.176ArgGly: 6.176 ± 1.243
1.351ArgHis: 1.351 ± 0.46
3.474ArgIle: 3.474 ± 1.098
1.158ArgLys: 1.158 ± 0.486
6.755ArgLeu: 6.755 ± 1.293
1.351ArgMet: 1.351 ± 0.581
1.544ArgAsn: 1.544 ± 0.28
2.123ArgPro: 2.123 ± 0.743
1.158ArgGln: 1.158 ± 0.595
3.86ArgArg: 3.86 ± 0.987
6.369ArgSer: 6.369 ± 1.067
2.702ArgThr: 2.702 ± 0.488
3.86ArgVal: 3.86 ± 0.749
0.386ArgTrp: 0.386 ± 0.28
2.895ArgTyr: 2.895 ± 0.682
0.0ArgXaa: 0.0 ± 0.0
Ser
4.825SerAla: 4.825 ± 1.192
1.737SerCys: 1.737 ± 0.583
4.632SerAsp: 4.632 ± 1.345
4.825SerGlu: 4.825 ± 1.181
2.509SerPhe: 2.509 ± 0.951
6.948SerGly: 6.948 ± 1.458
3.474SerHis: 3.474 ± 0.706
6.369SerIle: 6.369 ± 0.846
4.632SerLys: 4.632 ± 1.461
9.651SerLeu: 9.651 ± 1.241
1.544SerMet: 1.544 ± 0.672
2.509SerAsn: 2.509 ± 0.945
4.246SerPro: 4.246 ± 0.791
2.895SerGln: 2.895 ± 0.743
4.053SerArg: 4.053 ± 0.591
5.79SerSer: 5.79 ± 1.022
5.597SerThr: 5.597 ± 0.874
5.404SerVal: 5.404 ± 1.16
1.544SerTrp: 1.544 ± 0.641
3.088SerTyr: 3.088 ± 0.854
0.0SerXaa: 0.0 ± 0.0
Thr
4.246ThrAla: 4.246 ± 0.754
0.579ThrCys: 0.579 ± 0.291
3.088ThrAsp: 3.088 ± 0.58
3.86ThrGlu: 3.86 ± 0.435
1.737ThrPhe: 1.737 ± 0.471
4.439ThrGly: 4.439 ± 0.803
0.965ThrHis: 0.965 ± 0.341
6.176ThrIle: 6.176 ± 0.672
2.509ThrLys: 2.509 ± 0.826
6.562ThrLeu: 6.562 ± 1.517
1.351ThrMet: 1.351 ± 0.71
2.509ThrAsn: 2.509 ± 0.653
3.86ThrPro: 3.86 ± 1.067
2.702ThrGln: 2.702 ± 0.724
3.667ThrArg: 3.667 ± 0.736
3.474ThrSer: 3.474 ± 0.926
3.088ThrThr: 3.088 ± 0.681
3.86ThrVal: 3.86 ± 1.095
0.386ThrTrp: 0.386 ± 0.254
1.93ThrTyr: 1.93 ± 0.556
0.0ThrXaa: 0.0 ± 0.0
Val
3.088ValAla: 3.088 ± 1.029
0.579ValCys: 0.579 ± 0.254
2.316ValAsp: 2.316 ± 0.607
4.632ValGlu: 4.632 ± 1.004
2.316ValPhe: 2.316 ± 0.67
4.246ValGly: 4.246 ± 0.874
0.965ValHis: 0.965 ± 0.379
4.632ValIle: 4.632 ± 0.584
3.667ValLys: 3.667 ± 1.105
5.79ValLeu: 5.79 ± 1.063
1.544ValMet: 1.544 ± 0.493
1.737ValAsn: 1.737 ± 0.496
4.825ValPro: 4.825 ± 0.914
2.316ValGln: 2.316 ± 0.6
3.474ValArg: 3.474 ± 0.557
5.018ValSer: 5.018 ± 1.133
5.018ValThr: 5.018 ± 0.701
3.86ValVal: 3.86 ± 0.665
0.772ValTrp: 0.772 ± 0.648
3.474ValTyr: 3.474 ± 0.683
0.0ValXaa: 0.0 ± 0.0
Trp
0.772TrpAla: 0.772 ± 0.507
0.386TrpCys: 0.386 ± 0.425
0.772TrpAsp: 0.772 ± 0.378
0.193TrpGlu: 0.193 ± 0.221
0.965TrpPhe: 0.965 ± 0.477
0.386TrpGly: 0.386 ± 0.192
0.193TrpHis: 0.193 ± 0.26
0.193TrpIle: 0.193 ± 0.207
0.193TrpLys: 0.193 ± 0.127
0.772TrpLeu: 0.772 ± 0.252
0.193TrpMet: 0.193 ± 0.127
0.193TrpAsn: 0.193 ± 0.127
0.386TrpPro: 0.386 ± 0.254
0.0TrpGln: 0.0 ± 0.0
1.544TrpArg: 1.544 ± 0.565
0.965TrpSer: 0.965 ± 0.323
0.386TrpThr: 0.386 ± 0.238
0.772TrpVal: 0.772 ± 0.319
0.0TrpTrp: 0.0 ± 0.0
0.386TrpTyr: 0.386 ± 0.295
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.579TyrAla: 0.579 ± 0.227
0.965TyrCys: 0.965 ± 0.488
2.702TyrAsp: 2.702 ± 0.675
1.93TyrGlu: 1.93 ± 0.725
1.737TyrPhe: 1.737 ± 0.545
1.544TyrGly: 1.544 ± 0.402
1.737TyrHis: 1.737 ± 0.7
2.123TyrIle: 2.123 ± 0.48
1.737TyrLys: 1.737 ± 0.54
5.018TyrLeu: 5.018 ± 2.08
0.579TyrMet: 0.579 ± 0.435
1.93TyrAsn: 1.93 ± 0.381
2.702TyrPro: 2.702 ± 0.762
0.772TyrGln: 0.772 ± 0.365
1.158TyrArg: 1.158 ± 0.273
4.439TyrSer: 4.439 ± 1.421
1.351TyrThr: 1.351 ± 0.534
1.93TyrVal: 1.93 ± 0.826
0.0TyrTrp: 0.0 ± 0.0
1.158TyrTyr: 1.158 ± 0.272
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (5182 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski