Amino acid dipepetide frequency for Avian paraavulavirus 3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.734AlaAla: 5.734 ± 0.954
1.103AlaCys: 1.103 ± 0.432
3.308AlaAsp: 3.308 ± 0.66
1.985AlaGlu: 1.985 ± 0.294
1.764AlaPhe: 1.764 ± 0.596
3.308AlaGly: 3.308 ± 1.148
1.764AlaHis: 1.764 ± 0.52
5.073AlaIle: 5.073 ± 0.66
2.426AlaLys: 2.426 ± 0.724
6.396AlaLeu: 6.396 ± 2.147
1.764AlaMet: 1.764 ± 1.577
2.647AlaAsn: 2.647 ± 0.615
2.426AlaPro: 2.426 ± 0.609
2.647AlaGln: 2.647 ± 1.003
1.764AlaArg: 1.764 ± 0.387
6.176AlaSer: 6.176 ± 1.263
3.97AlaThr: 3.97 ± 0.325
3.529AlaVal: 3.529 ± 0.939
0.221AlaTrp: 0.221 ± 0.243
2.647AlaTyr: 2.647 ± 0.6
0.221AlaXaa: 0.221 ± 0.135
Cys
0.441CysAla: 0.441 ± 0.221
0.882CysCys: 0.882 ± 0.54
1.323CysAsp: 1.323 ± 0.276
1.103CysGlu: 1.103 ± 0.482
1.323CysPhe: 1.323 ± 0.31
0.662CysGly: 0.662 ± 0.279
1.544CysHis: 1.544 ± 0.578
2.867CysIle: 2.867 ± 0.851
0.882CysLys: 0.882 ± 0.532
1.985CysLeu: 1.985 ± 0.536
0.662CysMet: 0.662 ± 0.639
0.441CysAsn: 0.441 ± 0.224
0.882CysPro: 0.882 ± 0.447
1.985CysGln: 1.985 ± 0.475
1.764CysArg: 1.764 ± 0.502
1.544CysSer: 1.544 ± 0.538
1.764CysThr: 1.764 ± 0.722
1.103CysVal: 1.103 ± 0.309
0.441CysTrp: 0.441 ± 0.224
0.662CysTyr: 0.662 ± 0.273
0.0CysXaa: 0.0 ± 0.0
Asp
3.088AspAla: 3.088 ± 0.692
1.544AspCys: 1.544 ± 0.424
2.426AspAsp: 2.426 ± 0.646
2.867AspGlu: 2.867 ± 1.186
0.882AspPhe: 0.882 ± 0.741
1.764AspGly: 1.764 ± 0.736
1.103AspHis: 1.103 ± 0.482
3.749AspIle: 3.749 ± 0.503
1.764AspLys: 1.764 ± 0.744
5.073AspLeu: 5.073 ± 1.031
1.103AspMet: 1.103 ± 1.185
3.088AspAsn: 3.088 ± 0.585
3.529AspPro: 3.529 ± 0.954
2.647AspGln: 2.647 ± 0.478
2.647AspArg: 2.647 ± 0.838
4.191AspSer: 4.191 ± 0.723
4.411AspThr: 4.411 ± 1.189
2.647AspVal: 2.647 ± 0.407
0.0AspTrp: 0.0 ± 0.0
1.544AspTyr: 1.544 ± 0.698
0.0AspXaa: 0.0 ± 0.0
Glu
2.867GluAla: 2.867 ± 0.709
0.441GluCys: 0.441 ± 0.27
1.544GluAsp: 1.544 ± 0.51
1.985GluGlu: 1.985 ± 0.574
2.426GluPhe: 2.426 ± 0.84
1.985GluGly: 1.985 ± 0.581
0.441GluHis: 0.441 ± 0.224
2.206GluIle: 2.206 ± 0.52
1.985GluLys: 1.985 ± 0.628
7.278GluLeu: 7.278 ± 0.892
0.882GluMet: 0.882 ± 0.395
1.103GluAsn: 1.103 ± 0.301
3.308GluPro: 3.308 ± 1.143
1.544GluGln: 1.544 ± 0.469
0.882GluArg: 0.882 ± 0.269
5.514GluSer: 5.514 ± 0.936
2.647GluThr: 2.647 ± 0.48
2.867GluVal: 2.867 ± 0.991
0.441GluTrp: 0.441 ± 0.27
0.662GluTyr: 0.662 ± 0.256
0.0GluXaa: 0.0 ± 0.0
Phe
0.882PheAla: 0.882 ± 0.335
0.662PheCys: 0.662 ± 0.247
1.103PheAsp: 1.103 ± 0.482
1.764PheGlu: 1.764 ± 0.835
1.103PhePhe: 1.103 ± 0.435
2.206PheGly: 2.206 ± 0.464
0.662PheHis: 0.662 ± 0.247
3.308PheIle: 3.308 ± 0.59
1.985PheLys: 1.985 ± 0.477
4.632PheLeu: 4.632 ± 1.0
0.441PheMet: 0.441 ± 0.235
2.867PheAsn: 2.867 ± 1.163
1.985PhePro: 1.985 ± 0.599
1.323PheGln: 1.323 ± 0.39
1.985PheArg: 1.985 ± 0.627
2.867PheSer: 2.867 ± 1.04
1.985PheThr: 1.985 ± 0.993
1.103PheVal: 1.103 ± 0.325
0.0PheTrp: 0.0 ± 0.0
1.544PheTyr: 1.544 ± 0.613
0.221PheXaa: 0.221 ± 0.243
Gly
2.647GlyAla: 2.647 ± 1.182
1.103GlyCys: 1.103 ± 0.453
3.529GlyAsp: 3.529 ± 0.83
1.103GlyGlu: 1.103 ± 0.516
2.426GlyPhe: 2.426 ± 0.824
2.426GlyGly: 2.426 ± 0.726
0.662GlyHis: 0.662 ± 0.405
3.529GlyIle: 3.529 ± 0.642
1.544GlyLys: 1.544 ± 0.52
6.396GlyLeu: 6.396 ± 0.721
1.323GlyMet: 1.323 ± 0.516
2.426GlyAsn: 2.426 ± 0.541
1.323GlyPro: 1.323 ± 0.567
2.426GlyGln: 2.426 ± 0.726
2.647GlyArg: 2.647 ± 0.521
5.955GlySer: 5.955 ± 1.021
2.867GlyThr: 2.867 ± 1.437
3.529GlyVal: 3.529 ± 1.071
0.221GlyTrp: 0.221 ± 0.135
1.323GlyTyr: 1.323 ± 0.391
0.0GlyXaa: 0.0 ± 0.0
His
2.647HisAla: 2.647 ± 0.273
0.882HisCys: 0.882 ± 0.338
1.544HisAsp: 1.544 ± 0.481
1.103HisGlu: 1.103 ± 0.325
1.103HisPhe: 1.103 ± 0.675
0.882HisGly: 0.882 ± 0.48
0.882HisHis: 0.882 ± 0.372
2.206HisIle: 2.206 ± 0.612
1.103HisLys: 1.103 ± 0.611
2.867HisLeu: 2.867 ± 0.776
0.662HisMet: 0.662 ± 0.349
1.323HisAsn: 1.323 ± 0.421
1.323HisPro: 1.323 ± 0.421
0.221HisGln: 0.221 ± 0.285
0.441HisArg: 0.441 ± 0.24
2.206HisSer: 2.206 ± 0.375
1.103HisThr: 1.103 ± 0.675
1.544HisVal: 1.544 ± 0.301
0.221HisTrp: 0.221 ± 0.249
0.662HisTyr: 0.662 ± 0.256
0.0HisXaa: 0.0 ± 0.0
Ile
4.852IleAla: 4.852 ± 1.549
1.764IleCys: 1.764 ± 0.447
3.088IleAsp: 3.088 ± 0.805
4.632IleGlu: 4.632 ± 0.629
3.088IlePhe: 3.088 ± 0.883
3.529IleGly: 3.529 ± 1.002
1.103IleHis: 1.103 ± 0.314
6.837IleIle: 6.837 ± 2.357
3.088IleLys: 3.088 ± 0.903
8.822IleLeu: 8.822 ± 1.866
0.882IleMet: 0.882 ± 0.372
3.97IleAsn: 3.97 ± 0.561
4.411IlePro: 4.411 ± 1.354
3.529IleGln: 3.529 ± 0.929
5.293IleArg: 5.293 ± 1.269
8.602IleSer: 8.602 ± 1.747
6.176IleThr: 6.176 ± 1.205
3.529IleVal: 3.529 ± 1.028
0.662IleTrp: 0.662 ± 0.349
3.088IleTyr: 3.088 ± 1.043
0.0IleXaa: 0.0 ± 0.0
Lys
2.426LysAla: 2.426 ± 0.519
0.441LysCys: 0.441 ± 0.27
2.647LysAsp: 2.647 ± 0.952
2.206LysGlu: 2.206 ± 0.664
0.662LysPhe: 0.662 ± 0.405
2.206LysGly: 2.206 ± 0.948
0.882LysHis: 0.882 ± 0.48
1.544LysIle: 1.544 ± 0.367
1.544LysLys: 1.544 ± 1.307
5.073LysLeu: 5.073 ± 1.331
0.662LysMet: 0.662 ± 0.35
1.985LysAsn: 1.985 ± 0.57
0.882LysPro: 0.882 ± 0.282
2.426LysGln: 2.426 ± 0.541
2.426LysArg: 2.426 ± 0.418
3.749LysSer: 3.749 ± 0.842
3.529LysThr: 3.529 ± 1.162
3.97LysVal: 3.97 ± 0.953
0.662LysTrp: 0.662 ± 0.247
1.764LysTyr: 1.764 ± 0.62
0.0LysXaa: 0.0 ± 0.0
Leu
8.822LeuAla: 8.822 ± 1.037
3.088LeuCys: 3.088 ± 1.229
4.411LeuAsp: 4.411 ± 1.124
5.293LeuGlu: 5.293 ± 1.113
4.632LeuPhe: 4.632 ± 0.685
6.617LeuGly: 6.617 ± 1.786
2.647LeuHis: 2.647 ± 0.359
9.704LeuIle: 9.704 ± 1.816
5.073LeuLys: 5.073 ± 1.402
13.895LeuLeu: 13.895 ± 2.08
2.206LeuMet: 2.206 ± 0.864
7.94LeuAsn: 7.94 ± 1.395
3.97LeuPro: 3.97 ± 0.979
3.97LeuGln: 3.97 ± 1.058
4.191LeuArg: 4.191 ± 0.622
10.807LeuSer: 10.807 ± 2.108
9.484LeuThr: 9.484 ± 0.714
4.411LeuVal: 4.411 ± 1.262
0.662LeuTrp: 0.662 ± 0.247
3.308LeuTyr: 3.308 ± 0.719
0.0LeuXaa: 0.0 ± 0.0
Met
1.985MetAla: 1.985 ± 1.226
0.441MetCys: 0.441 ± 0.221
0.441MetAsp: 0.441 ± 0.24
0.882MetGlu: 0.882 ± 0.253
0.662MetPhe: 0.662 ± 0.504
1.103MetGly: 1.103 ± 0.432
0.662MetHis: 0.662 ± 0.349
1.985MetIle: 1.985 ± 0.984
0.0MetLys: 0.0 ± 0.0
1.323MetLeu: 1.323 ± 0.359
0.882MetMet: 0.882 ± 0.514
0.0MetAsn: 0.0 ± 0.0
0.882MetPro: 0.882 ± 0.335
1.544MetGln: 1.544 ± 0.748
2.206MetArg: 2.206 ± 0.746
0.882MetSer: 0.882 ± 0.447
1.764MetThr: 1.764 ± 0.493
1.323MetVal: 1.323 ± 0.7
0.221MetTrp: 0.221 ± 0.135
1.544MetTyr: 1.544 ± 0.487
0.0MetXaa: 0.0 ± 0.0
Asn
2.206AsnAla: 2.206 ± 0.455
1.764AsnCys: 1.764 ± 0.396
3.529AsnAsp: 3.529 ± 0.734
1.764AsnGlu: 1.764 ± 0.502
1.985AsnPhe: 1.985 ± 0.584
1.764AsnGly: 1.764 ± 0.203
1.544AsnHis: 1.544 ± 0.484
5.073AsnIle: 5.073 ± 0.741
1.323AsnLys: 1.323 ± 0.319
7.94AsnLeu: 7.94 ± 1.602
0.441AsnMet: 0.441 ± 0.419
3.308AsnAsn: 3.308 ± 0.891
3.308AsnPro: 3.308 ± 0.232
2.206AsnGln: 2.206 ± 0.701
2.867AsnArg: 2.867 ± 0.446
3.308AsnSer: 3.308 ± 0.885
2.206AsnThr: 2.206 ± 0.808
1.323AsnVal: 1.323 ± 0.265
0.662AsnTrp: 0.662 ± 0.405
1.985AsnTyr: 1.985 ± 0.444
0.0AsnXaa: 0.0 ± 0.0
Pro
3.088ProAla: 3.088 ± 0.704
0.221ProCys: 0.221 ± 0.249
1.764ProAsp: 1.764 ± 0.501
2.647ProGlu: 2.647 ± 0.69
1.985ProPhe: 1.985 ± 0.448
2.867ProGly: 2.867 ± 1.173
1.985ProHis: 1.985 ± 0.466
3.749ProIle: 3.749 ± 0.921
2.867ProLys: 2.867 ± 0.953
4.632ProLeu: 4.632 ± 0.804
1.103ProMet: 1.103 ± 0.412
2.867ProAsn: 2.867 ± 0.647
1.544ProPro: 1.544 ± 0.328
2.206ProGln: 2.206 ± 0.599
1.764ProArg: 1.764 ± 0.359
5.955ProSer: 5.955 ± 1.78
4.191ProThr: 4.191 ± 1.382
1.764ProVal: 1.764 ± 0.814
0.221ProTrp: 0.221 ± 0.135
1.103ProTyr: 1.103 ± 0.309
0.0ProXaa: 0.0 ± 0.0
Gln
2.426GlnAla: 2.426 ± 0.54
0.882GlnCys: 0.882 ± 0.335
2.426GlnAsp: 2.426 ± 1.05
1.323GlnGlu: 1.323 ± 0.484
1.103GlnPhe: 1.103 ± 0.472
2.206GlnGly: 2.206 ± 0.29
1.764GlnHis: 1.764 ± 0.68
3.529GlnIle: 3.529 ± 0.925
2.426GlnLys: 2.426 ± 0.821
6.617GlnLeu: 6.617 ± 1.591
0.882GlnMet: 0.882 ± 0.249
2.206GlnAsn: 2.206 ± 0.658
1.764GlnPro: 1.764 ± 0.441
2.206GlnGln: 2.206 ± 0.547
1.985GlnArg: 1.985 ± 0.507
4.852GlnSer: 4.852 ± 1.425
1.323GlnThr: 1.323 ± 0.528
3.749GlnVal: 3.749 ± 0.435
0.0GlnTrp: 0.0 ± 0.0
1.985GlnTyr: 1.985 ± 0.464
0.0GlnXaa: 0.0 ± 0.0
Arg
2.867ArgAla: 2.867 ± 0.577
1.544ArgCys: 1.544 ± 0.52
3.308ArgAsp: 3.308 ± 0.59
1.985ArgGlu: 1.985 ± 0.993
2.426ArgPhe: 2.426 ± 1.451
1.544ArgGly: 1.544 ± 0.231
1.323ArgHis: 1.323 ± 0.597
4.852ArgIle: 4.852 ± 0.712
2.647ArgLys: 2.647 ± 0.622
6.396ArgLeu: 6.396 ± 0.698
1.323ArgMet: 1.323 ± 0.433
2.426ArgAsn: 2.426 ± 0.37
1.103ArgPro: 1.103 ± 0.72
1.985ArgGln: 1.985 ± 0.945
3.308ArgArg: 3.308 ± 1.145
3.529ArgSer: 3.529 ± 0.438
3.308ArgThr: 3.308 ± 0.733
1.544ArgVal: 1.544 ± 0.545
0.441ArgTrp: 0.441 ± 0.235
1.323ArgTyr: 1.323 ± 0.562
0.0ArgXaa: 0.0 ± 0.0
Ser
4.632SerAla: 4.632 ± 1.47
2.206SerCys: 2.206 ± 0.643
4.411SerAsp: 4.411 ± 1.077
4.852SerGlu: 4.852 ± 0.58
1.985SerPhe: 1.985 ± 0.56
5.073SerGly: 5.073 ± 1.15
2.206SerHis: 2.206 ± 0.449
7.719SerIle: 7.719 ± 1.184
4.411SerLys: 4.411 ± 0.665
10.146SerLeu: 10.146 ± 0.699
1.103SerMet: 1.103 ± 0.297
5.073SerAsn: 5.073 ± 0.93
5.955SerPro: 5.955 ± 1.176
3.749SerGln: 3.749 ± 1.909
4.852SerArg: 4.852 ± 1.354
9.925SerSer: 9.925 ± 2.768
6.617SerThr: 6.617 ± 1.079
5.293SerVal: 5.293 ± 0.544
1.103SerTrp: 1.103 ± 0.309
3.308SerTyr: 3.308 ± 0.679
0.0SerXaa: 0.0 ± 0.0
Thr
2.867ThrAla: 2.867 ± 1.252
2.206ThrCys: 2.206 ± 0.864
4.632ThrAsp: 4.632 ± 1.265
2.647ThrGlu: 2.647 ± 1.136
1.764ThrPhe: 1.764 ± 0.622
4.852ThrGly: 4.852 ± 1.441
1.103ThrHis: 1.103 ± 0.543
7.278ThrIle: 7.278 ± 0.942
2.867ThrLys: 2.867 ± 0.555
6.176ThrLeu: 6.176 ± 0.98
1.985ThrMet: 1.985 ± 0.474
2.426ThrAsn: 2.426 ± 0.381
3.749ThrPro: 3.749 ± 1.041
5.514ThrGln: 5.514 ± 0.585
3.749ThrArg: 3.749 ± 0.45
4.191ThrSer: 4.191 ± 0.711
6.837ThrThr: 6.837 ± 0.76
3.308ThrVal: 3.308 ± 0.48
0.662ThrTrp: 0.662 ± 0.405
2.206ThrTyr: 2.206 ± 0.503
0.0ThrXaa: 0.0 ± 0.0
Val
3.97ValAla: 3.97 ± 0.784
1.544ValCys: 1.544 ± 0.683
2.206ValAsp: 2.206 ± 0.562
1.985ValGlu: 1.985 ± 0.935
1.764ValPhe: 1.764 ± 0.203
1.985ValGly: 1.985 ± 0.335
0.882ValHis: 0.882 ± 0.335
3.308ValIle: 3.308 ± 1.181
2.426ValLys: 2.426 ± 0.438
4.632ValLeu: 4.632 ± 0.853
1.323ValMet: 1.323 ± 0.265
2.647ValAsn: 2.647 ± 0.766
4.411ValPro: 4.411 ± 1.0
1.323ValGln: 1.323 ± 0.272
1.985ValArg: 1.985 ± 0.628
6.396ValSer: 6.396 ± 1.217
4.411ValThr: 4.411 ± 0.404
3.088ValVal: 3.088 ± 0.772
0.882ValTrp: 0.882 ± 0.266
0.882ValTyr: 0.882 ± 0.368
0.0ValXaa: 0.0 ± 0.0
Trp
0.662TrpAla: 0.662 ± 0.247
0.221TrpCys: 0.221 ± 0.249
0.221TrpAsp: 0.221 ± 0.135
0.0TrpGlu: 0.0 ± 0.0
0.441TrpPhe: 0.441 ± 0.27
0.441TrpGly: 0.441 ± 0.207
0.441TrpHis: 0.441 ± 0.221
1.103TrpIle: 1.103 ± 0.442
0.221TrpLys: 0.221 ± 0.248
0.441TrpLeu: 0.441 ± 0.327
0.0TrpMet: 0.0 ± 0.0
0.221TrpAsn: 0.221 ± 0.285
0.662TrpPro: 0.662 ± 0.454
0.0TrpGln: 0.0 ± 0.0
0.441TrpArg: 0.441 ± 0.27
0.441TrpSer: 0.441 ± 0.27
0.662TrpThr: 0.662 ± 0.405
0.662TrpVal: 0.662 ± 0.405
0.221TrpTrp: 0.221 ± 0.249
0.441TrpTyr: 0.441 ± 0.207
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.764TyrAla: 1.764 ± 0.973
1.323TyrCys: 1.323 ± 0.421
2.206TyrAsp: 2.206 ± 0.607
0.882TyrGlu: 0.882 ± 0.335
1.103TyrPhe: 1.103 ± 0.48
1.985TyrGly: 1.985 ± 0.335
1.323TyrHis: 1.323 ± 0.332
1.323TyrIle: 1.323 ± 0.265
1.323TyrLys: 1.323 ± 0.562
4.411TyrLeu: 4.411 ± 1.094
0.882TyrMet: 0.882 ± 0.235
1.544TyrAsn: 1.544 ± 0.39
0.882TyrPro: 0.882 ± 0.457
2.206TyrGln: 2.206 ± 0.663
1.985TyrArg: 1.985 ± 1.124
3.529TyrSer: 3.529 ± 0.97
1.764TyrThr: 1.764 ± 0.886
1.544TyrVal: 1.544 ± 0.464
0.0TyrTrp: 0.0 ± 0.0
2.426TyrTyr: 2.426 ± 0.884
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.221XaaPro: 0.221 ± 0.243
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.221XaaVal: 0.221 ± 0.135
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (4535 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski