Amino acid dipeptide frequency for Penguin siadenovirus A

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random and uniformly in proteins, each one would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
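The calculation described in the paragraph above can be sketched in Python. This is only an illustrative sketch, not the code used by the database: the function names (`dipeptide_permille`, `classify`), the use of overlapping residue pairs, the normalisation by the total number of counted pairs, and the mapping of every non-standard letter to `X` (Xaa) are all assumptions.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Uniform expectation: 1 of 400 dipeptides = 0.25% = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein
    (pairs never span two proteins) and frequencies are normalised
    by the total number of counted pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"   # would be marked red
    if freq_permille < expected:
        return "under"  # would be marked blue
    return "expected"
```

With the values in the table, `classify(9.418)` (LeuLeu) gives "over" and `classify(0.131)` (ProTrp) gives "under", since the uniform expectation is 2.5‰.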

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions; for example, 3.793‰ corresponds to 0.003793.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.793 ± 0.89
AlaCys: 1.046 ± 0.358
AlaAsp: 3.663 ± 0.857
AlaGlu: 3.139 ± 0.911
AlaPhe: 2.485 ± 0.573
AlaGly: 2.224 ± 0.661
AlaHis: 0.785 ± 0.266
AlaIle: 3.793 ± 0.425
AlaLys: 3.009 ± 0.532
AlaLeu: 5.232 ± 1.111
AlaMet: 1.439 ± 0.522
AlaAsn: 3.27 ± 0.69
AlaPro: 2.878 ± 0.864
AlaGln: 1.57 ± 0.393
AlaArg: 1.962 ± 0.459
AlaSer: 3.009 ± 0.623
AlaThr: 3.27 ± 0.756
AlaVal: 4.971 ± 0.964
AlaTrp: 0.523 ± 0.317
AlaTyr: 2.093 ± 0.762
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.308 ± 0.465
CysCys: 0.785 ± 0.329
CysAsp: 1.177 ± 0.529
CysGlu: 0.262 ± 0.219
CysPhe: 1.046 ± 0.436
CysGly: 1.177 ± 0.374
CysHis: 0.785 ± 0.329
CysIle: 1.439 ± 0.533
CysLys: 1.57 ± 0.511
CysLeu: 1.439 ± 0.762
CysMet: 1.046 ± 0.311
CysAsn: 1.962 ± 0.782
CysPro: 0.654 ± 0.254
CysGln: 0.916 ± 0.264
CysArg: 0.654 ± 0.26
CysSer: 1.831 ± 0.454
CysThr: 1.439 ± 0.432
CysVal: 0.392 ± 0.19
CysTrp: 0.262 ± 0.157
CysTyr: 0.916 ± 0.356
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.354 ± 0.402
AspCys: 1.439 ± 0.471
AspAsp: 2.354 ± 0.432
AspGlu: 3.139 ± 0.785
AspPhe: 3.401 ± 0.776
AspGly: 1.962 ± 0.429
AspHis: 1.046 ± 0.266
AspIle: 5.101 ± 0.71
AspLys: 2.354 ± 0.493
AspLeu: 6.671 ± 0.955
AspMet: 0.654 ± 0.208
AspAsn: 3.27 ± 0.519
AspPro: 3.009 ± 0.527
AspGln: 1.7 ± 0.364
AspArg: 2.354 ± 0.406
AspSer: 3.793 ± 0.561
AspThr: 3.27 ± 0.55
AspVal: 2.747 ± 0.51
AspTrp: 0.523 ± 0.234
AspTyr: 1.962 ± 0.604
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.532 ± 0.708
GluCys: 1.046 ± 0.332
GluAsp: 2.485 ± 0.711
GluGlu: 4.447 ± 1.409
GluPhe: 2.354 ± 0.746
GluGly: 3.009 ± 0.614
GluHis: 1.046 ± 0.387
GluIle: 5.363 ± 1.197
GluLys: 3.532 ± 0.902
GluLeu: 3.793 ± 0.873
GluMet: 1.308 ± 0.361
GluAsn: 3.663 ± 0.612
GluPro: 1.439 ± 0.461
GluGln: 2.224 ± 0.546
GluArg: 2.093 ± 0.623
GluSer: 3.924 ± 0.55
GluThr: 4.186 ± 0.544
GluVal: 2.485 ± 0.685
GluTrp: 0.785 ± 0.231
GluTyr: 1.962 ± 0.393
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.093 ± 0.541
PheCys: 1.046 ± 0.45
PheAsp: 2.747 ± 0.6
PheGlu: 2.354 ± 0.437
PhePhe: 2.224 ± 0.584
PheGly: 1.57 ± 0.31
PheHis: 1.57 ± 0.468
PheIle: 4.055 ± 0.407
PheLys: 3.793 ± 0.627
PheLeu: 5.232 ± 0.838
PheMet: 1.177 ± 0.376
PheAsn: 4.447 ± 0.575
PhePro: 2.354 ± 0.423
PheGln: 2.093 ± 0.542
PheArg: 2.616 ± 0.645
PheSer: 3.924 ± 0.568
PheThr: 2.093 ± 0.621
PheVal: 2.747 ± 0.564
PheTrp: 0.262 ± 0.175
PheTyr: 3.27 ± 0.471
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.224 ± 0.6
GlyCys: 0.785 ± 0.277
GlyAsp: 2.485 ± 0.422
GlyGlu: 1.962 ± 0.339
GlyPhe: 1.7 ± 0.494
GlyGly: 2.878 ± 0.912
GlyHis: 1.439 ± 0.372
GlyIle: 2.616 ± 0.454
GlyLys: 2.485 ± 0.67
GlyLeu: 4.709 ± 1.232
GlyMet: 1.57 ± 0.339
GlyAsn: 3.27 ± 0.54
GlyPro: 1.831 ± 0.606
GlyGln: 2.354 ± 0.709
GlyArg: 2.747 ± 0.491
GlySer: 3.532 ± 0.87
GlyThr: 3.401 ± 0.74
GlyVal: 1.7 ± 0.54
GlyTrp: 0.654 ± 0.236
GlyTyr: 1.177 ± 0.535
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.046 ± 0.227
HisCys: 0.785 ± 0.319
HisAsp: 0.916 ± 0.367
HisGlu: 1.046 ± 0.394
HisPhe: 1.439 ± 0.444
HisGly: 1.177 ± 0.401
HisHis: 0.392 ± 0.173
HisIle: 1.57 ± 0.372
HisLys: 0.523 ± 0.27
HisLeu: 2.224 ± 0.461
HisMet: 0.392 ± 0.255
HisAsn: 1.57 ± 0.544
HisPro: 1.177 ± 0.385
HisGln: 0.131 ± 0.124
HisArg: 1.046 ± 0.294
HisSer: 1.439 ± 0.369
HisThr: 1.439 ± 0.449
HisVal: 0.785 ± 0.272
HisTrp: 0.262 ± 0.189
HisTyr: 1.046 ± 0.472
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.055 ± 0.583
IleCys: 0.523 ± 0.204
IleAsp: 3.532 ± 0.794
IleGlu: 3.663 ± 0.605
IlePhe: 3.401 ± 0.631
IleGly: 2.616 ± 0.661
IleHis: 1.308 ± 0.419
IleIle: 2.878 ± 0.583
IleLys: 4.055 ± 0.916
IleLeu: 7.325 ± 1.106
IleMet: 0.916 ± 0.312
IleAsn: 4.186 ± 0.459
IlePro: 3.532 ± 0.574
IleGln: 3.401 ± 0.518
IleArg: 3.532 ± 0.636
IleSer: 5.886 ± 0.982
IleThr: 5.363 ± 0.953
IleVal: 3.401 ± 0.589
IleTrp: 1.177 ± 0.404
IleTyr: 2.878 ± 0.599
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.747 ± 0.592
LysCys: 0.916 ± 0.253
LysAsp: 4.055 ± 0.645
LysGlu: 3.27 ± 0.996
LysPhe: 4.709 ± 0.567
LysGly: 2.878 ± 0.521
LysHis: 1.308 ± 0.535
LysIle: 3.401 ± 0.919
LysLys: 3.401 ± 0.885
LysLeu: 6.017 ± 0.704
LysMet: 2.093 ± 0.598
LysAsn: 3.793 ± 0.761
LysPro: 2.878 ± 0.479
LysGln: 1.831 ± 0.511
LysArg: 2.747 ± 0.576
LysSer: 4.055 ± 0.74
LysThr: 3.009 ± 0.532
LysVal: 1.7 ± 0.351
LysTrp: 0.654 ± 0.357
LysTyr: 2.747 ± 0.77
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.101 ± 0.542
LeuCys: 3.401 ± 0.784
LeuAsp: 6.54 ± 0.986
LeuGlu: 4.447 ± 0.703
LeuPhe: 3.793 ± 0.604
LeuGly: 3.532 ± 0.729
LeuHis: 2.485 ± 0.628
LeuIle: 4.84 ± 0.799
LeuLys: 6.54 ± 1.268
LeuLeu: 9.418 ± 1.18
LeuMet: 1.046 ± 0.416
LeuAsn: 5.886 ± 1.259
LeuPro: 6.017 ± 0.896
LeuGln: 6.54 ± 0.809
LeuArg: 5.101 ± 0.803
LeuSer: 7.063 ± 0.983
LeuThr: 6.148 ± 0.971
LeuVal: 4.055 ± 0.687
LeuTrp: 0.523 ± 0.21
LeuTyr: 3.924 ± 0.603
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.916 ± 0.282
MetCys: 0.523 ± 0.27
MetAsp: 1.308 ± 0.589
MetGlu: 1.046 ± 0.331
MetPhe: 1.308 ± 0.465
MetGly: 1.046 ± 0.692
MetHis: 0.654 ± 0.304
MetIle: 1.177 ± 0.468
MetLys: 0.654 ± 0.306
MetLeu: 3.139 ± 0.606
MetMet: 0.654 ± 0.321
MetAsn: 1.046 ± 0.292
MetPro: 1.308 ± 0.374
MetGln: 2.224 ± 0.598
MetArg: 0.785 ± 0.363
MetSer: 1.962 ± 0.422
MetThr: 1.439 ± 0.531
MetVal: 0.916 ± 0.246
MetTrp: 0.523 ± 0.253
MetTyr: 0.916 ± 0.352
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.485 ± 0.526
AsnCys: 1.439 ± 0.722
AsnAsp: 3.009 ± 0.441
AsnGlu: 4.578 ± 0.867
AsnPhe: 4.971 ± 0.63
AsnGly: 2.616 ± 0.777
AsnHis: 0.785 ± 0.233
AsnIle: 5.625 ± 0.876
AsnLys: 2.878 ± 0.514
AsnLeu: 6.279 ± 0.73
AsnMet: 1.046 ± 0.269
AsnAsn: 4.055 ± 0.728
AsnPro: 2.747 ± 0.806
AsnGln: 3.663 ± 0.731
AsnArg: 3.009 ± 0.603
AsnSer: 5.232 ± 0.899
AsnThr: 3.27 ± 0.608
AsnVal: 4.317 ± 1.008
AsnTrp: 0.523 ± 0.2
AsnTyr: 2.878 ± 0.509
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.186 ± 0.909
ProCys: 0.785 ± 0.291
ProAsp: 2.354 ± 0.503
ProGlu: 4.84 ± 1.033
ProPhe: 2.485 ± 0.381
ProGly: 2.093 ± 0.494
ProHis: 0.916 ± 0.287
ProIle: 3.139 ± 0.596
ProLys: 2.224 ± 0.496
ProLeu: 5.101 ± 0.917
ProMet: 2.093 ± 0.385
ProAsn: 2.354 ± 0.539
ProPro: 3.663 ± 0.576
ProGln: 1.831 ± 0.396
ProArg: 2.354 ± 0.486
ProSer: 3.663 ± 0.857
ProThr: 3.663 ± 0.494
ProVal: 3.924 ± 1.177
ProTrp: 0.131 ± 0.109
ProTyr: 1.439 ± 0.58
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.009 ± 0.656
GlnCys: 1.177 ± 0.381
GlnAsp: 1.962 ± 0.391
GlnGlu: 2.747 ± 0.59
GlnPhe: 1.57 ± 0.419
GlnGly: 2.224 ± 0.523
GlnHis: 1.177 ± 0.402
GlnIle: 3.27 ± 0.813
GlnLys: 1.962 ± 0.494
GlnLeu: 4.186 ± 0.522
GlnMet: 1.439 ± 0.309
GlnAsn: 3.663 ± 0.48
GlnPro: 2.354 ± 0.517
GlnGln: 2.224 ± 0.542
GlnArg: 2.354 ± 0.679
GlnSer: 3.009 ± 0.761
GlnThr: 2.485 ± 0.463
GlnVal: 2.354 ± 0.354
GlnTrp: 0.262 ± 0.146
GlnTyr: 2.354 ± 0.412
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.224 ± 0.525
ArgCys: 0.654 ± 0.226
ArgAsp: 2.224 ± 0.573
ArgGlu: 2.354 ± 0.5
ArgPhe: 2.093 ± 0.72
ArgGly: 2.354 ± 0.521
ArgHis: 0.392 ± 0.226
ArgIle: 3.793 ± 0.807
ArgLys: 2.093 ± 0.577
ArgLeu: 4.317 ± 0.842
ArgMet: 0.916 ± 0.245
ArgAsn: 3.27 ± 0.65
ArgPro: 2.224 ± 0.584
ArgGln: 2.354 ± 0.487
ArgArg: 4.709 ± 1.582
ArgSer: 3.401 ± 0.629
ArgThr: 3.663 ± 0.523
ArgVal: 2.485 ± 0.59
ArgTrp: 0.262 ± 0.192
ArgTyr: 2.224 ± 0.417
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.186 ± 0.939
SerCys: 1.57 ± 0.371
SerAsp: 3.401 ± 0.531
SerGlu: 3.139 ± 0.718
SerPhe: 4.317 ± 0.847
SerGly: 4.709 ± 0.747
SerHis: 1.046 ± 0.295
SerIle: 5.755 ± 0.818
SerLys: 5.101 ± 1.007
SerLeu: 5.755 ± 0.759
SerMet: 1.831 ± 0.456
SerAsn: 5.101 ± 0.838
SerPro: 3.663 ± 0.488
SerGln: 2.878 ± 0.477
SerArg: 2.747 ± 0.545
SerSer: 5.625 ± 0.796
SerThr: 5.101 ± 0.832
SerVal: 3.532 ± 0.553
SerTrp: 0.654 ± 0.301
SerTyr: 2.093 ± 0.452
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.793 ± 0.552
ThrCys: 1.439 ± 0.603
ThrAsp: 3.793 ± 0.559
ThrGlu: 3.27 ± 0.726
ThrPhe: 3.532 ± 0.77
ThrGly: 3.532 ± 0.659
ThrHis: 1.046 ± 0.632
ThrIle: 4.447 ± 0.763
ThrLys: 4.971 ± 0.749
ThrLeu: 4.709 ± 0.756
ThrMet: 0.916 ± 0.254
ThrAsn: 3.139 ± 0.702
ThrPro: 2.878 ± 0.493
ThrGln: 3.009 ± 0.507
ThrArg: 2.485 ± 0.466
ThrSer: 4.971 ± 0.675
ThrThr: 4.447 ± 1.039
ThrVal: 3.793 ± 0.594
ThrTrp: 0.785 ± 0.345
ThrTyr: 3.793 ± 0.729
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.747 ± 0.583
ValCys: 0.523 ± 0.366
ValAsp: 2.616 ± 0.65
ValGlu: 3.139 ± 0.438
ValPhe: 2.093 ± 0.377
ValGly: 1.57 ± 0.418
ValHis: 1.046 ± 0.395
ValIle: 3.009 ± 0.605
ValLys: 3.401 ± 0.68
ValLeu: 6.148 ± 1.11
ValMet: 0.523 ± 0.289
ValAsn: 3.532 ± 0.498
ValPro: 5.101 ± 0.965
ValGln: 2.485 ± 0.556
ValArg: 2.878 ± 0.571
ValSer: 2.485 ± 0.508
ValThr: 3.27 ± 0.461
ValVal: 3.924 ± 0.91
ValTrp: 0.392 ± 0.195
ValTyr: 2.224 ± 0.445
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.654 ± 0.286
TrpCys: 0.392 ± 0.169
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.131 ± 0.109
TrpPhe: 0.392 ± 0.206
TrpGly: 0.523 ± 0.222
TrpHis: 0.0 ± 0.0
TrpIle: 1.046 ± 0.404
TrpLys: 1.046 ± 0.354
TrpLeu: 0.916 ± 0.391
TrpMet: 0.131 ± 0.109
TrpAsn: 0.785 ± 0.263
TrpPro: 0.785 ± 0.375
TrpGln: 0.785 ± 0.379
TrpArg: 0.262 ± 0.169
TrpSer: 0.654 ± 0.282
TrpThr: 1.177 ± 0.45
TrpVal: 0.131 ± 0.118
TrpTrp: 0.262 ± 0.162
TrpTyr: 0.131 ± 0.124
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.831 ± 0.32
TyrCys: 0.785 ± 0.28
TyrAsp: 2.485 ± 0.819
TyrGlu: 1.7 ± 0.393
TyrPhe: 2.354 ± 0.646
TyrGly: 1.831 ± 0.572
TyrHis: 1.177 ± 0.381
TyrIle: 1.308 ± 0.346
TyrLys: 2.616 ± 0.466
TyrLeu: 3.793 ± 0.832
TyrMet: 2.224 ± 0.539
TyrAsn: 3.139 ± 0.568
TyrPro: 2.747 ± 0.597
TyrGln: 1.57 ± 0.419
TyrArg: 1.57 ± 0.285
TyrSer: 2.747 ± 0.355
TyrThr: 2.616 ± 0.482
TyrVal: 2.747 ± 0.475
TyrTrp: 0.785 ± 0.297
TyrTyr: 0.916 ± 0.316
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 22 proteins (7646 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
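Bootstrapping at the protein level means that whole proteins, rather than individual residues, are resampled with replacement. A minimal Python sketch of this idea follows. It is not the database's implementation: the function names, the fixed seed, the normalisation by the total number of pairs, and the choice of the standard deviation across replicates as the reported error are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide over overlapping pairs.

    Assumption: normalised by the total number of counted pairs.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Estimate the spread of a dipeptide frequency by protein-level bootstrap.

    Each replicate draws len(proteins) whole proteins with replacement
    and recomputes the frequency; the standard deviation of the
    replicates is returned (assumed definition of the error).
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)
```

For a proteome of only 22 proteins, each replicate can differ noticeably from the original set, which is one reason the errors in the table are large relative to the values.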

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski