Amino acid dipepetide frequency for Pigeon adenovirus 2a

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.941AlaAla: 4.941 ± 1.14
0.841AlaCys: 0.841 ± 0.34
3.259AlaAsp: 3.259 ± 0.629
4.1AlaGlu: 4.1 ± 0.614
2.944AlaPhe: 2.944 ± 0.47
3.89AlaGly: 3.89 ± 0.911
1.682AlaHis: 1.682 ± 0.681
4.1AlaIle: 4.1 ± 0.761
2.313AlaLys: 2.313 ± 0.562
7.149AlaLeu: 7.149 ± 0.891
2.208AlaMet: 2.208 ± 0.797
2.733AlaAsn: 2.733 ± 0.683
3.574AlaPro: 3.574 ± 0.753
3.259AlaGln: 3.259 ± 0.684
3.469AlaArg: 3.469 ± 0.537
4.31AlaSer: 4.31 ± 0.768
5.467AlaThr: 5.467 ± 0.947
4.31AlaVal: 4.31 ± 0.709
0.421AlaTrp: 0.421 ± 0.192
2.313AlaTyr: 2.313 ± 0.417
0.0AlaXaa: 0.0 ± 0.0
Cys
0.736CysAla: 0.736 ± 0.295
0.21CysCys: 0.21 ± 0.158
0.841CysAsp: 0.841 ± 0.292
0.421CysGlu: 0.421 ± 0.176
0.841CysPhe: 0.841 ± 0.254
1.051CysGly: 1.051 ± 0.377
0.736CysHis: 0.736 ± 0.263
0.631CysIle: 0.631 ± 0.293
0.841CysLys: 0.841 ± 0.312
1.156CysLeu: 1.156 ± 0.454
0.315CysMet: 0.315 ± 0.16
1.051CysAsn: 1.051 ± 0.333
0.841CysPro: 0.841 ± 0.332
0.631CysGln: 0.631 ± 0.184
0.736CysArg: 0.736 ± 0.327
1.787CysSer: 1.787 ± 0.472
0.631CysThr: 0.631 ± 0.281
0.526CysVal: 0.526 ± 0.229
0.21CysTrp: 0.21 ± 0.156
0.526CysTyr: 0.526 ± 0.209
0.0CysXaa: 0.0 ± 0.0
Asp
3.574AspAla: 3.574 ± 0.93
0.421AspCys: 0.421 ± 0.187
3.049AspAsp: 3.049 ± 0.556
3.364AspGlu: 3.364 ± 0.672
2.208AspPhe: 2.208 ± 0.401
3.785AspGly: 3.785 ± 0.668
0.946AspHis: 0.946 ± 0.349
3.995AspIle: 3.995 ± 0.783
2.103AspLys: 2.103 ± 0.45
6.098AspLeu: 6.098 ± 0.695
1.156AspMet: 1.156 ± 0.359
2.944AspAsn: 2.944 ± 0.723
4.1AspPro: 4.1 ± 0.641
1.787AspGln: 1.787 ± 0.449
2.523AspArg: 2.523 ± 0.528
4.415AspSer: 4.415 ± 0.943
3.364AspThr: 3.364 ± 0.539
3.574AspVal: 3.574 ± 0.575
0.315AspTrp: 0.315 ± 0.231
2.313AspTyr: 2.313 ± 0.535
0.0AspXaa: 0.0 ± 0.0
Glu
3.154GluAla: 3.154 ± 0.305
0.421GluCys: 0.421 ± 0.191
3.469GluAsp: 3.469 ± 0.666
5.782GluGlu: 5.782 ± 1.534
1.682GluPhe: 1.682 ± 0.47
3.68GluGly: 3.68 ± 0.807
1.262GluHis: 1.262 ± 0.328
2.523GluIle: 2.523 ± 0.504
2.733GluLys: 2.733 ± 0.744
6.203GluLeu: 6.203 ± 0.761
1.682GluMet: 1.682 ± 0.522
2.628GluAsn: 2.628 ± 0.659
1.997GluPro: 1.997 ± 0.556
2.839GluGln: 2.839 ± 0.723
3.89GluArg: 3.89 ± 0.678
3.469GluSer: 3.469 ± 0.632
4.415GluThr: 4.415 ± 0.659
2.839GluVal: 2.839 ± 0.687
0.841GluTrp: 0.841 ± 0.214
1.892GluTyr: 1.892 ± 0.382
0.0GluXaa: 0.0 ± 0.0
Phe
3.259PheAla: 3.259 ± 0.784
0.526PheCys: 0.526 ± 0.248
2.944PheAsp: 2.944 ± 0.606
2.313PheGlu: 2.313 ± 0.428
2.313PhePhe: 2.313 ± 0.488
1.367PheGly: 1.367 ± 0.4
1.367PheHis: 1.367 ± 0.337
1.892PheIle: 1.892 ± 0.415
1.577PheLys: 1.577 ± 0.408
4.1PheLeu: 4.1 ± 0.815
1.051PheMet: 1.051 ± 0.328
1.892PheAsn: 1.892 ± 0.495
2.418PhePro: 2.418 ± 0.597
1.051PheGln: 1.051 ± 0.276
3.995PheArg: 3.995 ± 0.513
3.89PheSer: 3.89 ± 0.637
2.313PheThr: 2.313 ± 0.399
2.733PheVal: 2.733 ± 0.444
0.315PheTrp: 0.315 ± 0.167
2.523PheTyr: 2.523 ± 0.471
0.0PheXaa: 0.0 ± 0.0
Gly
5.257GlyAla: 5.257 ± 0.743
0.315GlyCys: 0.315 ± 0.195
2.418GlyAsp: 2.418 ± 0.393
3.049GlyGlu: 3.049 ± 0.689
1.367GlyPhe: 1.367 ± 0.366
5.362GlyGly: 5.362 ± 0.869
0.841GlyHis: 0.841 ± 0.468
2.628GlyIle: 2.628 ± 0.927
2.733GlyLys: 2.733 ± 0.562
5.257GlyLeu: 5.257 ± 0.999
1.682GlyMet: 1.682 ± 0.394
2.839GlyAsn: 2.839 ± 0.74
3.049GlyPro: 3.049 ± 0.503
2.208GlyGln: 2.208 ± 0.45
4.731GlyArg: 4.731 ± 0.912
2.944GlySer: 2.944 ± 0.537
4.521GlyThr: 4.521 ± 0.752
4.731GlyVal: 4.731 ± 0.865
0.946GlyTrp: 0.946 ± 0.258
2.523GlyTyr: 2.523 ± 0.294
0.0GlyXaa: 0.0 ± 0.0
His
1.262HisAla: 1.262 ± 0.312
0.631HisCys: 0.631 ± 0.303
0.946HisAsp: 0.946 ± 0.315
1.051HisGlu: 1.051 ± 0.382
0.736HisPhe: 0.736 ± 0.31
1.156HisGly: 1.156 ± 0.388
0.736HisHis: 0.736 ± 0.242
1.156HisIle: 1.156 ± 0.275
0.841HisLys: 0.841 ± 0.265
2.208HisLeu: 2.208 ± 0.51
0.21HisMet: 0.21 ± 0.158
1.577HisAsn: 1.577 ± 0.306
2.839HisPro: 2.839 ± 0.917
0.631HisGln: 0.631 ± 0.244
2.733HisArg: 2.733 ± 0.449
1.682HisSer: 1.682 ± 0.288
0.841HisThr: 0.841 ± 0.222
1.892HisVal: 1.892 ± 0.595
0.315HisTrp: 0.315 ± 0.176
0.946HisTyr: 0.946 ± 0.346
0.0HisXaa: 0.0 ± 0.0
Ile
3.89IleAla: 3.89 ± 0.565
1.051IleCys: 1.051 ± 0.372
2.839IleAsp: 2.839 ± 0.71
2.313IleGlu: 2.313 ± 0.534
1.577IlePhe: 1.577 ± 0.418
2.839IleGly: 2.839 ± 0.336
1.156IleHis: 1.156 ± 0.306
1.577IleIle: 1.577 ± 0.599
2.628IleLys: 2.628 ± 0.576
3.154IleLeu: 3.154 ± 0.49
0.946IleMet: 0.946 ± 0.274
1.997IleAsn: 1.997 ± 0.545
4.1IlePro: 4.1 ± 0.653
1.892IleGln: 1.892 ± 0.436
2.839IleArg: 2.839 ± 0.499
2.733IleSer: 2.733 ± 0.586
2.103IleThr: 2.103 ± 0.415
2.103IleVal: 2.103 ± 0.399
0.736IleTrp: 0.736 ± 0.266
2.208IleTyr: 2.208 ± 0.327
0.0IleXaa: 0.0 ± 0.0
Lys
2.628LysAla: 2.628 ± 0.642
0.841LysCys: 0.841 ± 0.351
2.523LysAsp: 2.523 ± 0.625
2.208LysGlu: 2.208 ± 0.633
1.892LysPhe: 1.892 ± 0.406
2.103LysGly: 2.103 ± 0.494
0.841LysHis: 0.841 ± 0.287
1.682LysIle: 1.682 ± 0.601
1.472LysLys: 1.472 ± 0.495
4.1LysLeu: 4.1 ± 0.624
0.631LysMet: 0.631 ± 0.229
1.682LysAsn: 1.682 ± 0.417
1.367LysPro: 1.367 ± 0.308
1.577LysGln: 1.577 ± 0.306
3.469LysArg: 3.469 ± 0.559
1.997LysSer: 1.997 ± 0.679
3.469LysThr: 3.469 ± 0.482
1.577LysVal: 1.577 ± 0.377
0.631LysTrp: 0.631 ± 0.238
1.156LysTyr: 1.156 ± 0.42
0.0LysXaa: 0.0 ± 0.0
Leu
5.151LeuAla: 5.151 ± 0.572
1.682LeuCys: 1.682 ± 0.366
4.521LeuAsp: 4.521 ± 0.514
4.1LeuGlu: 4.1 ± 0.602
3.89LeuPhe: 3.89 ± 0.765
4.836LeuGly: 4.836 ± 0.676
3.469LeuHis: 3.469 ± 0.71
3.995LeuIle: 3.995 ± 0.669
3.364LeuLys: 3.364 ± 0.641
8.936LeuLeu: 8.936 ± 1.053
2.733LeuMet: 2.733 ± 0.499
5.467LeuAsn: 5.467 ± 1.098
5.046LeuPro: 5.046 ± 0.922
5.572LeuGln: 5.572 ± 0.742
7.99LeuArg: 7.99 ± 0.988
7.359LeuSer: 7.359 ± 0.837
6.623LeuThr: 6.623 ± 0.877
3.995LeuVal: 3.995 ± 0.736
1.577LeuTrp: 1.577 ± 0.429
4.941LeuTyr: 4.941 ± 0.685
0.0LeuXaa: 0.0 ± 0.0
Met
2.418MetAla: 2.418 ± 0.54
0.421MetCys: 0.421 ± 0.227
1.577MetAsp: 1.577 ± 0.33
1.472MetGlu: 1.472 ± 0.448
0.736MetPhe: 0.736 ± 0.265
1.262MetGly: 1.262 ± 0.327
0.841MetHis: 0.841 ± 0.339
0.946MetIle: 0.946 ± 0.249
0.631MetLys: 0.631 ± 0.315
1.787MetLeu: 1.787 ± 0.452
0.736MetMet: 0.736 ± 0.181
1.051MetAsn: 1.051 ± 0.357
1.367MetPro: 1.367 ± 0.388
1.051MetGln: 1.051 ± 0.256
1.682MetArg: 1.682 ± 0.465
2.628MetSer: 2.628 ± 0.554
0.946MetThr: 0.946 ± 0.254
1.367MetVal: 1.367 ± 0.35
0.315MetTrp: 0.315 ± 0.166
1.472MetTyr: 1.472 ± 0.453
0.0MetXaa: 0.0 ± 0.0
Asn
3.995AsnAla: 3.995 ± 0.795
0.526AsnCys: 0.526 ± 0.27
2.208AsnAsp: 2.208 ± 0.491
3.154AsnGlu: 3.154 ± 0.824
2.103AsnPhe: 2.103 ± 0.618
3.469AsnGly: 3.469 ± 0.735
1.156AsnHis: 1.156 ± 0.265
2.313AsnIle: 2.313 ± 0.634
1.682AsnLys: 1.682 ± 0.311
3.89AsnLeu: 3.89 ± 0.643
1.156AsnMet: 1.156 ± 0.333
3.364AsnAsn: 3.364 ± 0.855
4.31AsnPro: 4.31 ± 0.58
1.787AsnGln: 1.787 ± 0.495
3.154AsnArg: 3.154 ± 0.588
3.049AsnSer: 3.049 ± 0.552
3.259AsnThr: 3.259 ± 0.643
3.785AsnVal: 3.785 ± 1.056
1.156AsnTrp: 1.156 ± 0.522
2.208AsnTyr: 2.208 ± 0.546
0.0AsnXaa: 0.0 ± 0.0
Pro
3.89ProAla: 3.89 ± 0.908
0.736ProCys: 0.736 ± 0.332
3.89ProAsp: 3.89 ± 0.785
3.469ProGlu: 3.469 ± 0.506
3.259ProPhe: 3.259 ± 0.518
2.733ProGly: 2.733 ± 0.647
1.892ProHis: 1.892 ± 0.363
1.892ProIle: 1.892 ± 0.585
2.313ProLys: 2.313 ± 0.569
5.467ProLeu: 5.467 ± 1.059
1.367ProMet: 1.367 ± 0.49
2.628ProAsn: 2.628 ± 0.563
5.151ProPro: 5.151 ± 0.902
1.892ProGln: 1.892 ± 0.453
2.944ProArg: 2.944 ± 0.437
6.413ProSer: 6.413 ± 0.874
3.574ProThr: 3.574 ± 0.632
4.1ProVal: 4.1 ± 0.696
0.315ProTrp: 0.315 ± 0.199
3.049ProTyr: 3.049 ± 0.609
0.0ProXaa: 0.0 ± 0.0
Gln
3.364GlnAla: 3.364 ± 0.629
0.21GlnCys: 0.21 ± 0.163
1.682GlnAsp: 1.682 ± 0.426
2.523GlnGlu: 2.523 ± 0.499
2.313GlnPhe: 2.313 ± 0.562
1.787GlnGly: 1.787 ± 0.523
0.841GlnHis: 0.841 ± 0.263
1.367GlnIle: 1.367 ± 0.365
1.262GlnLys: 1.262 ± 0.362
4.415GlnLeu: 4.415 ± 0.618
1.577GlnMet: 1.577 ± 0.425
2.523GlnAsn: 2.523 ± 0.565
2.523GlnPro: 2.523 ± 0.46
1.892GlnGln: 1.892 ± 0.415
2.733GlnArg: 2.733 ± 0.493
2.839GlnSer: 2.839 ± 0.581
2.523GlnThr: 2.523 ± 0.523
1.682GlnVal: 1.682 ± 0.34
0.421GlnTrp: 0.421 ± 0.309
1.577GlnTyr: 1.577 ± 0.413
0.0GlnXaa: 0.0 ± 0.0
Arg
3.68ArgAla: 3.68 ± 0.535
1.156ArgCys: 1.156 ± 0.555
3.154ArgAsp: 3.154 ± 0.612
3.785ArgGlu: 3.785 ± 0.888
3.785ArgPhe: 3.785 ± 0.407
4.415ArgGly: 4.415 ± 0.872
1.787ArgHis: 1.787 ± 0.375
2.313ArgIle: 2.313 ± 0.524
3.154ArgLys: 3.154 ± 0.552
6.623ArgLeu: 6.623 ± 0.846
1.892ArgMet: 1.892 ± 0.468
3.68ArgAsn: 3.68 ± 0.804
3.259ArgPro: 3.259 ± 0.534
3.259ArgGln: 3.259 ± 0.429
6.728ArgArg: 6.728 ± 2.308
4.626ArgSer: 4.626 ± 0.749
4.521ArgThr: 4.521 ± 0.734
5.046ArgVal: 5.046 ± 1.055
1.156ArgTrp: 1.156 ± 0.305
2.418ArgTyr: 2.418 ± 0.513
0.0ArgXaa: 0.0 ± 0.0
Ser
4.205SerAla: 4.205 ± 0.763
1.472SerCys: 1.472 ± 0.517
4.31SerAsp: 4.31 ± 0.666
4.31SerGlu: 4.31 ± 0.724
3.049SerPhe: 3.049 ± 0.596
6.098SerGly: 6.098 ± 1.3
1.367SerHis: 1.367 ± 0.419
4.205SerIle: 4.205 ± 0.718
2.313SerLys: 2.313 ± 0.54
7.254SerLeu: 7.254 ± 0.876
1.577SerMet: 1.577 ± 0.385
3.364SerAsn: 3.364 ± 0.541
3.995SerPro: 3.995 ± 0.74
2.733SerGln: 2.733 ± 0.656
5.151SerArg: 5.151 ± 0.965
7.359SerSer: 7.359 ± 1.38
3.469SerThr: 3.469 ± 0.595
3.89SerVal: 3.89 ± 0.476
1.262SerTrp: 1.262 ± 0.47
2.208SerTyr: 2.208 ± 0.632
0.0SerXaa: 0.0 ± 0.0
Thr
4.31ThrAla: 4.31 ± 0.636
0.526ThrCys: 0.526 ± 0.254
4.941ThrAsp: 4.941 ± 0.625
4.31ThrGlu: 4.31 ± 0.729
3.154ThrPhe: 3.154 ± 0.54
3.364ThrGly: 3.364 ± 0.542
1.367ThrHis: 1.367 ± 0.386
2.313ThrIle: 2.313 ± 0.402
1.682ThrLys: 1.682 ± 0.369
5.887ThrLeu: 5.887 ± 0.764
1.367ThrMet: 1.367 ± 0.339
3.259ThrAsn: 3.259 ± 0.712
4.521ThrPro: 4.521 ± 0.827
1.682ThrGln: 1.682 ± 0.351
3.89ThrArg: 3.89 ± 0.743
4.521ThrSer: 4.521 ± 0.797
3.364ThrThr: 3.364 ± 0.589
4.626ThrVal: 4.626 ± 0.757
0.631ThrTrp: 0.631 ± 0.201
2.628ThrTyr: 2.628 ± 0.619
0.0ThrXaa: 0.0 ± 0.0
Val
3.995ValAla: 3.995 ± 0.488
1.577ValCys: 1.577 ± 0.315
3.68ValAsp: 3.68 ± 0.541
2.628ValGlu: 2.628 ± 0.565
3.049ValPhe: 3.049 ± 0.671
3.364ValGly: 3.364 ± 0.512
1.262ValHis: 1.262 ± 0.491
3.049ValIle: 3.049 ± 0.611
1.892ValLys: 1.892 ± 0.419
6.308ValLeu: 6.308 ± 0.869
0.946ValMet: 0.946 ± 0.242
3.89ValAsn: 3.89 ± 0.747
3.469ValPro: 3.469 ± 0.719
2.733ValGln: 2.733 ± 0.602
4.731ValArg: 4.731 ± 0.748
3.469ValSer: 3.469 ± 0.537
3.89ValThr: 3.89 ± 0.703
4.31ValVal: 4.31 ± 0.654
0.631ValTrp: 0.631 ± 0.3
2.208ValTyr: 2.208 ± 0.42
0.0ValXaa: 0.0 ± 0.0
Trp
0.946TrpAla: 0.946 ± 0.26
0.421TrpCys: 0.421 ± 0.241
0.631TrpAsp: 0.631 ± 0.278
0.736TrpGlu: 0.736 ± 0.199
0.631TrpPhe: 0.631 ± 0.262
0.631TrpGly: 0.631 ± 0.234
0.0TrpHis: 0.0 ± 0.0
0.526TrpIle: 0.526 ± 0.293
0.736TrpLys: 0.736 ± 0.326
0.841TrpLeu: 0.841 ± 0.279
0.105TrpMet: 0.105 ± 0.11
1.051TrpAsn: 1.051 ± 0.349
0.631TrpPro: 0.631 ± 0.407
0.841TrpGln: 0.841 ± 0.239
0.421TrpArg: 0.421 ± 0.189
1.051TrpSer: 1.051 ± 0.485
0.841TrpThr: 0.841 ± 0.251
1.156TrpVal: 1.156 ± 0.355
0.21TrpTrp: 0.21 ± 0.181
0.736TrpTyr: 0.736 ± 0.252
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.628TyrAla: 2.628 ± 0.572
0.736TyrCys: 0.736 ± 0.201
3.154TyrAsp: 3.154 ± 0.56
2.313TyrGlu: 2.313 ± 0.598
2.313TyrPhe: 2.313 ± 0.626
2.208TyrGly: 2.208 ± 0.514
0.841TyrHis: 0.841 ± 0.271
1.577TyrIle: 1.577 ± 0.503
1.577TyrLys: 1.577 ± 0.426
4.415TyrLeu: 4.415 ± 0.523
1.262TyrMet: 1.262 ± 0.428
2.103TyrAsn: 2.103 ± 0.37
2.418TyrPro: 2.418 ± 0.515
0.736TyrGln: 0.736 ± 0.198
2.628TyrArg: 2.628 ± 0.52
3.049TyrSer: 3.049 ± 0.576
2.208TyrThr: 2.208 ± 0.448
2.944TyrVal: 2.944 ± 0.615
0.736TyrTrp: 0.736 ± 0.28
1.262TyrTyr: 1.262 ± 0.413
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 25 proteins (9513 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski