Amino acid dipepetide frequency for Kimberley virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.297AlaAla: 1.297 ± 0.702
1.081AlaCys: 1.081 ± 0.404
1.513AlaAsp: 1.513 ± 0.514
1.297AlaGlu: 1.297 ± 0.446
1.297AlaPhe: 1.297 ± 0.501
1.946AlaGly: 1.946 ± 0.969
0.865AlaHis: 0.865 ± 0.28
2.378AlaIle: 2.378 ± 0.618
3.026AlaLys: 3.026 ± 0.488
2.378AlaLeu: 2.378 ± 0.864
0.865AlaMet: 0.865 ± 0.646
0.865AlaAsn: 0.865 ± 0.384
0.649AlaPro: 0.649 ± 0.423
1.297AlaGln: 1.297 ± 0.292
1.297AlaArg: 1.297 ± 0.603
1.297AlaSer: 1.297 ± 0.48
0.865AlaThr: 0.865 ± 0.562
1.297AlaVal: 1.297 ± 0.399
0.0AlaTrp: 0.0 ± 0.0
1.729AlaTyr: 1.729 ± 0.552
0.0AlaXaa: 0.0 ± 0.0
Cys
0.649CysAla: 0.649 ± 0.32
0.216CysCys: 0.216 ± 0.141
0.865CysAsp: 0.865 ± 0.436
1.729CysGlu: 1.729 ± 0.543
0.649CysPhe: 0.649 ± 0.284
1.946CysGly: 1.946 ± 0.471
0.649CysHis: 0.649 ± 0.28
0.865CysIle: 0.865 ± 0.319
1.946CysLys: 1.946 ± 0.655
2.378CysLeu: 2.378 ± 0.431
0.649CysMet: 0.649 ± 0.417
0.865CysAsn: 0.865 ± 0.279
0.649CysPro: 0.649 ± 0.473
1.081CysGln: 1.081 ± 0.352
0.865CysArg: 0.865 ± 0.596
1.081CysSer: 1.081 ± 0.526
1.729CysThr: 1.729 ± 0.999
0.865CysVal: 0.865 ± 0.401
0.216CysTrp: 0.216 ± 0.141
0.432CysTyr: 0.432 ± 0.281
0.0CysXaa: 0.0 ± 0.0
Asp
1.729AspAla: 1.729 ± 0.751
0.865AspCys: 0.865 ± 0.335
4.323AspAsp: 4.323 ± 1.335
5.188AspGlu: 5.188 ± 0.874
1.513AspPhe: 1.513 ± 0.56
3.891AspGly: 3.891 ± 0.962
0.865AspHis: 0.865 ± 0.434
6.053AspIle: 6.053 ± 2.008
6.053AspLys: 6.053 ± 0.538
6.053AspLeu: 6.053 ± 1.619
2.81AspMet: 2.81 ± 0.789
3.459AspAsn: 3.459 ± 0.994
1.729AspPro: 1.729 ± 0.361
2.594AspGln: 2.594 ± 0.639
2.162AspArg: 2.162 ± 0.399
2.594AspSer: 2.594 ± 1.018
1.946AspThr: 1.946 ± 0.396
2.594AspVal: 2.594 ± 0.67
1.513AspTrp: 1.513 ± 0.474
3.459AspTyr: 3.459 ± 0.71
0.0AspXaa: 0.0 ± 0.0
Glu
2.378GluAla: 2.378 ± 0.719
0.865GluCys: 0.865 ± 0.462
5.404GluAsp: 5.404 ± 1.25
6.053GluGlu: 6.053 ± 1.235
4.107GluPhe: 4.107 ± 0.951
5.188GluGly: 5.188 ± 0.872
0.865GluHis: 0.865 ± 0.342
6.701GluIle: 6.701 ± 1.277
6.269GluLys: 6.269 ± 1.023
10.16GluLeu: 10.16 ± 0.888
1.729GluMet: 1.729 ± 0.437
3.675GluAsn: 3.675 ± 0.545
1.946GluPro: 1.946 ± 0.473
1.297GluGln: 1.297 ± 0.645
3.891GluArg: 3.891 ± 0.804
5.404GluSer: 5.404 ± 1.01
3.026GluThr: 3.026 ± 1.053
4.756GluVal: 4.756 ± 0.922
1.297GluTrp: 1.297 ± 0.436
2.594GluTyr: 2.594 ± 0.577
0.0GluXaa: 0.0 ± 0.0
Phe
0.649PheAla: 0.649 ± 0.301
1.297PheCys: 1.297 ± 0.644
1.946PheAsp: 1.946 ± 0.606
2.81PheGlu: 2.81 ± 0.838
2.378PhePhe: 2.378 ± 0.542
1.729PheGly: 1.729 ± 0.63
0.649PheHis: 0.649 ± 0.266
2.378PheIle: 2.378 ± 0.678
3.891PheLys: 3.891 ± 0.994
3.459PheLeu: 3.459 ± 0.856
0.649PheMet: 0.649 ± 0.32
3.243PheAsn: 3.243 ± 0.843
1.513PhePro: 1.513 ± 0.459
1.081PheGln: 1.081 ± 0.499
2.81PheArg: 2.81 ± 0.501
2.378PheSer: 2.378 ± 0.692
1.297PheThr: 1.297 ± 0.632
1.729PheVal: 1.729 ± 0.561
0.865PheTrp: 0.865 ± 0.396
1.297PheTyr: 1.297 ± 0.72
0.0PheXaa: 0.0 ± 0.0
Gly
0.649GlyAla: 0.649 ± 0.284
0.649GlyCys: 0.649 ± 0.417
4.323GlyAsp: 4.323 ± 0.715
4.972GlyGlu: 4.972 ± 0.909
3.026GlyPhe: 3.026 ± 0.719
3.459GlyGly: 3.459 ± 0.841
1.081GlyHis: 1.081 ± 0.556
6.269GlyIle: 6.269 ± 1.884
5.62GlyLys: 5.62 ± 1.102
6.485GlyLeu: 6.485 ± 1.93
1.297GlyMet: 1.297 ± 0.288
2.378GlyAsn: 2.378 ± 0.807
1.081GlyPro: 1.081 ± 0.442
1.946GlyGln: 1.946 ± 0.407
2.81GlyArg: 2.81 ± 1.173
4.54GlySer: 4.54 ± 1.052
3.891GlyThr: 3.891 ± 0.847
3.459GlyVal: 3.459 ± 1.024
1.081GlyTrp: 1.081 ± 0.352
2.81GlyTyr: 2.81 ± 0.721
0.0GlyXaa: 0.0 ± 0.0
His
0.432HisAla: 0.432 ± 0.217
0.649HisCys: 0.649 ± 0.29
1.513HisAsp: 1.513 ± 0.348
1.297HisGlu: 1.297 ± 0.402
1.729HisPhe: 1.729 ± 0.812
1.946HisGly: 1.946 ± 0.47
0.432HisHis: 0.432 ± 0.38
1.946HisIle: 1.946 ± 0.331
2.162HisLys: 2.162 ± 0.505
1.297HisLeu: 1.297 ± 0.352
0.649HisMet: 0.649 ± 0.547
1.081HisAsn: 1.081 ± 0.446
1.729HisPro: 1.729 ± 0.643
0.432HisGln: 0.432 ± 0.217
1.513HisArg: 1.513 ± 0.509
0.649HisSer: 0.649 ± 0.346
0.0HisThr: 0.0 ± 0.0
1.513HisVal: 1.513 ± 0.752
0.865HisTrp: 0.865 ± 0.434
1.081HisTyr: 1.081 ± 0.506
0.0HisXaa: 0.0 ± 0.0
Ile
2.594IleAla: 2.594 ± 0.855
2.378IleCys: 2.378 ± 0.69
5.188IleAsp: 5.188 ± 0.86
5.188IleGlu: 5.188 ± 1.53
2.594IlePhe: 2.594 ± 0.677
6.269IleGly: 6.269 ± 0.593
0.865IleHis: 0.865 ± 0.571
6.269IleIle: 6.269 ± 1.287
9.511IleLys: 9.511 ± 1.194
7.134IleLeu: 7.134 ± 1.375
1.297IleMet: 1.297 ± 0.437
5.188IleAsn: 5.188 ± 1.23
2.594IlePro: 2.594 ± 0.491
2.162IleGln: 2.162 ± 0.475
4.54IleArg: 4.54 ± 0.651
6.701IleSer: 6.701 ± 1.33
2.594IleThr: 2.594 ± 1.705
4.972IleVal: 4.972 ± 0.867
1.513IleTrp: 1.513 ± 0.517
3.243IleTyr: 3.243 ± 0.95
0.0IleXaa: 0.0 ± 0.0
Lys
2.162LysAla: 2.162 ± 1.008
1.729LysCys: 1.729 ± 0.578
4.972LysAsp: 4.972 ± 1.547
8.431LysGlu: 8.431 ± 1.888
2.594LysPhe: 2.594 ± 1.303
7.35LysGly: 7.35 ± 1.458
2.378LysHis: 2.378 ± 0.648
7.566LysIle: 7.566 ± 1.225
9.511LysLys: 9.511 ± 1.72
7.782LysLeu: 7.782 ± 1.306
2.594LysMet: 2.594 ± 0.732
6.701LysAsn: 6.701 ± 1.87
4.107LysPro: 4.107 ± 1.364
0.432LysGln: 0.432 ± 0.281
3.243LysArg: 3.243 ± 1.205
6.917LysSer: 6.917 ± 0.878
2.378LysThr: 2.378 ± 0.776
5.404LysVal: 5.404 ± 0.984
1.729LysTrp: 1.729 ± 0.682
3.026LysTyr: 3.026 ± 1.062
0.0LysXaa: 0.0 ± 0.0
Leu
3.026LeuAla: 3.026 ± 0.657
1.729LeuCys: 1.729 ± 0.768
7.35LeuAsp: 7.35 ± 1.392
7.35LeuGlu: 7.35 ± 1.48
2.378LeuPhe: 2.378 ± 0.443
5.188LeuGly: 5.188 ± 1.074
1.946LeuHis: 1.946 ± 0.375
9.079LeuIle: 9.079 ± 1.93
7.782LeuLys: 7.782 ± 1.521
6.269LeuLeu: 6.269 ± 1.117
2.81LeuMet: 2.81 ± 0.65
7.134LeuAsn: 7.134 ± 1.659
2.81LeuPro: 2.81 ± 0.605
3.459LeuGln: 3.459 ± 0.783
5.62LeuArg: 5.62 ± 1.653
7.35LeuSer: 7.35 ± 1.468
4.107LeuThr: 4.107 ± 1.079
3.675LeuVal: 3.675 ± 0.86
0.865LeuTrp: 0.865 ± 0.434
2.378LeuTyr: 2.378 ± 0.409
0.0LeuXaa: 0.0 ± 0.0
Met
1.081MetAla: 1.081 ± 0.451
0.865MetCys: 0.865 ± 0.279
1.729MetAsp: 1.729 ± 1.076
3.026MetGlu: 3.026 ± 0.473
1.297MetPhe: 1.297 ± 0.452
0.865MetGly: 0.865 ± 0.388
0.216MetHis: 0.216 ± 0.141
2.162MetIle: 2.162 ± 0.709
2.162MetLys: 2.162 ± 0.727
2.378MetLeu: 2.378 ± 1.036
0.865MetMet: 0.865 ± 0.531
2.162MetAsn: 2.162 ± 0.305
0.432MetPro: 0.432 ± 0.599
0.216MetGln: 0.216 ± 0.141
0.865MetArg: 0.865 ± 0.396
2.162MetSer: 2.162 ± 0.61
1.081MetThr: 1.081 ± 0.506
2.594MetVal: 2.594 ± 0.425
0.216MetTrp: 0.216 ± 0.244
0.865MetTyr: 0.865 ± 0.42
0.0MetXaa: 0.0 ± 0.0
Asn
1.729AsnAla: 1.729 ± 1.106
2.378AsnCys: 2.378 ± 0.609
3.891AsnAsp: 3.891 ± 0.584
4.756AsnGlu: 4.756 ± 1.333
2.594AsnPhe: 2.594 ± 0.989
2.162AsnGly: 2.162 ± 0.839
2.162AsnHis: 2.162 ± 0.58
4.972AsnIle: 4.972 ± 1.282
6.053AsnLys: 6.053 ± 1.227
7.782AsnLeu: 7.782 ± 1.07
1.297AsnMet: 1.297 ± 0.481
4.756AsnAsn: 4.756 ± 0.83
1.946AsnPro: 1.946 ± 0.297
3.026AsnGln: 3.026 ± 0.629
2.81AsnArg: 2.81 ± 0.623
3.675AsnSer: 3.675 ± 1.133
2.162AsnThr: 2.162 ± 0.616
2.378AsnVal: 2.378 ± 1.19
2.378AsnTrp: 2.378 ± 0.876
2.81AsnTyr: 2.81 ± 1.04
0.0AsnXaa: 0.0 ± 0.0
Pro
1.297ProAla: 1.297 ± 0.657
0.216ProCys: 0.216 ± 0.141
1.729ProAsp: 1.729 ± 0.272
2.162ProGlu: 2.162 ± 0.575
1.946ProPhe: 1.946 ± 1.391
1.946ProGly: 1.946 ± 1.049
1.297ProHis: 1.297 ± 0.568
2.162ProIle: 2.162 ± 0.776
1.946ProLys: 1.946 ± 0.821
3.026ProLeu: 3.026 ± 1.063
0.865ProMet: 0.865 ± 0.438
1.297ProAsn: 1.297 ± 0.419
1.729ProPro: 1.729 ± 0.847
0.649ProGln: 0.649 ± 0.612
1.946ProArg: 1.946 ± 0.5
2.81ProSer: 2.81 ± 0.474
2.162ProThr: 2.162 ± 0.8
1.729ProVal: 1.729 ± 0.597
0.432ProTrp: 0.432 ± 0.294
2.594ProTyr: 2.594 ± 0.855
0.0ProXaa: 0.0 ± 0.0
Gln
0.216GlnAla: 0.216 ± 0.223
0.216GlnCys: 0.216 ± 0.141
1.729GlnAsp: 1.729 ± 0.467
1.729GlnGlu: 1.729 ± 0.624
0.649GlnPhe: 0.649 ± 0.365
1.946GlnGly: 1.946 ± 0.483
0.649GlnHis: 0.649 ± 0.32
3.675GlnIle: 3.675 ± 0.828
2.378GlnLys: 2.378 ± 0.307
1.513GlnLeu: 1.513 ± 0.753
1.297GlnMet: 1.297 ± 0.843
2.162GlnAsn: 2.162 ± 0.422
0.649GlnPro: 0.649 ± 0.301
0.432GlnGln: 0.432 ± 0.281
1.081GlnArg: 1.081 ± 0.352
1.946GlnSer: 1.946 ± 0.398
0.865GlnThr: 0.865 ± 0.531
1.946GlnVal: 1.946 ± 0.479
0.216GlnTrp: 0.216 ± 0.141
0.432GlnTyr: 0.432 ± 0.266
0.0GlnXaa: 0.0 ± 0.0
Arg
1.297ArgAla: 1.297 ± 0.843
1.297ArgCys: 1.297 ± 0.334
2.594ArgAsp: 2.594 ± 0.953
2.81ArgGlu: 2.81 ± 0.671
2.378ArgPhe: 2.378 ± 0.788
3.675ArgGly: 3.675 ± 0.636
1.513ArgHis: 1.513 ± 0.702
2.594ArgIle: 2.594 ± 0.829
4.54ArgLys: 4.54 ± 0.987
3.459ArgLeu: 3.459 ± 0.837
1.297ArgMet: 1.297 ± 0.607
2.594ArgAsn: 2.594 ± 0.66
1.946ArgPro: 1.946 ± 0.478
0.865ArgGln: 0.865 ± 0.331
1.946ArgArg: 1.946 ± 0.639
5.62ArgSer: 5.62 ± 1.101
1.946ArgThr: 1.946 ± 0.492
2.378ArgVal: 2.378 ± 0.447
0.865ArgTrp: 0.865 ± 0.401
1.729ArgTyr: 1.729 ± 0.514
0.0ArgXaa: 0.0 ± 0.0
Ser
2.378SerAla: 2.378 ± 0.418
1.729SerCys: 1.729 ± 0.361
4.323SerAsp: 4.323 ± 0.605
7.134SerGlu: 7.134 ± 1.52
2.378SerPhe: 2.378 ± 0.384
1.946SerGly: 1.946 ± 0.834
2.162SerHis: 2.162 ± 0.571
6.917SerIle: 6.917 ± 1.152
4.972SerLys: 4.972 ± 1.382
7.566SerLeu: 7.566 ± 1.279
1.729SerMet: 1.729 ± 0.481
4.54SerAsn: 4.54 ± 1.295
1.297SerPro: 1.297 ± 0.593
2.162SerGln: 2.162 ± 0.749
4.972SerArg: 4.972 ± 0.937
4.972SerSer: 4.972 ± 1.168
3.459SerThr: 3.459 ± 0.751
1.513SerVal: 1.513 ± 1.184
1.729SerTrp: 1.729 ± 0.527
3.675SerTyr: 3.675 ± 1.399
0.0SerXaa: 0.0 ± 0.0
Thr
0.649ThrAla: 0.649 ± 0.353
0.0ThrCys: 0.0 ± 0.0
1.513ThrAsp: 1.513 ± 0.424
4.107ThrGlu: 4.107 ± 1.394
0.649ThrPhe: 0.649 ± 0.497
3.026ThrGly: 3.026 ± 0.913
1.297ThrHis: 1.297 ± 0.464
4.54ThrIle: 4.54 ± 0.818
3.026ThrLys: 3.026 ± 0.946
1.297ThrLeu: 1.297 ± 0.568
1.513ThrMet: 1.513 ± 0.486
4.54ThrAsn: 4.54 ± 0.754
1.513ThrPro: 1.513 ± 0.513
0.649ThrGln: 0.649 ± 0.362
1.297ThrArg: 1.297 ± 0.638
3.459ThrSer: 3.459 ± 0.829
2.162ThrThr: 2.162 ± 0.405
3.243ThrVal: 3.243 ± 0.868
1.297ThrTrp: 1.297 ± 0.651
1.297ThrTyr: 1.297 ± 0.425
0.0ThrXaa: 0.0 ± 0.0
Val
1.513ValAla: 1.513 ± 0.664
1.081ValCys: 1.081 ± 0.343
2.594ValAsp: 2.594 ± 0.754
2.594ValGlu: 2.594 ± 1.183
1.729ValPhe: 1.729 ± 0.617
3.459ValGly: 3.459 ± 1.138
1.297ValHis: 1.297 ± 0.645
2.162ValIle: 2.162 ± 0.556
5.404ValLys: 5.404 ± 0.547
5.62ValLeu: 5.62 ± 1.405
1.297ValMet: 1.297 ± 0.411
4.107ValAsn: 4.107 ± 0.63
3.026ValPro: 3.026 ± 0.808
0.649ValGln: 0.649 ± 0.422
1.729ValArg: 1.729 ± 0.634
4.972ValSer: 4.972 ± 0.829
2.81ValThr: 2.81 ± 0.793
1.946ValVal: 1.946 ± 1.054
0.649ValTrp: 0.649 ± 0.325
2.81ValTyr: 2.81 ± 0.645
0.0ValXaa: 0.0 ± 0.0
Trp
0.216TrpAla: 0.216 ± 0.141
0.649TrpCys: 0.649 ± 0.301
1.081TrpAsp: 1.081 ± 0.449
2.378TrpGlu: 2.378 ± 0.65
0.865TrpPhe: 0.865 ± 0.401
1.513TrpGly: 1.513 ± 0.404
0.432TrpHis: 0.432 ± 0.222
1.297TrpIle: 1.297 ± 0.419
1.297TrpLys: 1.297 ± 0.532
0.865TrpLeu: 0.865 ± 0.371
0.216TrpMet: 0.216 ± 0.141
1.513TrpAsn: 1.513 ± 0.507
0.865TrpPro: 0.865 ± 0.401
0.432TrpGln: 0.432 ± 0.281
0.432TrpArg: 0.432 ± 0.313
0.865TrpSer: 0.865 ± 0.384
1.081TrpThr: 1.081 ± 0.509
1.297TrpVal: 1.297 ± 0.568
0.432TrpTrp: 0.432 ± 0.377
0.649TrpTyr: 0.649 ± 0.426
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.297TyrAla: 1.297 ± 0.399
0.432TyrCys: 0.432 ± 0.393
2.81TyrAsp: 2.81 ± 0.702
2.594TyrGlu: 2.594 ± 0.897
1.297TyrPhe: 1.297 ± 0.434
2.378TyrGly: 2.378 ± 0.455
1.297TyrHis: 1.297 ± 0.409
3.026TyrIle: 3.026 ± 0.573
3.459TyrLys: 3.459 ± 0.561
5.404TyrLeu: 5.404 ± 1.167
1.297TyrMet: 1.297 ± 0.563
3.459TyrAsn: 3.459 ± 0.469
1.729TyrPro: 1.729 ± 0.525
0.865TyrGln: 0.865 ± 0.354
1.513TyrArg: 1.513 ± 0.521
2.162TyrSer: 2.162 ± 0.435
1.513TyrThr: 1.513 ± 0.38
2.162TyrVal: 2.162 ± 0.387
0.216TyrTrp: 0.216 ± 0.244
3.243TyrTyr: 3.243 ± 0.449
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (4627 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski