Amino acid dipepetide frequency for Olivier s shrew virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.202AlaAla: 5.202 ± 0.789
2.378AlaCys: 2.378 ± 0.512
2.527AlaAsp: 2.527 ± 0.411
3.567AlaGlu: 3.567 ± 0.631
3.567AlaPhe: 3.567 ± 0.829
6.986AlaGly: 6.986 ± 1.642
2.378AlaHis: 2.378 ± 0.841
4.459AlaIle: 4.459 ± 0.975
4.31AlaLys: 4.31 ± 0.57
9.661AlaLeu: 9.661 ± 1.085
1.338AlaMet: 1.338 ± 0.356
1.932AlaAsn: 1.932 ± 0.407
2.973AlaPro: 2.973 ± 0.501
0.743AlaGln: 0.743 ± 0.427
5.054AlaArg: 5.054 ± 1.121
8.026AlaSer: 8.026 ± 1.265
4.013AlaThr: 4.013 ± 0.718
4.905AlaVal: 4.905 ± 0.88
1.04AlaTrp: 1.04 ± 0.355
1.635AlaTyr: 1.635 ± 0.522
0.0AlaXaa: 0.0 ± 0.0
Cys
1.932CysAla: 1.932 ± 0.716
1.338CysCys: 1.338 ± 0.48
2.973CysAsp: 2.973 ± 0.69
0.892CysGlu: 0.892 ± 0.389
1.04CysPhe: 1.04 ± 0.711
2.675CysGly: 2.675 ± 0.326
1.04CysHis: 1.04 ± 0.265
0.595CysIle: 0.595 ± 0.301
1.635CysLys: 1.635 ± 1.154
4.013CysLeu: 4.013 ± 0.766
0.149CysMet: 0.149 ± 0.36
1.189CysAsn: 1.189 ± 0.342
1.486CysPro: 1.486 ± 0.53
0.149CysGln: 0.149 ± 0.096
1.635CysArg: 1.635 ± 0.309
2.378CysSer: 2.378 ± 0.616
2.824CysThr: 2.824 ± 0.429
3.567CysVal: 3.567 ± 0.749
1.189CysTrp: 1.189 ± 0.489
1.784CysTyr: 1.784 ± 0.371
0.0CysXaa: 0.0 ± 0.0
Asp
3.419AspAla: 3.419 ± 0.829
1.784AspCys: 1.784 ± 0.382
2.527AspAsp: 2.527 ± 0.621
2.824AspGlu: 2.824 ± 0.987
3.27AspPhe: 3.27 ± 0.794
3.567AspGly: 3.567 ± 1.269
1.04AspHis: 1.04 ± 0.561
1.486AspIle: 1.486 ± 0.501
2.081AspLys: 2.081 ± 0.523
5.351AspLeu: 5.351 ± 0.775
0.892AspMet: 0.892 ± 0.337
1.189AspAsn: 1.189 ± 0.4
3.121AspPro: 3.121 ± 1.049
1.486AspGln: 1.486 ± 0.571
2.824AspArg: 2.824 ± 0.671
2.527AspSer: 2.527 ± 0.588
3.121AspThr: 3.121 ± 0.698
4.013AspVal: 4.013 ± 0.854
1.486AspTrp: 1.486 ± 0.501
1.486AspTyr: 1.486 ± 0.425
0.0AspXaa: 0.0 ± 0.0
Glu
2.973GluAla: 2.973 ± 0.544
0.892GluCys: 0.892 ± 0.337
2.229GluAsp: 2.229 ± 0.592
1.338GluGlu: 1.338 ± 0.563
2.527GluPhe: 2.527 ± 0.535
4.459GluGly: 4.459 ± 1.174
1.189GluHis: 1.189 ± 0.288
0.595GluIle: 0.595 ± 0.386
0.892GluLys: 0.892 ± 0.286
4.162GluLeu: 4.162 ± 0.298
0.595GluMet: 0.595 ± 0.232
0.149GluAsn: 0.149 ± 0.096
1.784GluPro: 1.784 ± 0.612
1.486GluGln: 1.486 ± 0.53
2.378GluArg: 2.378 ± 0.676
1.635GluSer: 1.635 ± 0.293
1.635GluThr: 1.635 ± 0.362
2.527GluVal: 2.527 ± 0.511
0.595GluTrp: 0.595 ± 0.296
0.743GluTyr: 0.743 ± 0.361
0.0GluXaa: 0.0 ± 0.0
Phe
4.905PheAla: 4.905 ± 0.649
1.784PheCys: 1.784 ± 0.351
2.675PheAsp: 2.675 ± 0.721
0.743PheGlu: 0.743 ± 0.318
1.932PhePhe: 1.932 ± 0.715
4.013PheGly: 4.013 ± 1.017
1.189PheHis: 1.189 ± 0.641
0.892PheIle: 0.892 ± 0.963
0.595PheLys: 0.595 ± 0.972
5.499PheLeu: 5.499 ± 0.482
0.892PheMet: 0.892 ± 0.389
1.784PheAsn: 1.784 ± 0.292
4.013PhePro: 4.013 ± 0.868
1.338PheGln: 1.338 ± 0.513
1.486PheArg: 1.486 ± 0.501
3.27PheSer: 3.27 ± 0.411
1.635PheThr: 1.635 ± 1.259
4.013PheVal: 4.013 ± 0.851
2.081PheTrp: 2.081 ± 0.538
0.743PheTyr: 0.743 ± 0.307
0.0PheXaa: 0.0 ± 0.0
Gly
6.094GlyAla: 6.094 ± 1.089
2.824GlyCys: 2.824 ± 0.622
5.797GlyAsp: 5.797 ± 1.225
2.675GlyGlu: 2.675 ± 0.651
3.121GlyPhe: 3.121 ± 0.462
4.31GlyGly: 4.31 ± 1.116
1.635GlyHis: 1.635 ± 0.434
3.419GlyIle: 3.419 ± 0.504
5.351GlyLys: 5.351 ± 1.24
8.918GlyLeu: 8.918 ± 1.751
0.743GlyMet: 0.743 ± 0.34
2.378GlyAsn: 2.378 ± 0.585
3.716GlyPro: 3.716 ± 0.868
1.189GlyGln: 1.189 ± 0.393
1.932GlyArg: 1.932 ± 0.554
8.026GlySer: 8.026 ± 1.193
5.202GlyThr: 5.202 ± 0.78
8.026GlyVal: 8.026 ± 1.102
0.446GlyTrp: 0.446 ± 0.88
4.013GlyTyr: 4.013 ± 0.61
0.0GlyXaa: 0.0 ± 0.0
His
1.338HisAla: 1.338 ± 0.498
1.635HisCys: 1.635 ± 0.309
1.338HisAsp: 1.338 ± 0.449
0.446HisGlu: 0.446 ± 0.292
0.892HisPhe: 0.892 ± 0.306
2.527HisGly: 2.527 ± 1.001
1.338HisHis: 1.338 ± 0.337
1.04HisIle: 1.04 ± 0.59
0.743HisLys: 0.743 ± 0.476
3.27HisLeu: 3.27 ± 1.751
0.149HisMet: 0.149 ± 0.413
1.486HisAsn: 1.486 ± 0.428
1.784HisPro: 1.784 ± 0.46
0.595HisGln: 0.595 ± 0.606
1.04HisArg: 1.04 ± 0.265
1.635HisSer: 1.635 ± 0.989
1.189HisThr: 1.189 ± 0.42
0.595HisVal: 0.595 ± 0.46
1.04HisTrp: 1.04 ± 0.288
1.04HisTyr: 1.04 ± 0.711
0.0HisXaa: 0.0 ± 0.0
Ile
2.675IleAla: 2.675 ± 0.461
1.784IleCys: 1.784 ± 0.446
1.338IleAsp: 1.338 ± 0.375
2.081IleGlu: 2.081 ± 0.498
1.932IlePhe: 1.932 ± 0.686
3.716IleGly: 3.716 ± 0.578
1.04IleHis: 1.04 ± 0.437
2.824IleIle: 2.824 ± 0.389
2.973IleLys: 2.973 ± 0.495
4.756IleLeu: 4.756 ± 1.002
1.04IleMet: 1.04 ± 0.905
1.932IleAsn: 1.932 ± 0.366
2.675IlePro: 2.675 ± 0.542
0.595IleGln: 0.595 ± 0.23
1.932IleArg: 1.932 ± 0.687
1.784IleSer: 1.784 ± 0.541
3.121IleThr: 3.121 ± 0.697
3.864IleVal: 3.864 ± 1.597
0.149IleTrp: 0.149 ± 0.096
2.527IleTyr: 2.527 ± 0.523
0.0IleXaa: 0.0 ± 0.0
Lys
3.716LysAla: 3.716 ± 0.981
1.189LysCys: 1.189 ± 0.647
1.784LysAsp: 1.784 ± 0.612
0.297LysGlu: 0.297 ± 0.112
1.932LysPhe: 1.932 ± 0.364
3.27LysGly: 3.27 ± 0.796
0.595LysHis: 0.595 ± 0.626
2.527LysIle: 2.527 ± 0.277
2.081LysLys: 2.081 ± 0.624
4.905LysLeu: 4.905 ± 1.124
0.743LysMet: 0.743 ± 0.825
1.635LysAsn: 1.635 ± 1.619
2.378LysPro: 2.378 ± 0.375
1.189LysGln: 1.189 ± 1.477
3.27LysArg: 3.27 ± 0.58
2.824LysSer: 2.824 ± 0.495
2.824LysThr: 2.824 ± 0.599
3.716LysVal: 3.716 ± 0.933
0.297LysTrp: 0.297 ± 0.275
1.784LysTyr: 1.784 ± 0.762
0.0LysXaa: 0.0 ± 0.0
Leu
10.999LeuAla: 10.999 ± 1.608
4.162LeuCys: 4.162 ± 1.487
6.094LeuAsp: 6.094 ± 1.065
4.608LeuGlu: 4.608 ± 0.881
2.824LeuPhe: 2.824 ± 1.143
7.432LeuGly: 7.432 ± 1.247
2.824LeuHis: 2.824 ± 0.485
2.675LeuIle: 2.675 ± 1.449
3.121LeuLys: 3.121 ± 1.146
14.566LeuLeu: 14.566 ± 1.248
2.675LeuMet: 2.675 ± 1.329
4.31LeuAsn: 4.31 ± 0.908
6.986LeuPro: 6.986 ± 1.563
1.932LeuGln: 1.932 ± 0.715
5.648LeuArg: 5.648 ± 0.658
9.661LeuSer: 9.661 ± 0.889
6.094LeuThr: 6.094 ± 0.647
9.364LeuVal: 9.364 ± 1.096
1.932LeuTrp: 1.932 ± 0.863
2.527LeuTyr: 2.527 ± 1.088
0.0LeuXaa: 0.0 ± 0.0
Met
1.784MetAla: 1.784 ± 0.438
0.446MetCys: 0.446 ± 0.283
0.297MetAsp: 0.297 ± 0.455
1.189MetGlu: 1.189 ± 0.591
0.446MetPhe: 0.446 ± 0.566
1.04MetGly: 1.04 ± 0.647
0.297MetHis: 0.297 ± 0.444
0.743MetIle: 0.743 ± 0.251
0.892MetLys: 0.892 ± 0.741
2.081MetLeu: 2.081 ± 0.782
0.0MetMet: 0.0 ± 0.0
0.446MetAsn: 0.446 ± 0.314
0.892MetPro: 0.892 ± 0.23
0.892MetGln: 0.892 ± 0.26
0.595MetArg: 0.595 ± 0.466
0.149MetSer: 0.149 ± 0.096
1.635MetThr: 1.635 ± 1.135
1.784MetVal: 1.784 ± 1.136
0.297MetTrp: 0.297 ± 0.112
0.297MetTyr: 0.297 ± 0.462
0.0MetXaa: 0.0 ± 0.0
Asn
1.338AsnAla: 1.338 ± 0.458
1.338AsnCys: 1.338 ± 0.325
1.635AsnAsp: 1.635 ± 0.576
1.189AsnGlu: 1.189 ± 0.4
1.932AsnPhe: 1.932 ± 0.322
4.756AsnGly: 4.756 ± 0.743
0.892AsnHis: 0.892 ± 0.444
1.635AsnIle: 1.635 ± 0.362
2.378AsnLys: 2.378 ± 0.355
4.162AsnLeu: 4.162 ± 0.958
0.297AsnMet: 0.297 ± 0.112
1.338AsnAsn: 1.338 ± 0.904
1.784AsnPro: 1.784 ± 0.416
1.486AsnGln: 1.486 ± 0.608
1.04AsnArg: 1.04 ± 0.265
2.081AsnSer: 2.081 ± 0.764
2.527AsnThr: 2.527 ± 0.823
1.932AsnVal: 1.932 ± 0.567
0.149AsnTrp: 0.149 ± 0.259
0.892AsnTyr: 0.892 ± 0.349
0.0AsnXaa: 0.0 ± 0.0
Pro
4.31ProAla: 4.31 ± 0.964
2.081ProCys: 2.081 ± 0.461
3.864ProAsp: 3.864 ± 0.561
1.635ProGlu: 1.635 ± 0.449
3.27ProPhe: 3.27 ± 0.747
5.499ProGly: 5.499 ± 0.802
1.04ProHis: 1.04 ± 0.379
2.378ProIle: 2.378 ± 0.656
2.973ProLys: 2.973 ± 0.699
4.608ProLeu: 4.608 ± 0.845
0.297ProMet: 0.297 ± 0.254
2.527ProAsn: 2.527 ± 0.684
6.688ProPro: 6.688 ± 1.956
1.635ProGln: 1.635 ± 0.405
2.527ProArg: 2.527 ± 0.585
3.864ProSer: 3.864 ± 0.762
4.013ProThr: 4.013 ± 0.611
7.283ProVal: 7.283 ± 2.259
0.446ProTrp: 0.446 ± 0.217
1.932ProTyr: 1.932 ± 0.775
0.0ProXaa: 0.0 ± 0.0
Gln
2.378GlnAla: 2.378 ± 0.61
0.297GlnCys: 0.297 ± 0.112
0.892GlnAsp: 0.892 ± 0.306
0.743GlnGlu: 0.743 ± 0.251
0.149GlnPhe: 0.149 ± 0.363
2.527GlnGly: 2.527 ± 0.727
1.04GlnHis: 1.04 ± 0.294
0.297GlnIle: 0.297 ± 0.275
1.04GlnLys: 1.04 ± 0.411
3.121GlnLeu: 3.121 ± 0.558
0.595GlnMet: 0.595 ± 0.232
0.297GlnAsn: 0.297 ± 0.193
2.081GlnPro: 2.081 ± 0.577
1.486GlnGln: 1.486 ± 0.422
1.486GlnArg: 1.486 ± 0.598
1.486GlnSer: 1.486 ± 0.331
1.486GlnThr: 1.486 ± 0.316
2.675GlnVal: 2.675 ± 0.741
0.892GlnTrp: 0.892 ± 0.306
0.892GlnTyr: 0.892 ± 0.469
0.0GlnXaa: 0.0 ± 0.0
Arg
2.675ArgAla: 2.675 ± 0.494
1.189ArgCys: 1.189 ± 0.277
2.675ArgAsp: 2.675 ± 0.557
2.378ArgGlu: 2.378 ± 0.371
3.121ArgPhe: 3.121 ± 0.874
3.27ArgGly: 3.27 ± 0.497
1.04ArgHis: 1.04 ± 1.368
3.121ArgIle: 3.121 ± 0.648
2.081ArgLys: 2.081 ± 0.772
4.459ArgLeu: 4.459 ± 0.75
1.338ArgMet: 1.338 ± 0.325
1.486ArgAsn: 1.486 ± 0.374
3.419ArgPro: 3.419 ± 0.526
0.743ArgGln: 0.743 ± 0.252
3.121ArgArg: 3.121 ± 1.049
2.824ArgSer: 2.824 ± 0.521
3.567ArgThr: 3.567 ± 1.038
2.527ArgVal: 2.527 ± 0.828
0.297ArgTrp: 0.297 ± 0.394
1.486ArgTyr: 1.486 ± 0.614
0.0ArgXaa: 0.0 ± 0.0
Ser
7.58SerAla: 7.58 ± 0.932
1.486SerCys: 1.486 ± 0.321
2.973SerAsp: 2.973 ± 0.838
1.635SerGlu: 1.635 ± 0.449
3.716SerPhe: 3.716 ± 0.649
4.905SerGly: 4.905 ± 1.106
1.189SerHis: 1.189 ± 0.989
3.864SerIle: 3.864 ± 1.417
2.973SerLys: 2.973 ± 0.669
7.58SerLeu: 7.58 ± 1.163
2.081SerMet: 2.081 ± 0.75
3.27SerAsn: 3.27 ± 0.633
4.756SerPro: 4.756 ± 0.638
2.081SerGln: 2.081 ± 0.714
3.716SerArg: 3.716 ± 0.628
4.162SerSer: 4.162 ± 1.913
4.31SerThr: 4.31 ± 1.025
6.391SerVal: 6.391 ± 0.446
1.486SerTrp: 1.486 ± 0.486
1.932SerTyr: 1.932 ± 1.052
0.0SerXaa: 0.0 ± 0.0
Thr
5.499ThrAla: 5.499 ± 0.747
2.081ThrCys: 2.081 ± 0.34
1.932ThrAsp: 1.932 ± 0.497
1.932ThrGlu: 1.932 ± 0.514
2.378ThrPhe: 2.378 ± 0.729
5.351ThrGly: 5.351 ± 1.389
0.892ThrHis: 0.892 ± 0.739
4.013ThrIle: 4.013 ± 0.893
2.081ThrLys: 2.081 ± 0.426
5.351ThrLeu: 5.351 ± 1.801
1.04ThrMet: 1.04 ± 0.791
2.378ThrAsn: 2.378 ± 0.613
5.054ThrPro: 5.054 ± 1.001
2.081ThrGln: 2.081 ± 0.331
1.486ThrArg: 1.486 ± 0.506
6.094ThrSer: 6.094 ± 1.176
4.756ThrThr: 4.756 ± 0.833
5.648ThrVal: 5.648 ± 1.37
0.892ThrTrp: 0.892 ± 0.73
1.338ThrTyr: 1.338 ± 1.246
0.0ThrXaa: 0.0 ± 0.0
Val
4.459ValAla: 4.459 ± 0.414
3.27ValCys: 3.27 ± 0.641
4.162ValAsp: 4.162 ± 0.641
2.973ValGlu: 2.973 ± 0.863
3.864ValPhe: 3.864 ± 0.552
6.094ValGly: 6.094 ± 1.009
2.527ValHis: 2.527 ± 0.534
5.202ValIle: 5.202 ± 1.2
2.973ValLys: 2.973 ± 0.72
8.026ValLeu: 8.026 ± 0.788
0.892ValMet: 0.892 ± 0.492
3.419ValAsn: 3.419 ± 0.672
5.945ValPro: 5.945 ± 0.886
2.973ValGln: 2.973 ± 0.378
3.419ValArg: 3.419 ± 1.186
6.54ValSer: 6.54 ± 1.755
6.243ValThr: 6.243 ± 1.079
7.134ValVal: 7.134 ± 1.946
1.338ValTrp: 1.338 ± 0.726
2.229ValTyr: 2.229 ± 0.71
0.0ValXaa: 0.0 ± 0.0
Trp
0.595TrpAla: 0.595 ± 0.224
0.892TrpCys: 0.892 ± 0.739
0.892TrpAsp: 0.892 ± 0.306
0.446TrpGlu: 0.446 ± 0.153
1.189TrpPhe: 1.189 ± 0.277
0.892TrpGly: 0.892 ± 0.571
0.0TrpHis: 0.0 ± 0.0
0.892TrpIle: 0.892 ± 0.26
0.892TrpLys: 0.892 ± 0.306
2.973TrpLeu: 2.973 ± 0.391
0.149TrpMet: 0.149 ± 0.096
0.743TrpAsn: 0.743 ± 0.451
0.743TrpPro: 0.743 ± 0.307
0.595TrpGln: 0.595 ± 0.23
1.338TrpArg: 1.338 ± 0.834
0.743TrpSer: 0.743 ± 0.992
1.04TrpThr: 1.04 ± 0.542
1.189TrpVal: 1.189 ± 1.007
1.04TrpTrp: 1.04 ± 0.437
0.892TrpTyr: 0.892 ± 0.286
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.824TyrAla: 2.824 ± 0.631
1.635TyrCys: 1.635 ± 0.293
0.892TyrAsp: 0.892 ± 0.603
1.189TyrGlu: 1.189 ± 0.429
2.378TyrPhe: 2.378 ± 1.236
2.675TyrGly: 2.675 ± 1.453
1.932TyrHis: 1.932 ± 0.772
2.081TyrIle: 2.081 ± 0.718
0.892TyrLys: 0.892 ± 0.346
2.973TyrLeu: 2.973 ± 0.709
0.149TyrMet: 0.149 ± 0.406
0.892TyrAsn: 0.892 ± 0.349
0.595TyrPro: 0.595 ± 0.376
1.189TyrGln: 1.189 ± 0.46
0.892TyrArg: 0.892 ± 0.578
2.378TyrSer: 2.378 ± 1.016
1.04TyrThr: 1.04 ± 0.357
2.527TyrVal: 2.527 ± 0.555
1.04TyrTrp: 1.04 ± 0.363
3.121TyrTyr: 3.121 ± 0.899
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (6729 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski