Amino acid dipepetide frequency for Ohlsdorf virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.349AlaAla: 1.349 ± 0.557
1.079AlaCys: 1.079 ± 0.445
2.967AlaAsp: 2.967 ± 0.642
1.888AlaGlu: 1.888 ± 0.749
1.079AlaPhe: 1.079 ± 0.592
1.888AlaGly: 1.888 ± 0.803
0.809AlaHis: 0.809 ± 0.37
5.125AlaIle: 5.125 ± 0.647
2.967AlaLys: 2.967 ± 2.084
4.586AlaLeu: 4.586 ± 0.845
0.809AlaMet: 0.809 ± 0.61
2.428AlaAsn: 2.428 ± 0.464
1.349AlaPro: 1.349 ± 0.356
2.698AlaGln: 2.698 ± 1.412
2.698AlaArg: 2.698 ± 1.114
4.046AlaSer: 4.046 ± 1.376
4.316AlaThr: 4.316 ± 1.023
1.079AlaVal: 1.079 ± 0.369
0.27AlaTrp: 0.27 ± 0.148
2.428AlaTyr: 2.428 ± 1.046
0.0AlaXaa: 0.0 ± 0.0
Cys
0.54CysAla: 0.54 ± 0.283
0.0CysCys: 0.0 ± 0.0
1.079CysAsp: 1.079 ± 0.669
1.349CysGlu: 1.349 ± 0.578
0.54CysPhe: 0.54 ± 0.296
0.0CysGly: 0.0 ± 0.0
0.27CysHis: 0.27 ± 0.343
0.54CysIle: 0.54 ± 0.536
0.809CysLys: 0.809 ± 0.504
1.079CysLeu: 1.079 ± 0.592
0.54CysMet: 0.54 ± 0.283
0.54CysAsn: 0.54 ± 0.731
0.27CysPro: 0.27 ± 0.148
1.079CysGln: 1.079 ± 0.366
0.27CysArg: 0.27 ± 0.343
2.158CysSer: 2.158 ± 1.491
1.349CysThr: 1.349 ± 0.471
0.27CysVal: 0.27 ± 0.148
0.54CysTrp: 0.54 ± 0.296
0.54CysTyr: 0.54 ± 0.296
0.0CysXaa: 0.0 ± 0.0
Asp
2.158AspAla: 2.158 ± 0.629
0.809AspCys: 0.809 ± 0.822
3.507AspAsp: 3.507 ± 1.817
1.619AspGlu: 1.619 ± 0.594
2.158AspPhe: 2.158 ± 0.869
2.158AspGly: 2.158 ± 1.11
1.079AspHis: 1.079 ± 0.354
4.316AspIle: 4.316 ± 1.247
4.046AspLys: 4.046 ± 0.84
7.014AspLeu: 7.014 ± 1.597
2.158AspMet: 2.158 ± 0.654
4.046AspAsn: 4.046 ± 0.519
3.237AspPro: 3.237 ± 0.664
2.967AspGln: 2.967 ± 1.27
2.698AspArg: 2.698 ± 0.98
2.698AspSer: 2.698 ± 0.809
3.237AspThr: 3.237 ± 0.885
2.158AspVal: 2.158 ± 0.695
0.809AspTrp: 0.809 ± 0.429
2.698AspTyr: 2.698 ± 0.988
0.0AspXaa: 0.0 ± 0.0
Glu
1.619GluAla: 1.619 ± 1.149
2.158GluCys: 2.158 ± 0.875
2.698GluAsp: 2.698 ± 0.712
2.428GluGlu: 2.428 ± 0.73
2.698GluPhe: 2.698 ± 1.124
3.237GluGly: 3.237 ± 1.004
2.158GluHis: 2.158 ± 0.603
3.507GluIle: 3.507 ± 1.001
3.237GluLys: 3.237 ± 0.979
6.744GluLeu: 6.744 ± 1.496
1.888GluMet: 1.888 ± 0.525
1.079GluAsn: 1.079 ± 0.32
1.619GluPro: 1.619 ± 0.673
1.079GluGln: 1.079 ± 0.708
1.619GluArg: 1.619 ± 0.371
6.204GluSer: 6.204 ± 1.543
2.428GluThr: 2.428 ± 0.583
3.777GluVal: 3.777 ± 1.051
1.079GluTrp: 1.079 ± 1.14
2.428GluTyr: 2.428 ± 0.9
0.0GluXaa: 0.0 ± 0.0
Phe
2.967PheAla: 2.967 ± 0.546
0.27PheCys: 0.27 ± 0.148
2.698PheAsp: 2.698 ± 1.197
1.349PheGlu: 1.349 ± 0.7
2.698PhePhe: 2.698 ± 1.43
2.428PheGly: 2.428 ± 0.598
2.158PheHis: 2.158 ± 0.604
2.967PheIle: 2.967 ± 0.826
2.698PheLys: 2.698 ± 0.681
3.777PheLeu: 3.777 ± 1.076
0.809PheMet: 0.809 ± 0.36
2.158PheAsn: 2.158 ± 1.025
4.046PhePro: 4.046 ± 1.633
1.619PheGln: 1.619 ± 0.888
2.158PheArg: 2.158 ± 0.917
2.698PheSer: 2.698 ± 1.242
1.888PheThr: 1.888 ± 0.682
1.888PheVal: 1.888 ± 0.512
0.809PheTrp: 0.809 ± 0.3
1.888PheTyr: 1.888 ± 1.166
0.0PheXaa: 0.0 ± 0.0
Gly
0.809GlyAla: 0.809 ± 0.37
0.809GlyCys: 0.809 ± 0.61
4.046GlyAsp: 4.046 ± 1.0
4.316GlyGlu: 4.316 ± 1.626
2.428GlyPhe: 2.428 ± 1.211
4.316GlyGly: 4.316 ± 0.541
0.54GlyHis: 0.54 ± 0.296
2.698GlyIle: 2.698 ± 0.696
3.237GlyLys: 3.237 ± 0.824
6.744GlyLeu: 6.744 ± 1.734
1.619GlyMet: 1.619 ± 0.673
3.237GlyAsn: 3.237 ± 0.704
1.079GlyPro: 1.079 ± 0.366
1.349GlyGln: 1.349 ± 0.551
1.349GlyArg: 1.349 ± 0.96
4.046GlySer: 4.046 ± 1.653
2.428GlyThr: 2.428 ± 1.317
2.428GlyVal: 2.428 ± 0.856
1.079GlyTrp: 1.079 ± 0.366
1.079GlyTyr: 1.079 ± 0.354
0.0GlyXaa: 0.0 ± 0.0
His
2.158HisAla: 2.158 ± 0.629
0.0HisCys: 0.0 ± 0.0
0.54HisAsp: 0.54 ± 0.296
2.158HisGlu: 2.158 ± 0.629
0.809HisPhe: 0.809 ± 0.444
0.809HisGly: 0.809 ± 0.37
0.54HisHis: 0.54 ± 0.283
1.349HisIle: 1.349 ± 0.578
1.349HisLys: 1.349 ± 0.74
3.777HisLeu: 3.777 ± 0.492
0.809HisMet: 0.809 ± 0.61
1.888HisAsn: 1.888 ± 0.458
2.428HisPro: 2.428 ± 1.033
0.27HisGln: 0.27 ± 0.343
1.888HisArg: 1.888 ± 0.559
2.158HisSer: 2.158 ± 0.416
1.619HisThr: 1.619 ± 0.371
2.428HisVal: 2.428 ± 0.721
0.27HisTrp: 0.27 ± 0.148
0.54HisTyr: 0.54 ± 0.283
0.0HisXaa: 0.0 ± 0.0
Ile
2.967IleAla: 2.967 ± 0.811
1.349IleCys: 1.349 ± 0.331
3.507IleAsp: 3.507 ± 0.465
3.507IleGlu: 3.507 ± 0.666
3.237IlePhe: 3.237 ± 0.825
3.777IleGly: 3.777 ± 1.661
2.428IleHis: 2.428 ± 0.343
5.125IleIle: 5.125 ± 0.59
7.284IleLys: 7.284 ± 0.987
6.474IleLeu: 6.474 ± 2.539
1.079IleMet: 1.079 ± 0.32
5.665IleAsn: 5.665 ± 2.688
5.395IlePro: 5.395 ± 1.276
1.349IleGln: 1.349 ± 0.74
4.316IleArg: 4.316 ± 0.994
7.823IleSer: 7.823 ± 2.041
4.046IleThr: 4.046 ± 1.721
3.237IleVal: 3.237 ± 0.636
1.079IleTrp: 1.079 ± 0.746
1.888IleTyr: 1.888 ± 0.741
0.0IleXaa: 0.0 ± 0.0
Lys
2.158LysAla: 2.158 ± 0.76
1.349LysCys: 1.349 ± 0.471
2.698LysAsp: 2.698 ± 0.955
4.316LysGlu: 4.316 ± 0.972
2.698LysPhe: 2.698 ± 0.684
3.507LysGly: 3.507 ± 0.55
2.698LysHis: 2.698 ± 1.48
6.474LysIle: 6.474 ± 0.78
4.316LysLys: 4.316 ± 1.233
4.856LysLeu: 4.856 ± 1.441
1.619LysMet: 1.619 ± 0.909
4.046LysAsn: 4.046 ± 0.929
2.967LysPro: 2.967 ± 1.177
2.158LysGln: 2.158 ± 1.118
3.237LysArg: 3.237 ± 0.778
5.665LysSer: 5.665 ± 1.456
3.777LysThr: 3.777 ± 1.883
4.856LysVal: 4.856 ± 1.54
1.888LysTrp: 1.888 ± 0.559
2.967LysTyr: 2.967 ± 0.896
0.0LysXaa: 0.0 ± 0.0
Leu
6.204LeuAla: 6.204 ± 1.212
0.809LeuCys: 0.809 ± 0.652
4.316LeuAsp: 4.316 ± 1.235
4.856LeuGlu: 4.856 ± 1.104
3.507LeuPhe: 3.507 ± 0.933
6.474LeuGly: 6.474 ± 1.455
1.619LeuHis: 1.619 ± 0.566
10.79LeuIle: 10.79 ± 1.854
7.284LeuLys: 7.284 ± 1.253
8.363LeuLeu: 8.363 ± 2.898
3.237LeuMet: 3.237 ± 1.134
5.125LeuAsn: 5.125 ± 1.484
2.698LeuPro: 2.698 ± 0.496
2.967LeuGln: 2.967 ± 0.769
5.395LeuArg: 5.395 ± 1.745
7.553LeuSer: 7.553 ± 0.665
5.395LeuThr: 5.395 ± 0.846
4.316LeuVal: 4.316 ± 0.881
1.349LeuTrp: 1.349 ± 0.48
4.046LeuTyr: 4.046 ± 0.902
0.0LeuXaa: 0.0 ± 0.0
Met
1.619MetAla: 1.619 ± 0.607
0.54MetCys: 0.54 ± 0.689
1.619MetAsp: 1.619 ± 0.587
1.079MetGlu: 1.079 ± 0.445
1.349MetPhe: 1.349 ± 0.331
0.54MetGly: 0.54 ± 0.3
0.0MetHis: 0.0 ± 0.0
1.619MetIle: 1.619 ± 0.673
2.158MetLys: 2.158 ± 0.604
2.428MetLeu: 2.428 ± 1.022
0.809MetMet: 0.809 ± 1.028
1.619MetAsn: 1.619 ± 0.639
1.079MetPro: 1.079 ± 0.445
0.809MetGln: 0.809 ± 0.37
1.349MetArg: 1.349 ± 0.504
2.698MetSer: 2.698 ± 2.102
1.349MetThr: 1.349 ± 0.331
2.158MetVal: 2.158 ± 1.387
0.54MetTrp: 0.54 ± 0.296
1.619MetTyr: 1.619 ± 0.848
0.0MetXaa: 0.0 ± 0.0
Asn
2.967AsnAla: 2.967 ± 0.919
0.27AsnCys: 0.27 ± 0.148
3.237AsnAsp: 3.237 ± 0.977
2.158AsnGlu: 2.158 ± 0.259
2.158AsnPhe: 2.158 ± 0.968
3.777AsnGly: 3.777 ± 1.783
1.888AsnHis: 1.888 ± 0.47
2.967AsnIle: 2.967 ± 0.587
3.237AsnLys: 3.237 ± 1.237
6.204AsnLeu: 6.204 ± 0.555
1.888AsnMet: 1.888 ± 0.559
3.777AsnAsn: 3.777 ± 0.994
2.698AsnPro: 2.698 ± 0.712
2.158AsnGln: 2.158 ± 0.821
1.888AsnArg: 1.888 ± 1.265
5.125AsnSer: 5.125 ± 0.783
2.967AsnThr: 2.967 ± 0.771
3.777AsnVal: 3.777 ± 1.047
1.619AsnTrp: 1.619 ± 0.888
1.888AsnTyr: 1.888 ± 1.689
0.0AsnXaa: 0.0 ± 0.0
Pro
1.349ProAla: 1.349 ± 0.356
1.079ProCys: 1.079 ± 0.366
2.698ProAsp: 2.698 ± 0.5
3.507ProGlu: 3.507 ± 0.781
1.619ProPhe: 1.619 ± 0.6
2.158ProGly: 2.158 ± 0.704
1.079ProHis: 1.079 ± 0.32
2.698ProIle: 2.698 ± 1.018
3.237ProLys: 3.237 ± 0.729
4.856ProLeu: 4.856 ± 1.05
0.54ProMet: 0.54 ± 0.27
3.237ProAsn: 3.237 ± 1.14
3.237ProPro: 3.237 ± 1.256
1.888ProGln: 1.888 ± 0.805
3.237ProArg: 3.237 ± 0.832
4.586ProSer: 4.586 ± 0.962
2.428ProThr: 2.428 ± 0.853
1.619ProVal: 1.619 ± 0.629
0.27ProTrp: 0.27 ± 0.148
2.158ProTyr: 2.158 ± 1.387
0.0ProXaa: 0.0 ± 0.0
Gln
1.079GlnAla: 1.079 ± 0.445
0.27GlnCys: 0.27 ± 0.148
2.698GlnAsp: 2.698 ± 0.928
2.158GlnGlu: 2.158 ± 1.318
1.079GlnPhe: 1.079 ± 1.081
2.158GlnGly: 2.158 ± 0.654
1.349GlnHis: 1.349 ± 0.551
0.809GlnIle: 0.809 ± 0.61
1.888GlnLys: 1.888 ± 0.797
2.698GlnLeu: 2.698 ± 0.889
0.27GlnMet: 0.27 ± 0.498
1.619GlnAsn: 1.619 ± 0.594
0.809GlnPro: 0.809 ± 0.37
0.809GlnGln: 0.809 ± 0.37
1.079GlnArg: 1.079 ± 0.369
4.046GlnSer: 4.046 ± 0.508
3.777GlnThr: 3.777 ± 0.918
1.349GlnVal: 1.349 ± 0.551
0.54GlnTrp: 0.54 ± 0.296
1.079GlnTyr: 1.079 ± 0.353
0.0GlnXaa: 0.0 ± 0.0
Arg
2.698ArgAla: 2.698 ± 0.832
0.54ArgCys: 0.54 ± 0.283
3.777ArgAsp: 3.777 ± 0.611
2.428ArgGlu: 2.428 ± 0.343
3.507ArgPhe: 3.507 ± 0.382
1.888ArgGly: 1.888 ± 0.559
2.158ArgHis: 2.158 ± 0.416
2.698ArgIle: 2.698 ± 0.677
3.777ArgLys: 3.777 ± 0.254
2.967ArgLeu: 2.967 ± 1.021
1.079ArgMet: 1.079 ± 0.592
2.428ArgAsn: 2.428 ± 0.757
2.428ArgPro: 2.428 ± 0.607
0.809ArgGln: 0.809 ± 0.408
1.079ArgArg: 1.079 ± 0.366
2.967ArgSer: 2.967 ± 0.688
2.967ArgThr: 2.967 ± 0.776
4.586ArgVal: 4.586 ± 0.789
0.809ArgTrp: 0.809 ± 0.3
2.158ArgTyr: 2.158 ± 0.259
0.0ArgXaa: 0.0 ± 0.0
Ser
5.395SerAla: 5.395 ± 1.332
0.809SerCys: 0.809 ± 1.096
4.316SerAsp: 4.316 ± 0.878
4.586SerGlu: 4.586 ± 0.528
3.507SerPhe: 3.507 ± 1.262
4.046SerGly: 4.046 ± 1.492
0.809SerHis: 0.809 ± 0.408
7.014SerIle: 7.014 ± 1.969
5.125SerLys: 5.125 ± 0.817
8.093SerLeu: 8.093 ± 2.702
2.158SerMet: 2.158 ± 0.738
3.237SerAsn: 3.237 ± 0.793
5.125SerPro: 5.125 ± 1.941
0.809SerGln: 0.809 ± 0.3
5.395SerArg: 5.395 ± 1.74
6.474SerSer: 6.474 ± 1.048
6.744SerThr: 6.744 ± 1.26
4.586SerVal: 4.586 ± 1.283
2.158SerTrp: 2.158 ± 0.732
4.856SerTyr: 4.856 ± 0.84
0.0SerXaa: 0.0 ± 0.0
Thr
2.428ThrAla: 2.428 ± 0.693
0.54ThrCys: 0.54 ± 0.296
2.698ThrAsp: 2.698 ± 1.077
3.777ThrGlu: 3.777 ± 0.97
2.428ThrPhe: 2.428 ± 1.412
3.237ThrGly: 3.237 ± 1.345
3.237ThrHis: 3.237 ± 0.885
4.586ThrIle: 4.586 ± 1.303
4.046ThrLys: 4.046 ± 1.684
5.395ThrLeu: 5.395 ± 0.915
1.079ThrMet: 1.079 ± 0.708
4.316ThrAsn: 4.316 ± 0.681
2.158ThrPro: 2.158 ± 0.728
2.428ThrGln: 2.428 ± 0.464
2.967ThrArg: 2.967 ± 1.006
4.586ThrSer: 4.586 ± 0.67
2.967ThrThr: 2.967 ± 1.223
5.125ThrVal: 5.125 ± 1.661
0.54ThrTrp: 0.54 ± 0.296
2.428ThrTyr: 2.428 ± 0.564
0.0ThrXaa: 0.0 ± 0.0
Val
2.967ValAla: 2.967 ± 1.606
0.54ValCys: 0.54 ± 0.345
3.507ValAsp: 3.507 ± 0.92
2.158ValGlu: 2.158 ± 0.489
4.046ValPhe: 4.046 ± 1.13
1.619ValGly: 1.619 ± 0.594
0.54ValHis: 0.54 ± 0.3
6.204ValIle: 6.204 ± 1.095
3.777ValLys: 3.777 ± 1.533
4.586ValLeu: 4.586 ± 0.74
2.158ValMet: 2.158 ± 0.49
3.507ValAsn: 3.507 ± 1.673
2.158ValPro: 2.158 ± 0.259
2.698ValGln: 2.698 ± 0.376
1.888ValArg: 1.888 ± 0.693
4.046ValSer: 4.046 ± 0.716
4.046ValThr: 4.046 ± 2.135
4.316ValVal: 4.316 ± 1.594
0.0ValTrp: 0.0 ± 0.0
1.888ValTyr: 1.888 ± 0.47
0.0ValXaa: 0.0 ± 0.0
Trp
0.54TrpAla: 0.54 ± 0.296
0.0TrpCys: 0.0 ± 0.0
0.809TrpAsp: 0.809 ± 0.444
1.349TrpGlu: 1.349 ± 0.74
0.809TrpPhe: 0.809 ± 0.429
0.27TrpGly: 0.27 ± 0.148
0.54TrpHis: 0.54 ± 0.296
1.619TrpIle: 1.619 ± 0.592
1.619TrpLys: 1.619 ± 0.888
1.349TrpLeu: 1.349 ± 0.628
0.809TrpMet: 0.809 ± 0.3
1.079TrpAsn: 1.079 ± 0.369
0.809TrpPro: 0.809 ± 0.3
0.0TrpGln: 0.0 ± 0.0
0.27TrpArg: 0.27 ± 0.148
2.158TrpSer: 2.158 ± 0.732
0.809TrpThr: 0.809 ± 0.61
0.54TrpVal: 0.54 ± 0.731
0.0TrpTrp: 0.0 ± 0.0
0.54TrpTyr: 0.54 ± 0.731
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.888TyrAla: 1.888 ± 0.554
0.54TyrCys: 0.54 ± 0.3
2.158TyrAsp: 2.158 ± 0.618
2.698TyrGlu: 2.698 ± 0.684
1.619TyrPhe: 1.619 ± 0.371
1.349TyrGly: 1.349 ± 0.74
1.888TyrHis: 1.888 ± 0.833
2.428TyrIle: 2.428 ± 1.29
2.158TyrLys: 2.158 ± 1.124
4.316TyrLeu: 4.316 ± 1.407
1.349TyrMet: 1.349 ± 0.598
1.619TyrAsn: 1.619 ± 1.17
1.888TyrPro: 1.888 ± 0.427
1.888TyrGln: 1.888 ± 1.026
2.967TyrArg: 2.967 ± 1.313
3.507TyrSer: 3.507 ± 1.142
2.428TyrThr: 2.428 ± 0.615
2.158TyrVal: 2.158 ± 0.777
0.27TyrTrp: 0.27 ± 0.365
1.349TyrTyr: 1.349 ± 0.471
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3708 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski