Amino acid dipepetide frequency for Leuconostoc phage CHB

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.59AlaAla: 0.59 ± 0.246
0.354AlaCys: 0.354 ± 0.202
3.895AlaAsp: 3.895 ± 0.698
3.187AlaGlu: 3.187 ± 0.51
2.951AlaPhe: 2.951 ± 0.761
4.131AlaGly: 4.131 ± 0.965
0.708AlaHis: 0.708 ± 0.269
4.957AlaIle: 4.957 ± 1.06
4.839AlaLys: 4.839 ± 0.76
6.255AlaLeu: 6.255 ± 0.987
1.416AlaMet: 1.416 ± 0.406
3.659AlaAsn: 3.659 ± 0.63
1.652AlaPro: 1.652 ± 0.474
1.77AlaGln: 1.77 ± 0.511
1.416AlaArg: 1.416 ± 0.386
3.423AlaSer: 3.423 ± 1.023
4.131AlaThr: 4.131 ± 0.656
3.659AlaVal: 3.659 ± 0.655
1.062AlaTrp: 1.062 ± 0.455
1.652AlaTyr: 1.652 ± 0.477
0.0AlaXaa: 0.0 ± 0.0
Cys
0.236CysAla: 0.236 ± 0.153
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.354CysGlu: 0.354 ± 0.168
0.236CysPhe: 0.236 ± 0.159
0.354CysGly: 0.354 ± 0.278
0.354CysHis: 0.354 ± 0.22
0.236CysIle: 0.236 ± 0.162
0.118CysLys: 0.118 ± 0.119
0.236CysLeu: 0.236 ± 0.163
0.118CysMet: 0.118 ± 0.119
0.236CysAsn: 0.236 ± 0.142
0.0CysPro: 0.0 ± 0.0
0.118CysGln: 0.118 ± 0.099
0.118CysArg: 0.118 ± 0.118
0.354CysSer: 0.354 ± 0.217
0.0CysThr: 0.0 ± 0.0
0.236CysVal: 0.236 ± 0.232
0.236CysTrp: 0.236 ± 0.157
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.305AspAla: 3.305 ± 0.713
0.236AspCys: 0.236 ± 0.165
4.721AspAsp: 4.721 ± 1.132
5.665AspGlu: 5.665 ± 0.961
2.596AspPhe: 2.596 ± 0.657
4.367AspGly: 4.367 ± 0.624
1.416AspHis: 1.416 ± 0.422
4.957AspIle: 4.957 ± 0.714
4.603AspLys: 4.603 ± 0.885
4.485AspLeu: 4.485 ± 0.691
1.416AspMet: 1.416 ± 0.353
5.547AspAsn: 5.547 ± 0.878
2.124AspPro: 2.124 ± 0.445
1.062AspGln: 1.062 ± 0.327
2.124AspArg: 2.124 ± 0.443
3.659AspSer: 3.659 ± 0.617
3.541AspThr: 3.541 ± 0.702
5.193AspVal: 5.193 ± 0.747
1.298AspTrp: 1.298 ± 0.32
4.013AspTyr: 4.013 ± 0.827
0.0AspXaa: 0.0 ± 0.0
Glu
2.242GluAla: 2.242 ± 0.654
0.118GluCys: 0.118 ± 0.142
3.777GluAsp: 3.777 ± 0.673
4.485GluGlu: 4.485 ± 1.217
2.715GluPhe: 2.715 ± 0.628
2.478GluGly: 2.478 ± 0.607
1.062GluHis: 1.062 ± 0.289
3.777GluIle: 3.777 ± 0.755
4.721GluLys: 4.721 ± 0.895
5.783GluLeu: 5.783 ± 0.996
3.069GluMet: 3.069 ± 0.687
4.603GluAsn: 4.603 ± 0.84
0.826GluPro: 0.826 ± 0.284
2.596GluGln: 2.596 ± 0.566
2.006GluArg: 2.006 ± 0.571
2.715GluSer: 2.715 ± 0.559
3.305GluThr: 3.305 ± 0.623
5.783GluVal: 5.783 ± 0.868
0.944GluTrp: 0.944 ± 0.337
2.951GluTyr: 2.951 ± 0.593
0.0GluXaa: 0.0 ± 0.0
Phe
2.006PheAla: 2.006 ± 0.57
0.118PheCys: 0.118 ± 0.116
4.721PheAsp: 4.721 ± 0.707
3.187PheGlu: 3.187 ± 0.686
0.944PhePhe: 0.944 ± 0.367
2.951PheGly: 2.951 ± 0.414
0.236PheHis: 0.236 ± 0.177
3.895PheIle: 3.895 ± 0.5
2.478PheLys: 2.478 ± 0.496
2.833PheLeu: 2.833 ± 0.556
1.062PheMet: 1.062 ± 0.323
2.715PheAsn: 2.715 ± 0.539
1.062PhePro: 1.062 ± 0.381
0.826PheGln: 0.826 ± 0.306
1.416PheArg: 1.416 ± 0.375
5.665PheSer: 5.665 ± 1.793
3.423PheThr: 3.423 ± 0.616
2.36PheVal: 2.36 ± 0.382
0.472PheTrp: 0.472 ± 0.284
2.124PheTyr: 2.124 ± 0.492
0.0PheXaa: 0.0 ± 0.0
Gly
3.187GlyAla: 3.187 ± 0.724
0.236GlyCys: 0.236 ± 0.158
4.603GlyAsp: 4.603 ± 0.678
2.478GlyGlu: 2.478 ± 0.524
4.013GlyPhe: 4.013 ± 0.645
5.311GlyGly: 5.311 ± 1.237
0.826GlyHis: 0.826 ± 0.311
5.547GlyIle: 5.547 ± 2.069
6.137GlyLys: 6.137 ± 0.92
5.783GlyLeu: 5.783 ± 0.707
2.242GlyMet: 2.242 ± 0.46
3.659GlyAsn: 3.659 ± 0.547
0.236GlyPro: 0.236 ± 0.16
2.242GlyGln: 2.242 ± 0.453
3.305GlyArg: 3.305 ± 0.673
5.665GlySer: 5.665 ± 1.109
3.777GlyThr: 3.777 ± 0.657
4.839GlyVal: 4.839 ± 1.014
0.472GlyTrp: 0.472 ± 0.19
3.777GlyTyr: 3.777 ± 0.716
0.0GlyXaa: 0.0 ± 0.0
His
0.708HisAla: 0.708 ± 0.372
0.0HisCys: 0.0 ± 0.0
1.18HisAsp: 1.18 ± 0.36
1.062HisGlu: 1.062 ± 0.417
0.826HisPhe: 0.826 ± 0.329
0.944HisGly: 0.944 ± 0.303
0.236HisHis: 0.236 ± 0.166
0.826HisIle: 0.826 ± 0.3
0.708HisLys: 0.708 ± 0.319
1.534HisLeu: 1.534 ± 0.55
1.062HisMet: 1.062 ± 0.357
0.59HisAsn: 0.59 ± 0.281
0.708HisPro: 0.708 ± 0.299
0.59HisGln: 0.59 ± 0.299
0.354HisArg: 0.354 ± 0.226
0.826HisSer: 0.826 ± 0.25
1.18HisThr: 1.18 ± 0.418
0.354HisVal: 0.354 ± 0.227
0.0HisTrp: 0.0 ± 0.0
0.826HisTyr: 0.826 ± 0.33
0.0HisXaa: 0.0 ± 0.0
Ile
4.131IleAla: 4.131 ± 1.115
0.118IleCys: 0.118 ± 0.123
5.547IleAsp: 5.547 ± 0.781
5.193IleGlu: 5.193 ± 1.019
3.069IlePhe: 3.069 ± 0.577
5.075IleGly: 5.075 ± 1.57
0.826IleHis: 0.826 ± 0.258
4.957IleIle: 4.957 ± 0.715
5.783IleLys: 5.783 ± 0.908
4.957IleLeu: 4.957 ± 0.707
1.77IleMet: 1.77 ± 0.39
4.485IleAsn: 4.485 ± 0.449
2.124IlePro: 2.124 ± 0.501
2.951IleGln: 2.951 ± 0.603
1.77IleArg: 1.77 ± 0.484
6.491IleSer: 6.491 ± 1.009
5.547IleThr: 5.547 ± 0.866
4.013IleVal: 4.013 ± 0.637
0.472IleTrp: 0.472 ± 0.219
2.951IleTyr: 2.951 ± 0.556
0.0IleXaa: 0.0 ± 0.0
Lys
4.957LysAla: 4.957 ± 0.773
0.236LysCys: 0.236 ± 0.177
4.131LysAsp: 4.131 ± 0.692
4.249LysGlu: 4.249 ± 0.878
2.833LysPhe: 2.833 ± 0.569
4.131LysGly: 4.131 ± 0.597
1.534LysHis: 1.534 ± 0.444
5.783LysIle: 5.783 ± 0.687
5.547LysLys: 5.547 ± 0.91
7.081LysLeu: 7.081 ± 1.204
2.951LysMet: 2.951 ± 0.697
4.131LysAsn: 4.131 ± 0.777
2.36LysPro: 2.36 ± 0.747
2.715LysGln: 2.715 ± 0.466
3.541LysArg: 3.541 ± 0.807
4.603LysSer: 4.603 ± 0.706
5.665LysThr: 5.665 ± 0.827
3.895LysVal: 3.895 ± 0.798
0.944LysTrp: 0.944 ± 0.346
3.541LysTyr: 3.541 ± 0.626
0.0LysXaa: 0.0 ± 0.0
Leu
5.075LeuAla: 5.075 ± 0.861
0.354LeuCys: 0.354 ± 0.181
6.963LeuAsp: 6.963 ± 0.614
6.255LeuGlu: 6.255 ± 1.074
2.951LeuPhe: 2.951 ± 0.476
6.373LeuGly: 6.373 ± 1.157
0.944LeuHis: 0.944 ± 0.306
4.131LeuIle: 4.131 ± 0.713
7.789LeuLys: 7.789 ± 1.029
5.429LeuLeu: 5.429 ± 0.844
2.596LeuMet: 2.596 ± 0.507
5.901LeuAsn: 5.901 ± 0.752
2.478LeuPro: 2.478 ± 0.607
2.715LeuGln: 2.715 ± 0.553
2.715LeuArg: 2.715 ± 0.599
6.137LeuSer: 6.137 ± 1.102
5.193LeuThr: 5.193 ± 1.001
5.311LeuVal: 5.311 ± 0.946
0.354LeuTrp: 0.354 ± 0.194
2.478LeuTyr: 2.478 ± 0.581
0.0LeuXaa: 0.0 ± 0.0
Met
2.715MetAla: 2.715 ± 0.53
0.236MetCys: 0.236 ± 0.166
1.298MetAsp: 1.298 ± 0.367
1.062MetGlu: 1.062 ± 0.33
1.18MetPhe: 1.18 ± 0.414
2.242MetGly: 2.242 ± 0.473
0.236MetHis: 0.236 ± 0.156
1.416MetIle: 1.416 ± 0.413
2.006MetLys: 2.006 ± 0.466
1.888MetLeu: 1.888 ± 0.389
0.59MetMet: 0.59 ± 0.258
2.242MetAsn: 2.242 ± 0.533
1.298MetPro: 1.298 ± 0.392
0.708MetGln: 0.708 ± 0.282
0.826MetArg: 0.826 ± 0.298
2.36MetSer: 2.36 ± 0.506
2.951MetThr: 2.951 ± 0.921
1.77MetVal: 1.77 ± 0.414
0.354MetTrp: 0.354 ± 0.209
1.77MetTyr: 1.77 ± 0.441
0.0MetXaa: 0.0 ± 0.0
Asn
4.131AsnAla: 4.131 ± 0.74
0.354AsnCys: 0.354 ± 0.143
2.833AsnAsp: 2.833 ± 0.579
4.485AsnGlu: 4.485 ± 0.964
3.069AsnPhe: 3.069 ± 0.681
5.429AsnGly: 5.429 ± 1.015
1.18AsnHis: 1.18 ± 0.393
5.429AsnIle: 5.429 ± 0.963
4.249AsnLys: 4.249 ± 0.855
4.839AsnLeu: 4.839 ± 1.026
1.77AsnMet: 1.77 ± 0.352
5.311AsnAsn: 5.311 ± 0.94
3.305AsnPro: 3.305 ± 0.77
1.888AsnGln: 1.888 ± 0.342
2.596AsnArg: 2.596 ± 0.704
4.013AsnSer: 4.013 ± 0.707
4.485AsnThr: 4.485 ± 0.72
4.839AsnVal: 4.839 ± 0.573
0.472AsnTrp: 0.472 ± 0.237
3.423AsnTyr: 3.423 ± 0.678
0.0AsnXaa: 0.0 ± 0.0
Pro
2.124ProAla: 2.124 ± 0.58
0.0ProCys: 0.0 ± 0.0
2.124ProAsp: 2.124 ± 0.473
2.006ProGlu: 2.006 ± 0.464
1.534ProPhe: 1.534 ± 0.383
0.708ProGly: 0.708 ± 0.245
0.236ProHis: 0.236 ± 0.192
1.652ProIle: 1.652 ± 0.405
2.596ProLys: 2.596 ± 0.662
1.77ProLeu: 1.77 ± 0.501
1.18ProMet: 1.18 ± 0.444
2.478ProAsn: 2.478 ± 0.625
0.354ProPro: 0.354 ± 0.194
1.062ProGln: 1.062 ± 0.416
0.708ProArg: 0.708 ± 0.365
2.006ProSer: 2.006 ± 0.424
2.833ProThr: 2.833 ± 0.528
1.652ProVal: 1.652 ± 0.462
0.0ProTrp: 0.0 ± 0.0
1.416ProTyr: 1.416 ± 0.404
0.0ProXaa: 0.0 ± 0.0
Gln
3.069GlnAla: 3.069 ± 0.555
0.0GlnCys: 0.0 ± 0.0
2.36GlnAsp: 2.36 ± 0.598
1.888GlnGlu: 1.888 ± 0.384
1.888GlnPhe: 1.888 ± 0.345
1.652GlnGly: 1.652 ± 0.45
0.472GlnHis: 0.472 ± 0.232
2.951GlnIle: 2.951 ± 0.56
2.478GlnLys: 2.478 ± 0.553
2.951GlnLeu: 2.951 ± 0.66
0.708GlnMet: 0.708 ± 0.266
2.242GlnAsn: 2.242 ± 0.518
0.826GlnPro: 0.826 ± 0.266
0.944GlnGln: 0.944 ± 0.377
1.652GlnArg: 1.652 ± 0.416
2.715GlnSer: 2.715 ± 0.567
1.416GlnThr: 1.416 ± 0.431
2.833GlnVal: 2.833 ± 0.48
0.236GlnTrp: 0.236 ± 0.159
1.062GlnTyr: 1.062 ± 0.342
0.0GlnXaa: 0.0 ± 0.0
Arg
1.888ArgAla: 1.888 ± 0.362
0.354ArgCys: 0.354 ± 0.239
2.715ArgAsp: 2.715 ± 0.635
1.652ArgGlu: 1.652 ± 0.448
1.298ArgPhe: 1.298 ± 0.439
2.36ArgGly: 2.36 ± 0.678
0.59ArgHis: 0.59 ± 0.295
2.715ArgIle: 2.715 ± 0.543
1.77ArgLys: 1.77 ± 0.59
4.957ArgLeu: 4.957 ± 0.644
1.062ArgMet: 1.062 ± 0.355
2.833ArgAsn: 2.833 ± 0.518
0.944ArgPro: 0.944 ± 0.444
1.416ArgGln: 1.416 ± 0.374
1.77ArgArg: 1.77 ± 0.569
2.596ArgSer: 2.596 ± 0.584
2.478ArgThr: 2.478 ± 0.353
2.596ArgVal: 2.596 ± 0.594
0.59ArgTrp: 0.59 ± 0.257
0.944ArgTyr: 0.944 ± 0.333
0.0ArgXaa: 0.0 ± 0.0
Ser
4.013SerAla: 4.013 ± 1.092
0.118SerCys: 0.118 ± 0.116
4.721SerAsp: 4.721 ± 0.908
2.36SerGlu: 2.36 ± 0.566
3.423SerPhe: 3.423 ± 0.542
5.901SerGly: 5.901 ± 1.577
0.944SerHis: 0.944 ± 0.354
5.547SerIle: 5.547 ± 1.098
4.249SerLys: 4.249 ± 0.769
4.957SerLeu: 4.957 ± 0.898
2.242SerMet: 2.242 ± 0.501
4.721SerAsn: 4.721 ± 0.922
1.888SerPro: 1.888 ± 0.446
3.069SerGln: 3.069 ± 0.508
2.596SerArg: 2.596 ± 0.668
6.019SerSer: 6.019 ± 2.014
6.727SerThr: 6.727 ± 0.795
5.665SerVal: 5.665 ± 1.098
0.59SerTrp: 0.59 ± 0.292
2.951SerTyr: 2.951 ± 0.669
0.0SerXaa: 0.0 ± 0.0
Thr
4.013ThrAla: 4.013 ± 0.606
0.118ThrCys: 0.118 ± 0.119
2.951ThrAsp: 2.951 ± 0.624
2.951ThrGlu: 2.951 ± 0.645
3.423ThrPhe: 3.423 ± 0.586
5.075ThrGly: 5.075 ± 0.687
1.062ThrHis: 1.062 ± 0.36
4.367ThrIle: 4.367 ± 0.836
6.609ThrLys: 6.609 ± 0.581
6.255ThrLeu: 6.255 ± 1.061
1.062ThrMet: 1.062 ± 0.349
5.075ThrAsn: 5.075 ± 0.698
3.069ThrPro: 3.069 ± 0.575
4.013ThrGln: 4.013 ± 0.723
2.951ThrArg: 2.951 ± 0.599
5.075ThrSer: 5.075 ± 0.663
4.013ThrThr: 4.013 ± 0.79
3.305ThrVal: 3.305 ± 0.67
0.472ThrTrp: 0.472 ± 0.19
1.888ThrTyr: 1.888 ± 0.455
0.0ThrXaa: 0.0 ± 0.0
Val
4.249ValAla: 4.249 ± 0.524
0.236ValCys: 0.236 ± 0.146
3.777ValAsp: 3.777 ± 0.591
3.659ValGlu: 3.659 ± 0.527
3.541ValPhe: 3.541 ± 0.705
4.957ValGly: 4.957 ± 0.905
0.354ValHis: 0.354 ± 0.182
6.019ValIle: 6.019 ± 1.152
4.013ValLys: 4.013 ± 0.592
4.721ValLeu: 4.721 ± 0.699
1.416ValMet: 1.416 ± 0.293
4.131ValAsn: 4.131 ± 0.718
1.888ValPro: 1.888 ± 0.371
1.77ValGln: 1.77 ± 0.506
3.777ValArg: 3.777 ± 0.71
4.485ValSer: 4.485 ± 0.83
4.013ValThr: 4.013 ± 0.681
3.541ValVal: 3.541 ± 0.771
1.416ValTrp: 1.416 ± 0.344
3.423ValTyr: 3.423 ± 0.688
0.0ValXaa: 0.0 ± 0.0
Trp
0.472TrpAla: 0.472 ± 0.265
0.0TrpCys: 0.0 ± 0.0
1.062TrpAsp: 1.062 ± 0.294
0.59TrpGlu: 0.59 ± 0.234
0.236TrpPhe: 0.236 ± 0.166
1.298TrpGly: 1.298 ± 0.376
0.472TrpHis: 0.472 ± 0.222
0.118TrpIle: 0.118 ± 0.107
0.59TrpLys: 0.59 ± 0.251
0.944TrpLeu: 0.944 ± 0.393
0.354TrpMet: 0.354 ± 0.219
0.472TrpAsn: 0.472 ± 0.252
0.0TrpPro: 0.0 ± 0.0
0.708TrpGln: 0.708 ± 0.197
0.472TrpArg: 0.472 ± 0.269
0.944TrpSer: 0.944 ± 0.326
0.826TrpThr: 0.826 ± 0.349
0.59TrpVal: 0.59 ± 0.239
0.236TrpTrp: 0.236 ± 0.151
0.472TrpTyr: 0.472 ± 0.252
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.596TyrAla: 2.596 ± 0.538
0.354TyrCys: 0.354 ± 0.167
2.951TyrAsp: 2.951 ± 0.682
3.187TyrGlu: 3.187 ± 0.695
1.77TyrPhe: 1.77 ± 0.389
2.596TyrGly: 2.596 ± 0.712
0.826TyrHis: 0.826 ± 0.312
2.951TyrIle: 2.951 ± 0.543
3.541TyrLys: 3.541 ± 0.614
4.485TyrLeu: 4.485 ± 0.706
0.944TyrMet: 0.944 ± 0.369
3.187TyrAsn: 3.187 ± 0.637
1.298TyrPro: 1.298 ± 0.422
1.18TyrGln: 1.18 ± 0.391
1.416TyrArg: 1.416 ± 0.538
2.951TyrSer: 2.951 ± 0.833
2.242TyrThr: 2.242 ± 0.575
2.951TyrVal: 2.951 ± 0.622
0.236TyrTrp: 0.236 ± 0.177
2.006TyrTyr: 2.006 ± 0.403
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 45 proteins (8474 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski