Amino acid dipepetide frequency for Lactococcus phage BK5-T

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.804AlaAla: 4.804 ± 1.842
0.31AlaCys: 0.31 ± 0.176
3.641AlaAsp: 3.641 ± 0.446
4.649AlaGlu: 4.649 ± 0.76
2.479AlaPhe: 2.479 ± 0.596
5.191AlaGly: 5.191 ± 1.017
0.775AlaHis: 0.775 ± 0.203
4.726AlaIle: 4.726 ± 0.791
5.114AlaLys: 5.114 ± 0.978
4.804AlaLeu: 4.804 ± 0.527
1.937AlaMet: 1.937 ± 0.285
3.332AlaAsn: 3.332 ± 0.633
2.092AlaPro: 2.092 ± 0.512
3.332AlaGln: 3.332 ± 0.978
1.317AlaArg: 1.317 ± 0.342
4.494AlaSer: 4.494 ± 0.717
4.029AlaThr: 4.029 ± 0.596
3.951AlaVal: 3.951 ± 0.735
0.93AlaTrp: 0.93 ± 0.317
2.867AlaTyr: 2.867 ± 0.524
0.0AlaXaa: 0.0 ± 0.0
Cys
0.155CysAla: 0.155 ± 0.104
0.077CysCys: 0.077 ± 0.078
0.155CysAsp: 0.155 ± 0.11
0.232CysGlu: 0.232 ± 0.132
0.62CysPhe: 0.62 ± 0.276
0.31CysGly: 0.31 ± 0.197
0.155CysHis: 0.155 ± 0.157
0.232CysIle: 0.232 ± 0.178
0.465CysLys: 0.465 ± 0.238
0.542CysLeu: 0.542 ± 0.23
0.155CysMet: 0.155 ± 0.099
0.387CysAsn: 0.387 ± 0.2
0.31CysPro: 0.31 ± 0.196
0.232CysGln: 0.232 ± 0.123
0.155CysArg: 0.155 ± 0.103
0.62CysSer: 0.62 ± 0.244
0.465CysThr: 0.465 ± 0.154
0.31CysVal: 0.31 ± 0.134
0.077CysTrp: 0.077 ± 0.077
0.232CysTyr: 0.232 ± 0.139
0.0CysXaa: 0.0 ± 0.0
Asp
3.254AspAla: 3.254 ± 0.62
0.31AspCys: 0.31 ± 0.213
3.719AspAsp: 3.719 ± 0.615
3.796AspGlu: 3.796 ± 0.677
2.944AspPhe: 2.944 ± 0.541
5.578AspGly: 5.578 ± 1.677
0.775AspHis: 0.775 ± 0.179
3.254AspIle: 3.254 ± 0.571
5.423AspLys: 5.423 ± 0.564
4.959AspLeu: 4.959 ± 0.69
1.162AspMet: 1.162 ± 0.253
4.804AspAsn: 4.804 ± 0.609
1.627AspPro: 1.627 ± 0.363
1.705AspGln: 1.705 ± 0.401
1.859AspArg: 1.859 ± 0.397
4.261AspSer: 4.261 ± 0.48
2.634AspThr: 2.634 ± 0.319
3.719AspVal: 3.719 ± 0.654
0.775AspTrp: 0.775 ± 0.272
2.944AspTyr: 2.944 ± 0.385
0.0AspXaa: 0.0 ± 0.0
Glu
4.184GluAla: 4.184 ± 0.778
0.155GluCys: 0.155 ± 0.112
2.557GluAsp: 2.557 ± 0.51
4.184GluGlu: 4.184 ± 0.954
2.247GluPhe: 2.247 ± 0.429
3.254GluGly: 3.254 ± 0.621
0.852GluHis: 0.852 ± 0.253
5.501GluIle: 5.501 ± 0.832
6.663GluLys: 6.663 ± 1.323
5.811GluLeu: 5.811 ± 1.102
1.162GluMet: 1.162 ± 0.341
2.712GluAsn: 2.712 ± 0.467
2.169GluPro: 2.169 ± 0.628
2.402GluGln: 2.402 ± 0.392
1.937GluArg: 1.937 ± 0.412
3.177GluSer: 3.177 ± 0.565
3.254GluThr: 3.254 ± 0.691
3.564GluVal: 3.564 ± 0.512
0.62GluTrp: 0.62 ± 0.233
1.937GluTyr: 1.937 ± 0.412
0.0GluXaa: 0.0 ± 0.0
Phe
1.937PheAla: 1.937 ± 0.376
0.387PheCys: 0.387 ± 0.155
2.634PheAsp: 2.634 ± 0.506
2.092PheGlu: 2.092 ± 0.492
1.395PhePhe: 1.395 ± 0.277
2.479PheGly: 2.479 ± 0.477
0.62PheHis: 0.62 ± 0.201
3.564PheIle: 3.564 ± 0.766
3.409PheLys: 3.409 ± 0.472
3.254PheLeu: 3.254 ± 0.44
0.93PheMet: 0.93 ± 0.359
2.944PheAsn: 2.944 ± 0.502
1.55PhePro: 1.55 ± 0.304
1.705PheGln: 1.705 ± 0.368
1.395PheArg: 1.395 ± 0.325
3.099PheSer: 3.099 ± 0.522
2.402PheThr: 2.402 ± 0.514
2.402PheVal: 2.402 ± 0.367
0.232PheTrp: 0.232 ± 0.122
1.472PheTyr: 1.472 ± 0.312
0.0PheXaa: 0.0 ± 0.0
Gly
3.022GlyAla: 3.022 ± 0.612
0.155GlyCys: 0.155 ± 0.108
3.719GlyAsp: 3.719 ± 0.7
2.479GlyGlu: 2.479 ± 0.376
2.634GlyPhe: 2.634 ± 0.41
4.184GlyGly: 4.184 ± 0.93
1.24GlyHis: 1.24 ± 0.307
6.121GlyIle: 6.121 ± 1.001
6.353GlyLys: 6.353 ± 1.325
5.966GlyLeu: 5.966 ± 0.748
1.395GlyMet: 1.395 ± 0.356
5.501GlyAsn: 5.501 ± 1.161
1.007GlyPro: 1.007 ± 0.257
3.022GlyGln: 3.022 ± 0.544
1.472GlyArg: 1.472 ± 0.455
5.191GlySer: 5.191 ± 0.965
6.508GlyThr: 6.508 ± 1.989
3.564GlyVal: 3.564 ± 0.727
1.627GlyTrp: 1.627 ± 0.554
3.409GlyTyr: 3.409 ± 0.558
0.0GlyXaa: 0.0 ± 0.0
His
0.852HisAla: 0.852 ± 0.344
0.077HisCys: 0.077 ± 0.075
1.085HisAsp: 1.085 ± 0.397
0.697HisGlu: 0.697 ± 0.309
0.62HisPhe: 0.62 ± 0.263
1.24HisGly: 1.24 ± 0.369
0.465HisHis: 0.465 ± 0.178
0.62HisIle: 0.62 ± 0.181
1.395HisLys: 1.395 ± 0.332
0.93HisLeu: 0.93 ± 0.268
0.387HisMet: 0.387 ± 0.125
0.852HisAsn: 0.852 ± 0.225
0.232HisPro: 0.232 ± 0.103
0.542HisGln: 0.542 ± 0.218
0.387HisArg: 0.387 ± 0.173
0.775HisSer: 0.775 ± 0.205
1.162HisThr: 1.162 ± 0.279
0.387HisVal: 0.387 ± 0.156
0.155HisTrp: 0.155 ± 0.174
0.852HisTyr: 0.852 ± 0.254
0.0HisXaa: 0.0 ± 0.0
Ile
5.268IleAla: 5.268 ± 0.644
0.697IleCys: 0.697 ± 0.203
4.804IleAsp: 4.804 ± 0.745
4.726IleGlu: 4.726 ± 0.796
2.014IlePhe: 2.014 ± 0.58
3.564IleGly: 3.564 ± 0.687
1.007IleHis: 1.007 ± 0.278
4.726IleIle: 4.726 ± 0.875
5.268IleLys: 5.268 ± 0.48
4.649IleLeu: 4.649 ± 0.884
1.627IleMet: 1.627 ± 0.355
5.423IleAsn: 5.423 ± 0.466
2.712IlePro: 2.712 ± 0.569
2.169IleGln: 2.169 ± 0.349
2.324IleArg: 2.324 ± 0.484
5.966IleSer: 5.966 ± 0.88
5.423IleThr: 5.423 ± 0.878
3.874IleVal: 3.874 ± 0.587
0.852IleTrp: 0.852 ± 0.298
2.789IleTyr: 2.789 ± 0.543
0.0IleXaa: 0.0 ± 0.0
Lys
6.353LysAla: 6.353 ± 1.491
0.62LysCys: 0.62 ± 0.231
6.586LysAsp: 6.586 ± 0.987
5.191LysGlu: 5.191 ± 0.785
3.332LysPhe: 3.332 ± 0.419
5.656LysGly: 5.656 ± 1.047
1.24LysHis: 1.24 ± 0.287
6.198LysIle: 6.198 ± 0.835
7.36LysLys: 7.36 ± 1.072
6.741LysLeu: 6.741 ± 0.975
2.247LysMet: 2.247 ± 0.477
6.895LysAsn: 6.895 ± 0.949
2.092LysPro: 2.092 ± 0.488
4.184LysGln: 4.184 ± 0.87
2.944LysArg: 2.944 ± 0.54
5.423LysSer: 5.423 ± 0.692
6.198LysThr: 6.198 ± 0.576
4.261LysVal: 4.261 ± 0.667
1.007LysTrp: 1.007 ± 0.257
2.169LysTyr: 2.169 ± 0.43
0.0LysXaa: 0.0 ± 0.0
Leu
5.346LeuAla: 5.346 ± 0.83
0.31LeuCys: 0.31 ± 0.144
3.564LeuAsp: 3.564 ± 0.758
4.494LeuGlu: 4.494 ± 0.56
3.099LeuPhe: 3.099 ± 0.723
4.029LeuGly: 4.029 ± 0.634
1.007LeuHis: 1.007 ± 0.228
4.881LeuIle: 4.881 ± 0.697
6.741LeuLys: 6.741 ± 0.8
6.353LeuLeu: 6.353 ± 0.856
2.867LeuMet: 2.867 ± 0.587
5.811LeuAsn: 5.811 ± 0.544
3.022LeuPro: 3.022 ± 0.588
4.261LeuGln: 4.261 ± 0.634
3.254LeuArg: 3.254 ± 0.665
6.741LeuSer: 6.741 ± 1.035
5.268LeuThr: 5.268 ± 0.716
3.486LeuVal: 3.486 ± 0.483
1.705LeuTrp: 1.705 ± 0.785
2.557LeuTyr: 2.557 ± 0.483
0.0LeuXaa: 0.0 ± 0.0
Met
1.782MetAla: 1.782 ± 0.377
0.31MetCys: 0.31 ± 0.145
0.93MetAsp: 0.93 ± 0.26
2.092MetGlu: 2.092 ± 0.451
0.93MetPhe: 0.93 ± 0.27
2.169MetGly: 2.169 ± 0.524
0.155MetHis: 0.155 ± 0.104
1.395MetIle: 1.395 ± 0.356
2.789MetLys: 2.789 ± 0.558
0.697MetLeu: 0.697 ± 0.26
1.085MetMet: 1.085 ± 0.275
1.472MetAsn: 1.472 ± 0.42
1.007MetPro: 1.007 ± 0.237
1.162MetGln: 1.162 ± 0.3
0.697MetArg: 0.697 ± 0.206
1.705MetSer: 1.705 ± 0.391
2.634MetThr: 2.634 ± 0.439
1.55MetVal: 1.55 ± 0.405
0.232MetTrp: 0.232 ± 0.114
0.542MetTyr: 0.542 ± 0.244
0.0MetXaa: 0.0 ± 0.0
Asn
3.641AsnAla: 3.641 ± 0.484
0.62AsnCys: 0.62 ± 0.267
4.029AsnAsp: 4.029 ± 0.374
2.479AsnGlu: 2.479 ± 0.514
2.789AsnPhe: 2.789 ± 0.571
6.586AsnGly: 6.586 ± 2.044
1.162AsnHis: 1.162 ± 0.368
4.571AsnIle: 4.571 ± 0.817
5.501AsnLys: 5.501 ± 0.732
5.501AsnLeu: 5.501 ± 0.655
1.55AsnMet: 1.55 ± 0.376
5.036AsnAsn: 5.036 ± 0.973
2.789AsnPro: 2.789 ± 0.561
2.014AsnGln: 2.014 ± 0.454
2.789AsnArg: 2.789 ± 0.558
5.114AsnSer: 5.114 ± 0.819
4.726AsnThr: 4.726 ± 0.814
3.022AsnVal: 3.022 ± 0.46
0.62AsnTrp: 0.62 ± 0.241
2.789AsnTyr: 2.789 ± 0.413
0.0AsnXaa: 0.0 ± 0.0
Pro
1.705ProAla: 1.705 ± 0.509
0.0ProCys: 0.0 ± 0.0
1.627ProAsp: 1.627 ± 0.388
1.162ProGlu: 1.162 ± 0.362
1.472ProPhe: 1.472 ± 0.29
2.014ProGly: 2.014 ± 0.651
0.387ProHis: 0.387 ± 0.165
2.402ProIle: 2.402 ± 0.581
2.092ProLys: 2.092 ± 0.364
2.169ProLeu: 2.169 ± 0.394
0.775ProMet: 0.775 ± 0.234
3.099ProAsn: 3.099 ± 0.779
1.395ProPro: 1.395 ± 0.464
1.55ProGln: 1.55 ± 0.33
0.93ProArg: 0.93 ± 0.321
2.247ProSer: 2.247 ± 0.418
2.634ProThr: 2.634 ± 0.512
2.402ProVal: 2.402 ± 0.353
0.465ProTrp: 0.465 ± 0.202
1.007ProTyr: 1.007 ± 0.308
0.0ProXaa: 0.0 ± 0.0
Gln
3.332GlnAla: 3.332 ± 0.51
0.155GlnCys: 0.155 ± 0.092
3.409GlnAsp: 3.409 ± 0.578
2.867GlnGlu: 2.867 ± 0.346
2.169GlnPhe: 2.169 ± 0.425
2.557GlnGly: 2.557 ± 0.537
0.232GlnHis: 0.232 ± 0.138
2.324GlnIle: 2.324 ± 0.414
3.719GlnLys: 3.719 ± 1.14
4.261GlnLeu: 4.261 ± 0.579
1.395GlnMet: 1.395 ± 0.374
2.324GlnAsn: 2.324 ± 0.401
0.465GlnPro: 0.465 ± 0.173
2.092GlnGln: 2.092 ± 0.469
1.24GlnArg: 1.24 ± 0.411
2.789GlnSer: 2.789 ± 0.509
2.247GlnThr: 2.247 ± 0.524
2.402GlnVal: 2.402 ± 0.375
0.465GlnTrp: 0.465 ± 0.182
1.782GlnTyr: 1.782 ± 0.216
0.0GlnXaa: 0.0 ± 0.0
Arg
1.705ArgAla: 1.705 ± 0.402
0.387ArgCys: 0.387 ± 0.214
1.859ArgAsp: 1.859 ± 0.272
2.169ArgGlu: 2.169 ± 0.632
1.317ArgPhe: 1.317 ± 0.367
1.782ArgGly: 1.782 ± 0.377
0.62ArgHis: 0.62 ± 0.264
2.169ArgIle: 2.169 ± 0.491
2.867ArgLys: 2.867 ± 0.611
2.634ArgLeu: 2.634 ± 0.588
1.007ArgMet: 1.007 ± 0.242
2.247ArgAsn: 2.247 ± 0.432
0.697ArgPro: 0.697 ± 0.234
1.162ArgGln: 1.162 ± 0.295
1.162ArgArg: 1.162 ± 0.287
1.627ArgSer: 1.627 ± 0.395
2.867ArgThr: 2.867 ± 0.476
1.705ArgVal: 1.705 ± 0.371
0.775ArgTrp: 0.775 ± 0.271
1.472ArgTyr: 1.472 ± 0.3
0.0ArgXaa: 0.0 ± 0.0
Ser
4.339SerAla: 4.339 ± 0.701
0.387SerCys: 0.387 ± 0.218
4.959SerAsp: 4.959 ± 0.55
4.959SerGlu: 4.959 ± 0.572
3.796SerPhe: 3.796 ± 0.61
5.268SerGly: 5.268 ± 0.584
1.085SerHis: 1.085 ± 0.295
4.804SerIle: 4.804 ± 0.679
4.726SerLys: 4.726 ± 0.598
5.423SerLeu: 5.423 ± 0.827
1.782SerMet: 1.782 ± 0.332
4.339SerAsn: 4.339 ± 0.664
1.472SerPro: 1.472 ± 0.293
2.712SerGln: 2.712 ± 0.726
2.169SerArg: 2.169 ± 0.45
5.268SerSer: 5.268 ± 0.77
5.501SerThr: 5.501 ± 0.865
5.346SerVal: 5.346 ± 0.615
0.387SerTrp: 0.387 ± 0.179
2.402SerTyr: 2.402 ± 0.279
0.0SerXaa: 0.0 ± 0.0
Thr
5.191ThrAla: 5.191 ± 0.628
0.077ThrCys: 0.077 ± 0.084
4.184ThrAsp: 4.184 ± 0.874
3.332ThrGlu: 3.332 ± 0.462
2.324ThrPhe: 2.324 ± 0.565
6.043ThrGly: 6.043 ± 1.311
0.852ThrHis: 0.852 ± 0.231
5.578ThrIle: 5.578 ± 0.648
6.276ThrLys: 6.276 ± 0.669
4.726ThrLeu: 4.726 ± 0.74
1.085ThrMet: 1.085 ± 0.331
4.494ThrAsn: 4.494 ± 0.938
1.859ThrPro: 1.859 ± 0.58
3.099ThrGln: 3.099 ± 0.537
2.479ThrArg: 2.479 ± 0.298
5.656ThrSer: 5.656 ± 1.01
6.508ThrThr: 6.508 ± 1.747
5.656ThrVal: 5.656 ± 1.014
1.395ThrTrp: 1.395 ± 0.483
3.874ThrTyr: 3.874 ± 1.436
0.0ThrXaa: 0.0 ± 0.0
Val
4.416ValAla: 4.416 ± 1.077
0.387ValCys: 0.387 ± 0.144
3.796ValAsp: 3.796 ± 0.501
3.486ValGlu: 3.486 ± 0.506
1.782ValPhe: 1.782 ± 0.408
3.022ValGly: 3.022 ± 0.523
0.387ValHis: 0.387 ± 0.174
3.409ValIle: 3.409 ± 0.618
6.276ValLys: 6.276 ± 0.73
5.036ValLeu: 5.036 ± 0.903
1.317ValMet: 1.317 ± 0.435
3.099ValAsn: 3.099 ± 0.374
3.564ValPro: 3.564 ± 0.744
2.092ValGln: 2.092 ± 0.319
1.55ValArg: 1.55 ± 0.513
2.789ValSer: 2.789 ± 0.302
4.571ValThr: 4.571 ± 0.463
3.874ValVal: 3.874 ± 0.397
0.93ValTrp: 0.93 ± 0.409
1.627ValTyr: 1.627 ± 0.398
0.0ValXaa: 0.0 ± 0.0
Trp
0.852TrpAla: 0.852 ± 0.205
0.0TrpCys: 0.0 ± 0.0
0.542TrpAsp: 0.542 ± 0.256
0.775TrpGlu: 0.775 ± 0.228
0.31TrpPhe: 0.31 ± 0.149
0.775TrpGly: 0.775 ± 0.152
0.232TrpHis: 0.232 ± 0.105
0.62TrpIle: 0.62 ± 0.222
1.162TrpLys: 1.162 ± 0.318
1.24TrpLeu: 1.24 ± 0.268
0.387TrpMet: 0.387 ± 0.159
0.697TrpAsn: 0.697 ± 0.227
0.155TrpPro: 0.155 ± 0.114
0.62TrpGln: 0.62 ± 0.235
0.465TrpArg: 0.465 ± 0.142
0.93TrpSer: 0.93 ± 0.397
2.634TrpThr: 2.634 ± 1.568
0.775TrpVal: 0.775 ± 0.302
0.31TrpTrp: 0.31 ± 0.143
0.31TrpTyr: 0.31 ± 0.138
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.634TyrAla: 2.634 ± 0.822
0.465TyrCys: 0.465 ± 0.192
1.705TyrAsp: 1.705 ± 0.287
2.479TyrGlu: 2.479 ± 0.355
1.627TyrPhe: 1.627 ± 0.309
2.867TyrGly: 2.867 ± 0.408
0.465TyrHis: 0.465 ± 0.23
2.789TyrIle: 2.789 ± 0.416
3.099TyrLys: 3.099 ± 0.591
3.099TyrLeu: 3.099 ± 0.536
1.007TyrMet: 1.007 ± 0.296
1.859TyrAsn: 1.859 ± 0.577
1.317TyrPro: 1.317 ± 0.326
2.169TyrGln: 2.169 ± 0.607
1.705TyrArg: 1.705 ± 0.364
3.254TyrSer: 3.254 ± 0.579
3.022TyrThr: 3.022 ± 0.976
1.162TyrVal: 1.162 ± 0.264
0.232TyrTrp: 0.232 ± 0.13
1.317TyrTyr: 1.317 ± 0.321
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 61 proteins (12908 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski