Amino acid dipepetide frequency for Lactobacillus phage Lenus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.83AlaAla: 2.83 ± 0.894
0.429AlaCys: 0.429 ± 0.123
3.687AlaAsp: 3.687 ± 0.382
2.572AlaGlu: 2.572 ± 0.308
1.929AlaPhe: 1.929 ± 0.281
4.587AlaGly: 4.587 ± 0.957
0.686AlaHis: 0.686 ± 0.142
4.159AlaIle: 4.159 ± 0.521
5.874AlaLys: 5.874 ± 0.952
5.659AlaLeu: 5.659 ± 0.707
1.715AlaMet: 1.715 ± 0.292
3.344AlaAsn: 3.344 ± 0.437
1.458AlaPro: 1.458 ± 0.251
2.272AlaGln: 2.272 ± 0.34
1.929AlaArg: 1.929 ± 0.311
3.73AlaSer: 3.73 ± 0.83
3.73AlaThr: 3.73 ± 0.475
4.63AlaVal: 4.63 ± 0.595
1.029AlaTrp: 1.029 ± 0.274
2.529AlaTyr: 2.529 ± 0.356
0.0AlaXaa: 0.0 ± 0.0
Cys
0.257CysAla: 0.257 ± 0.095
0.086CysCys: 0.086 ± 0.066
0.214CysAsp: 0.214 ± 0.099
0.214CysGlu: 0.214 ± 0.092
0.429CysPhe: 0.429 ± 0.155
0.815CysGly: 0.815 ± 0.181
0.086CysHis: 0.086 ± 0.059
0.557CysIle: 0.557 ± 0.15
0.3CysLys: 0.3 ± 0.112
0.815CysLeu: 0.815 ± 0.21
0.171CysMet: 0.171 ± 0.093
0.557CysAsn: 0.557 ± 0.203
0.472CysPro: 0.472 ± 0.208
0.386CysGln: 0.386 ± 0.124
0.3CysArg: 0.3 ± 0.111
0.557CysSer: 0.557 ± 0.171
0.257CysThr: 0.257 ± 0.105
0.729CysVal: 0.729 ± 0.179
0.257CysTrp: 0.257 ± 0.124
0.6CysTyr: 0.6 ± 0.187
0.0CysXaa: 0.0 ± 0.0
Asp
3.901AspAla: 3.901 ± 0.419
0.686AspCys: 0.686 ± 0.21
6.86AspAsp: 6.86 ± 0.91
5.402AspGlu: 5.402 ± 0.698
3.644AspPhe: 3.644 ± 0.497
6.174AspGly: 6.174 ± 0.706
1.458AspHis: 1.458 ± 0.259
4.502AspIle: 4.502 ± 0.543
5.145AspLys: 5.145 ± 0.48
5.831AspLeu: 5.831 ± 0.601
2.315AspMet: 2.315 ± 0.312
4.416AspAsn: 4.416 ± 0.39
2.015AspPro: 2.015 ± 0.391
1.801AspGln: 1.801 ± 0.308
2.401AspArg: 2.401 ± 0.385
4.716AspSer: 4.716 ± 0.553
3.644AspThr: 3.644 ± 0.486
4.416AspVal: 4.416 ± 0.436
0.986AspTrp: 0.986 ± 0.189
3.558AspTyr: 3.558 ± 0.411
0.0AspXaa: 0.0 ± 0.0
Glu
2.915GluAla: 2.915 ± 0.396
0.815GluCys: 0.815 ± 0.166
4.03GluAsp: 4.03 ± 0.511
3.944GluGlu: 3.944 ± 0.456
2.487GluPhe: 2.487 ± 0.404
2.315GluGly: 2.315 ± 0.285
1.029GluHis: 1.029 ± 0.242
5.102GluIle: 5.102 ± 0.44
5.445GluLys: 5.445 ± 0.589
5.959GluLeu: 5.959 ± 0.538
2.615GluMet: 2.615 ± 0.362
3.944GluAsn: 3.944 ± 0.535
1.543GluPro: 1.543 ± 0.239
2.101GluGln: 2.101 ± 0.273
2.315GluArg: 2.315 ± 0.373
3.258GluSer: 3.258 ± 0.369
3.044GluThr: 3.044 ± 0.424
3.644GluVal: 3.644 ± 0.466
1.029GluTrp: 1.029 ± 0.206
2.701GluTyr: 2.701 ± 0.376
0.0GluXaa: 0.0 ± 0.0
Phe
2.144PheAla: 2.144 ± 0.338
0.514PheCys: 0.514 ± 0.158
3.73PheAsp: 3.73 ± 0.507
2.101PheGlu: 2.101 ± 0.483
1.715PhePhe: 1.715 ± 0.339
3.301PheGly: 3.301 ± 0.334
0.472PheHis: 0.472 ± 0.121
2.229PheIle: 2.229 ± 0.355
3.644PheLys: 3.644 ± 0.459
2.615PheLeu: 2.615 ± 0.397
0.9PheMet: 0.9 ± 0.188
2.401PheAsn: 2.401 ± 0.336
1.158PhePro: 1.158 ± 0.228
1.2PheGln: 1.2 ± 0.251
1.329PheArg: 1.329 ± 0.25
2.83PheSer: 2.83 ± 0.324
2.529PheThr: 2.529 ± 0.384
2.401PheVal: 2.401 ± 0.371
0.429PheTrp: 0.429 ± 0.139
1.758PheTyr: 1.758 ± 0.272
0.0PheXaa: 0.0 ± 0.0
Gly
3.859GlyAla: 3.859 ± 0.846
0.257GlyCys: 0.257 ± 0.128
4.802GlyAsp: 4.802 ± 0.433
3.944GlyGlu: 3.944 ± 0.47
2.83GlyPhe: 2.83 ± 0.346
4.93GlyGly: 4.93 ± 0.991
1.329GlyHis: 1.329 ± 0.262
4.802GlyIle: 4.802 ± 0.591
6.645GlyLys: 6.645 ± 1.214
6.302GlyLeu: 6.302 ± 0.567
2.315GlyMet: 2.315 ± 0.318
4.63GlyAsn: 4.63 ± 0.532
0.729GlyPro: 0.729 ± 0.227
2.101GlyGln: 2.101 ± 0.338
3.001GlyArg: 3.001 ± 0.361
5.145GlySer: 5.145 ± 0.721
4.502GlyThr: 4.502 ± 0.642
3.944GlyVal: 3.944 ± 0.493
1.243GlyTrp: 1.243 ± 0.226
3.473GlyTyr: 3.473 ± 0.318
0.0GlyXaa: 0.0 ± 0.0
His
0.815HisAla: 0.815 ± 0.187
0.086HisCys: 0.086 ± 0.063
1.372HisAsp: 1.372 ± 0.26
1.2HisGlu: 1.2 ± 0.303
0.772HisPhe: 0.772 ± 0.204
1.243HisGly: 1.243 ± 0.229
0.386HisHis: 0.386 ± 0.147
1.415HisIle: 1.415 ± 0.311
1.072HisLys: 1.072 ± 0.201
0.943HisLeu: 0.943 ± 0.184
0.386HisMet: 0.386 ± 0.126
1.029HisAsn: 1.029 ± 0.233
0.429HisPro: 0.429 ± 0.149
0.514HisGln: 0.514 ± 0.151
0.643HisArg: 0.643 ± 0.186
1.286HisSer: 1.286 ± 0.298
0.857HisThr: 0.857 ± 0.191
1.2HisVal: 1.2 ± 0.193
0.171HisTrp: 0.171 ± 0.099
0.815HisTyr: 0.815 ± 0.235
0.0HisXaa: 0.0 ± 0.0
Ile
4.63IleAla: 4.63 ± 0.41
0.557IleCys: 0.557 ± 0.158
5.959IleAsp: 5.959 ± 0.582
4.416IleGlu: 4.416 ± 0.493
2.015IlePhe: 2.015 ± 0.305
5.445IleGly: 5.445 ± 0.923
0.986IleHis: 0.986 ± 0.206
4.759IleIle: 4.759 ± 0.503
6.388IleLys: 6.388 ± 0.634
4.116IleLeu: 4.116 ± 0.393
2.101IleMet: 2.101 ± 0.355
5.316IleAsn: 5.316 ± 0.492
2.615IlePro: 2.615 ± 0.369
1.672IleGln: 1.672 ± 0.23
2.144IleArg: 2.144 ± 0.314
5.102IleSer: 5.102 ± 0.395
4.287IleThr: 4.287 ± 0.454
4.673IleVal: 4.673 ± 0.464
0.643IleTrp: 0.643 ± 0.173
2.787IleTyr: 2.787 ± 0.358
0.0IleXaa: 0.0 ± 0.0
Lys
4.244LysAla: 4.244 ± 0.751
0.557LysCys: 0.557 ± 0.182
5.445LysAsp: 5.445 ± 0.523
5.016LysGlu: 5.016 ± 0.585
3.258LysPhe: 3.258 ± 0.352
3.859LysGly: 3.859 ± 0.64
1.286LysHis: 1.286 ± 0.248
6.774LysIle: 6.774 ± 0.596
6.431LysLys: 6.431 ± 0.526
6.645LysLeu: 6.645 ± 0.644
2.872LysMet: 2.872 ± 0.429
6.431LysAsn: 6.431 ± 0.587
1.972LysPro: 1.972 ± 0.276
3.215LysGln: 3.215 ± 0.407
3.173LysArg: 3.173 ± 0.345
5.016LysSer: 5.016 ± 0.635
5.188LysThr: 5.188 ± 0.543
4.63LysVal: 4.63 ± 0.435
1.2LysTrp: 1.2 ± 0.396
4.373LysTyr: 4.373 ± 0.466
0.0LysXaa: 0.0 ± 0.0
Leu
4.887LeuAla: 4.887 ± 0.604
0.557LeuCys: 0.557 ± 0.168
5.874LeuAsp: 5.874 ± 0.678
5.273LeuGlu: 5.273 ± 0.401
2.872LeuPhe: 2.872 ± 0.347
5.145LeuGly: 5.145 ± 0.643
1.286LeuHis: 1.286 ± 0.237
5.23LeuIle: 5.23 ± 0.558
6.302LeuLys: 6.302 ± 0.51
5.916LeuLeu: 5.916 ± 0.47
2.272LeuMet: 2.272 ± 0.431
4.33LeuAsn: 4.33 ± 0.487
2.658LeuPro: 2.658 ± 0.252
2.272LeuGln: 2.272 ± 0.249
3.215LeuArg: 3.215 ± 0.38
6.345LeuSer: 6.345 ± 0.445
4.544LeuThr: 4.544 ± 0.437
4.802LeuVal: 4.802 ± 0.557
0.9LeuTrp: 0.9 ± 0.18
2.701LeuTyr: 2.701 ± 0.46
0.0LeuXaa: 0.0 ± 0.0
Met
1.715MetAla: 1.715 ± 0.307
0.171MetCys: 0.171 ± 0.089
1.115MetAsp: 1.115 ± 0.209
1.801MetGlu: 1.801 ± 0.311
1.072MetPhe: 1.072 ± 0.206
1.501MetGly: 1.501 ± 0.278
0.429MetHis: 0.429 ± 0.123
2.444MetIle: 2.444 ± 0.34
2.401MetLys: 2.401 ± 0.304
2.615MetLeu: 2.615 ± 0.343
0.6MetMet: 0.6 ± 0.19
1.672MetAsn: 1.672 ± 0.354
0.472MetPro: 0.472 ± 0.164
1.286MetGln: 1.286 ± 0.201
1.115MetArg: 1.115 ± 0.223
2.401MetSer: 2.401 ± 0.385
1.586MetThr: 1.586 ± 0.267
2.315MetVal: 2.315 ± 0.38
0.3MetTrp: 0.3 ± 0.136
0.686MetTyr: 0.686 ± 0.188
0.0MetXaa: 0.0 ± 0.0
Asn
3.987AsnAla: 3.987 ± 0.417
0.343AsnCys: 0.343 ± 0.124
4.073AsnAsp: 4.073 ± 0.399
4.03AsnGlu: 4.03 ± 0.496
2.186AsnPhe: 2.186 ± 0.359
5.745AsnGly: 5.745 ± 0.639
1.115AsnHis: 1.115 ± 0.212
4.759AsnIle: 4.759 ± 0.519
5.573AsnLys: 5.573 ± 0.487
3.601AsnLeu: 3.601 ± 0.498
1.844AsnMet: 1.844 ± 0.234
4.373AsnAsn: 4.373 ± 0.464
1.715AsnPro: 1.715 ± 0.269
2.144AsnGln: 2.144 ± 0.334
2.315AsnArg: 2.315 ± 0.37
4.073AsnSer: 4.073 ± 0.415
3.687AsnThr: 3.687 ± 0.39
4.202AsnVal: 4.202 ± 0.479
0.9AsnTrp: 0.9 ± 0.206
3.215AsnTyr: 3.215 ± 0.455
0.0AsnXaa: 0.0 ± 0.0
Pro
1.629ProAla: 1.629 ± 0.324
0.086ProCys: 0.086 ± 0.056
2.272ProAsp: 2.272 ± 0.318
2.186ProGlu: 2.186 ± 0.364
0.643ProPhe: 0.643 ± 0.17
1.286ProGly: 1.286 ± 0.209
0.429ProHis: 0.429 ± 0.148
2.101ProIle: 2.101 ± 0.38
2.015ProLys: 2.015 ± 0.29
1.501ProLeu: 1.501 ± 0.265
0.257ProMet: 0.257 ± 0.105
1.458ProAsn: 1.458 ± 0.196
0.386ProPro: 0.386 ± 0.14
1.072ProGln: 1.072 ± 0.223
0.772ProArg: 0.772 ± 0.252
2.144ProSer: 2.144 ± 0.28
1.629ProThr: 1.629 ± 0.238
2.401ProVal: 2.401 ± 0.336
0.343ProTrp: 0.343 ± 0.131
1.886ProTyr: 1.886 ± 0.261
0.0ProXaa: 0.0 ± 0.0
Gln
2.487GlnAla: 2.487 ± 0.501
0.386GlnCys: 0.386 ± 0.123
2.058GlnAsp: 2.058 ± 0.289
2.101GlnGlu: 2.101 ± 0.308
1.458GlnPhe: 1.458 ± 0.281
1.715GlnGly: 1.715 ± 0.31
0.729GlnHis: 0.729 ± 0.189
2.701GlnIle: 2.701 ± 0.321
2.101GlnLys: 2.101 ± 0.395
2.144GlnLeu: 2.144 ± 0.256
0.686GlnMet: 0.686 ± 0.162
1.801GlnAsn: 1.801 ± 0.296
0.986GlnPro: 0.986 ± 0.233
1.286GlnGln: 1.286 ± 0.272
1.2GlnArg: 1.2 ± 0.288
2.83GlnSer: 2.83 ± 0.293
2.744GlnThr: 2.744 ± 0.55
1.844GlnVal: 1.844 ± 0.229
0.171GlnTrp: 0.171 ± 0.078
1.929GlnTyr: 1.929 ± 0.229
0.0GlnXaa: 0.0 ± 0.0
Arg
2.058ArgAla: 2.058 ± 0.323
0.214ArgCys: 0.214 ± 0.084
2.83ArgAsp: 2.83 ± 0.46
2.358ArgGlu: 2.358 ± 0.403
1.501ArgPhe: 1.501 ± 0.235
2.058ArgGly: 2.058 ± 0.259
0.729ArgHis: 0.729 ± 0.159
2.058ArgIle: 2.058 ± 0.294
2.958ArgLys: 2.958 ± 0.366
2.787ArgLeu: 2.787 ± 0.395
0.986ArgMet: 0.986 ± 0.213
1.672ArgAsn: 1.672 ± 0.271
1.072ArgPro: 1.072 ± 0.212
0.857ArgGln: 0.857 ± 0.177
1.072ArgArg: 1.072 ± 0.216
2.658ArgSer: 2.658 ± 0.306
2.101ArgThr: 2.101 ± 0.318
2.015ArgVal: 2.015 ± 0.296
0.472ArgTrp: 0.472 ± 0.144
1.972ArgTyr: 1.972 ± 0.33
0.0ArgXaa: 0.0 ± 0.0
Ser
4.93SerAla: 4.93 ± 0.77
0.429SerCys: 0.429 ± 0.137
5.445SerAsp: 5.445 ± 0.536
4.416SerGlu: 4.416 ± 0.456
3.001SerPhe: 3.001 ± 0.333
6.002SerGly: 6.002 ± 0.772
1.243SerHis: 1.243 ± 0.221
4.502SerIle: 4.502 ± 0.42
5.788SerLys: 5.788 ± 0.659
5.273SerLeu: 5.273 ± 0.466
1.458SerMet: 1.458 ± 0.247
4.973SerAsn: 4.973 ± 0.478
1.715SerPro: 1.715 ± 0.288
2.701SerGln: 2.701 ± 0.587
2.101SerArg: 2.101 ± 0.261
6.174SerSer: 6.174 ± 0.684
3.773SerThr: 3.773 ± 0.534
4.416SerVal: 4.416 ± 0.359
1.158SerTrp: 1.158 ± 0.216
3.601SerTyr: 3.601 ± 0.467
0.0SerXaa: 0.0 ± 0.0
Thr
4.159ThrAla: 4.159 ± 0.615
0.3ThrCys: 0.3 ± 0.106
4.544ThrAsp: 4.544 ± 0.467
3.301ThrGlu: 3.301 ± 0.333
2.144ThrPhe: 2.144 ± 0.296
4.63ThrGly: 4.63 ± 0.538
0.986ThrHis: 0.986 ± 0.209
4.673ThrIle: 4.673 ± 0.547
3.901ThrLys: 3.901 ± 0.425
4.716ThrLeu: 4.716 ± 0.406
1.158ThrMet: 1.158 ± 0.198
3.301ThrAsn: 3.301 ± 0.403
1.929ThrPro: 1.929 ± 0.326
1.886ThrGln: 1.886 ± 0.289
1.415ThrArg: 1.415 ± 0.265
4.459ThrSer: 4.459 ± 0.571
3.859ThrThr: 3.859 ± 0.465
4.459ThrVal: 4.459 ± 0.685
0.386ThrTrp: 0.386 ± 0.14
2.83ThrTyr: 2.83 ± 0.359
0.0ThrXaa: 0.0 ± 0.0
Val
4.03ValAla: 4.03 ± 0.577
0.514ValCys: 0.514 ± 0.15
4.33ValAsp: 4.33 ± 0.572
3.473ValGlu: 3.473 ± 0.352
2.701ValPhe: 2.701 ± 0.375
4.845ValGly: 4.845 ± 0.454
0.857ValHis: 0.857 ± 0.189
4.33ValIle: 4.33 ± 0.494
5.359ValLys: 5.359 ± 0.497
5.402ValLeu: 5.402 ± 0.595
1.415ValMet: 1.415 ± 0.286
4.33ValAsn: 4.33 ± 0.463
1.629ValPro: 1.629 ± 0.255
1.801ValGln: 1.801 ± 0.385
1.886ValArg: 1.886 ± 0.273
6.088ValSer: 6.088 ± 0.52
3.901ValThr: 3.901 ± 0.395
4.716ValVal: 4.716 ± 0.478
0.772ValTrp: 0.772 ± 0.207
2.444ValTyr: 2.444 ± 0.352
0.0ValXaa: 0.0 ± 0.0
Trp
0.815TrpAla: 0.815 ± 0.176
0.257TrpCys: 0.257 ± 0.107
1.072TrpAsp: 1.072 ± 0.217
0.686TrpGlu: 0.686 ± 0.157
1.115TrpPhe: 1.115 ± 0.259
0.857TrpGly: 0.857 ± 0.172
0.171TrpHis: 0.171 ± 0.092
0.815TrpIle: 0.815 ± 0.186
0.729TrpLys: 0.729 ± 0.199
1.029TrpLeu: 1.029 ± 0.235
0.429TrpMet: 0.429 ± 0.142
0.772TrpAsn: 0.772 ± 0.173
0.257TrpPro: 0.257 ± 0.109
0.429TrpGln: 0.429 ± 0.12
0.557TrpArg: 0.557 ± 0.171
1.072TrpSer: 1.072 ± 0.22
0.729TrpThr: 0.729 ± 0.155
0.729TrpVal: 0.729 ± 0.167
0.3TrpTrp: 0.3 ± 0.121
0.514TrpTyr: 0.514 ± 0.156
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.487TyrAla: 2.487 ± 0.31
0.815TyrCys: 0.815 ± 0.218
4.159TyrAsp: 4.159 ± 0.563
1.801TyrGlu: 1.801 ± 0.342
1.629TyrPhe: 1.629 ± 0.31
4.373TyrGly: 4.373 ± 0.476
0.943TyrHis: 0.943 ± 0.226
2.701TyrIle: 2.701 ± 0.343
3.516TyrLys: 3.516 ± 0.441
3.516TyrLeu: 3.516 ± 0.471
1.158TyrMet: 1.158 ± 0.225
3.13TyrAsn: 3.13 ± 0.352
1.286TyrPro: 1.286 ± 0.243
2.358TyrGln: 2.358 ± 0.4
1.543TyrArg: 1.543 ± 0.27
3.215TyrSer: 3.215 ± 0.456
2.444TyrThr: 2.444 ± 0.403
2.658TyrVal: 2.658 ± 0.407
0.643TyrTrp: 0.643 ± 0.135
2.144TyrTyr: 2.144 ± 0.301
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 131 proteins (23326 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski