Amino acid dipepetide frequency for Lactococcus phage bIL285

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.619AlaAla: 2.619 ± 0.523
0.271AlaCys: 0.271 ± 0.163
4.245AlaAsp: 4.245 ± 0.627
3.974AlaGlu: 3.974 ± 0.579
2.89AlaPhe: 2.89 ± 0.518
3.252AlaGly: 3.252 ± 0.615
0.542AlaHis: 0.542 ± 0.244
4.787AlaIle: 4.787 ± 0.743
5.51AlaLys: 5.51 ± 0.644
5.6AlaLeu: 5.6 ± 0.608
1.626AlaMet: 1.626 ± 0.367
2.8AlaAsn: 2.8 ± 0.437
1.807AlaPro: 1.807 ± 0.345
3.523AlaGln: 3.523 ± 0.486
1.536AlaArg: 1.536 ± 0.382
4.516AlaSer: 4.516 ± 0.66
3.884AlaThr: 3.884 ± 0.735
3.703AlaVal: 3.703 ± 0.911
1.445AlaTrp: 1.445 ± 0.322
1.626AlaTyr: 1.626 ± 0.441
0.0AlaXaa: 0.0 ± 0.0
Cys
0.09CysAla: 0.09 ± 0.095
0.0CysCys: 0.0 ± 0.0
0.452CysAsp: 0.452 ± 0.255
0.542CysGlu: 0.542 ± 0.248
0.361CysPhe: 0.361 ± 0.224
0.632CysGly: 0.632 ± 0.261
0.542CysHis: 0.542 ± 0.336
0.361CysIle: 0.361 ± 0.162
0.542CysLys: 0.542 ± 0.21
0.723CysLeu: 0.723 ± 0.307
0.09CysMet: 0.09 ± 0.093
0.09CysAsn: 0.09 ± 0.091
0.09CysPro: 0.09 ± 0.096
0.0CysGln: 0.0 ± 0.0
0.361CysArg: 0.361 ± 0.169
0.813CysSer: 0.813 ± 0.286
0.271CysThr: 0.271 ± 0.155
0.271CysVal: 0.271 ± 0.203
0.181CysTrp: 0.181 ± 0.132
0.271CysTyr: 0.271 ± 0.155
0.0CysXaa: 0.0 ± 0.0
Asp
2.439AspAla: 2.439 ± 0.441
0.723AspCys: 0.723 ± 0.245
4.155AspAsp: 4.155 ± 0.865
5.42AspGlu: 5.42 ± 0.696
2.89AspPhe: 2.89 ± 0.635
4.516AspGly: 4.516 ± 0.854
0.632AspHis: 0.632 ± 0.239
5.239AspIle: 5.239 ± 0.731
5.962AspLys: 5.962 ± 0.879
6.232AspLeu: 6.232 ± 0.786
1.355AspMet: 1.355 ± 0.313
3.794AspAsn: 3.794 ± 0.496
1.807AspPro: 1.807 ± 0.41
1.536AspGln: 1.536 ± 0.369
1.536AspArg: 1.536 ± 0.354
4.787AspSer: 4.787 ± 0.594
3.342AspThr: 3.342 ± 0.536
3.794AspVal: 3.794 ± 0.73
1.265AspTrp: 1.265 ± 0.268
2.8AspTyr: 2.8 ± 0.446
0.0AspXaa: 0.0 ± 0.0
Glu
4.878GluAla: 4.878 ± 0.801
0.632GluCys: 0.632 ± 0.262
3.342GluAsp: 3.342 ± 0.512
6.594GluGlu: 6.594 ± 1.143
3.794GluPhe: 3.794 ± 0.539
3.161GluGly: 3.161 ± 0.445
0.723GluHis: 0.723 ± 0.22
5.871GluIle: 5.871 ± 0.957
7.678GluLys: 7.678 ± 1.034
8.942GluLeu: 8.942 ± 0.974
2.258GluMet: 2.258 ± 0.407
3.342GluAsn: 3.342 ± 0.515
1.536GluPro: 1.536 ± 0.284
3.613GluGln: 3.613 ± 0.564
2.89GluArg: 2.89 ± 0.558
3.523GluSer: 3.523 ± 0.479
3.613GluThr: 3.613 ± 0.694
3.432GluVal: 3.432 ± 0.573
0.452GluTrp: 0.452 ± 0.18
2.71GluTyr: 2.71 ± 0.56
0.0GluXaa: 0.0 ± 0.0
Phe
2.71PheAla: 2.71 ± 0.47
0.181PheCys: 0.181 ± 0.137
3.794PheAsp: 3.794 ± 0.613
2.981PheGlu: 2.981 ± 0.587
1.626PhePhe: 1.626 ± 0.355
3.613PheGly: 3.613 ± 0.67
0.903PheHis: 0.903 ± 0.257
3.071PheIle: 3.071 ± 0.515
4.426PheLys: 4.426 ± 0.518
2.529PheLeu: 2.529 ± 0.558
1.626PheMet: 1.626 ± 0.362
2.981PheAsn: 2.981 ± 0.523
1.084PhePro: 1.084 ± 0.222
0.903PheGln: 0.903 ± 0.259
1.355PheArg: 1.355 ± 0.309
3.613PheSer: 3.613 ± 0.726
2.89PheThr: 2.89 ± 0.592
2.077PheVal: 2.077 ± 0.409
0.452PheTrp: 0.452 ± 0.179
1.807PheTyr: 1.807 ± 0.434
0.0PheXaa: 0.0 ± 0.0
Gly
3.613GlyAla: 3.613 ± 1.1
0.361GlyCys: 0.361 ± 0.216
3.252GlyAsp: 3.252 ± 0.652
4.426GlyGlu: 4.426 ± 0.664
3.703GlyPhe: 3.703 ± 0.685
5.058GlyGly: 5.058 ± 1.057
0.723GlyHis: 0.723 ± 0.299
3.252GlyIle: 3.252 ± 0.565
4.426GlyLys: 4.426 ± 0.535
5.149GlyLeu: 5.149 ± 0.876
1.807GlyMet: 1.807 ± 0.363
4.336GlyAsn: 4.336 ± 0.866
0.542GlyPro: 0.542 ± 0.266
3.071GlyGln: 3.071 ± 0.664
1.536GlyArg: 1.536 ± 0.292
4.516GlySer: 4.516 ± 0.922
4.426GlyThr: 4.426 ± 0.745
3.342GlyVal: 3.342 ± 0.55
1.174GlyTrp: 1.174 ± 0.33
3.252GlyTyr: 3.252 ± 0.53
0.0GlyXaa: 0.0 ± 0.0
His
1.807HisAla: 1.807 ± 0.532
0.181HisCys: 0.181 ± 0.163
0.994HisAsp: 0.994 ± 0.261
1.355HisGlu: 1.355 ± 0.325
0.903HisPhe: 0.903 ± 0.275
0.542HisGly: 0.542 ± 0.255
0.361HisHis: 0.361 ± 0.16
0.723HisIle: 0.723 ± 0.241
0.271HisLys: 0.271 ± 0.179
0.723HisLeu: 0.723 ± 0.306
0.181HisMet: 0.181 ± 0.12
0.813HisAsn: 0.813 ± 0.28
0.903HisPro: 0.903 ± 0.274
0.813HisGln: 0.813 ± 0.257
0.361HisArg: 0.361 ± 0.182
0.994HisSer: 0.994 ± 0.416
0.994HisThr: 0.994 ± 0.287
0.632HisVal: 0.632 ± 0.266
0.181HisTrp: 0.181 ± 0.133
0.542HisTyr: 0.542 ± 0.227
0.0HisXaa: 0.0 ± 0.0
Ile
4.968IleAla: 4.968 ± 0.723
0.181IleCys: 0.181 ± 0.141
4.697IleAsp: 4.697 ± 0.705
6.684IleGlu: 6.684 ± 0.825
1.536IlePhe: 1.536 ± 0.424
4.426IleGly: 4.426 ± 0.685
1.084IleHis: 1.084 ± 0.363
4.607IleIle: 4.607 ± 0.681
7.045IleLys: 7.045 ± 0.815
4.516IleLeu: 4.516 ± 0.675
1.626IleMet: 1.626 ± 0.342
3.342IleAsn: 3.342 ± 0.503
2.529IlePro: 2.529 ± 0.405
2.71IleGln: 2.71 ± 0.429
2.439IleArg: 2.439 ± 0.437
6.232IleSer: 6.232 ± 0.822
5.149IleThr: 5.149 ± 0.566
3.613IleVal: 3.613 ± 0.644
0.632IleTrp: 0.632 ± 0.3
2.258IleTyr: 2.258 ± 0.49
0.0IleXaa: 0.0 ± 0.0
Lys
5.691LysAla: 5.691 ± 0.656
0.632LysCys: 0.632 ± 0.254
5.781LysAsp: 5.781 ± 0.674
5.962LysGlu: 5.962 ± 0.952
3.884LysPhe: 3.884 ± 0.498
5.058LysGly: 5.058 ± 0.688
1.445LysHis: 1.445 ± 0.434
6.052LysIle: 6.052 ± 0.689
8.039LysLys: 8.039 ± 1.002
7.949LysLeu: 7.949 ± 0.85
2.348LysMet: 2.348 ± 0.473
5.962LysAsn: 5.962 ± 0.713
3.071LysPro: 3.071 ± 0.434
4.697LysGln: 4.697 ± 0.759
3.703LysArg: 3.703 ± 0.666
4.697LysSer: 4.697 ± 0.738
5.329LysThr: 5.329 ± 0.711
5.781LysVal: 5.781 ± 0.828
1.536LysTrp: 1.536 ± 0.371
3.342LysTyr: 3.342 ± 0.645
0.0LysXaa: 0.0 ± 0.0
Leu
4.516LeuAla: 4.516 ± 0.632
0.542LeuCys: 0.542 ± 0.259
6.232LeuAsp: 6.232 ± 0.786
6.594LeuGlu: 6.594 ± 0.838
2.89LeuPhe: 2.89 ± 0.439
5.6LeuGly: 5.6 ± 0.979
0.994LeuHis: 0.994 ± 0.255
4.878LeuIle: 4.878 ± 0.78
8.581LeuLys: 8.581 ± 0.937
7.136LeuLeu: 7.136 ± 0.878
2.981LeuMet: 2.981 ± 0.444
5.149LeuAsn: 5.149 ± 0.721
2.71LeuPro: 2.71 ± 0.496
2.71LeuGln: 2.71 ± 0.572
2.981LeuArg: 2.981 ± 0.57
6.865LeuSer: 6.865 ± 0.812
4.787LeuThr: 4.787 ± 0.617
3.703LeuVal: 3.703 ± 0.54
0.903LeuTrp: 0.903 ± 0.317
3.252LeuTyr: 3.252 ± 0.523
0.0LeuXaa: 0.0 ± 0.0
Met
2.258MetAla: 2.258 ± 0.396
0.09MetCys: 0.09 ± 0.096
1.716MetAsp: 1.716 ± 0.314
1.626MetGlu: 1.626 ± 0.466
1.174MetPhe: 1.174 ± 0.269
1.536MetGly: 1.536 ± 0.373
0.271MetHis: 0.271 ± 0.167
1.355MetIle: 1.355 ± 0.339
2.439MetLys: 2.439 ± 0.606
1.626MetLeu: 1.626 ± 0.352
0.723MetMet: 0.723 ± 0.262
1.265MetAsn: 1.265 ± 0.326
1.265MetPro: 1.265 ± 0.409
0.813MetGln: 0.813 ± 0.258
0.813MetArg: 0.813 ± 0.301
1.716MetSer: 1.716 ± 0.368
2.258MetThr: 2.258 ± 0.464
1.626MetVal: 1.626 ± 0.304
0.542MetTrp: 0.542 ± 0.203
0.994MetTyr: 0.994 ± 0.326
0.0MetXaa: 0.0 ± 0.0
Asn
3.161AsnAla: 3.161 ± 0.45
0.542AsnCys: 0.542 ± 0.227
3.161AsnAsp: 3.161 ± 0.499
2.619AsnGlu: 2.619 ± 0.486
2.258AsnPhe: 2.258 ± 0.47
4.968AsnGly: 4.968 ± 0.779
0.903AsnHis: 0.903 ± 0.406
3.794AsnIle: 3.794 ± 0.505
5.781AsnLys: 5.781 ± 0.575
3.613AsnLeu: 3.613 ± 0.601
0.723AsnMet: 0.723 ± 0.247
3.523AsnAsn: 3.523 ± 0.504
2.439AsnPro: 2.439 ± 0.394
2.529AsnGln: 2.529 ± 0.388
2.077AsnArg: 2.077 ± 0.505
3.252AsnSer: 3.252 ± 0.478
2.619AsnThr: 2.619 ± 0.447
3.884AsnVal: 3.884 ± 0.452
0.813AsnTrp: 0.813 ± 0.283
2.8AsnTyr: 2.8 ± 0.516
0.0AsnXaa: 0.0 ± 0.0
Pro
0.903ProAla: 0.903 ± 0.255
0.0ProCys: 0.0 ± 0.0
1.626ProAsp: 1.626 ± 0.349
2.529ProGlu: 2.529 ± 0.448
1.445ProPhe: 1.445 ± 0.472
0.723ProGly: 0.723 ± 0.217
0.542ProHis: 0.542 ± 0.236
2.168ProIle: 2.168 ± 0.453
2.89ProLys: 2.89 ± 0.521
2.8ProLeu: 2.8 ± 0.471
0.632ProMet: 0.632 ± 0.241
1.897ProAsn: 1.897 ± 0.355
0.723ProPro: 0.723 ± 0.21
2.258ProGln: 2.258 ± 0.544
1.265ProArg: 1.265 ± 0.385
1.897ProSer: 1.897 ± 0.432
2.168ProThr: 2.168 ± 0.423
1.536ProVal: 1.536 ± 0.406
0.271ProTrp: 0.271 ± 0.145
1.536ProTyr: 1.536 ± 0.292
0.0ProXaa: 0.0 ± 0.0
Gln
2.71GlnAla: 2.71 ± 0.502
0.361GlnCys: 0.361 ± 0.191
1.445GlnAsp: 1.445 ± 0.386
3.342GlnGlu: 3.342 ± 0.536
2.439GlnPhe: 2.439 ± 0.592
1.987GlnGly: 1.987 ± 0.381
0.271GlnHis: 0.271 ± 0.161
3.432GlnIle: 3.432 ± 0.572
3.884GlnLys: 3.884 ± 0.626
3.974GlnLeu: 3.974 ± 0.663
1.265GlnMet: 1.265 ± 0.328
2.529GlnAsn: 2.529 ± 0.419
1.355GlnPro: 1.355 ± 0.486
2.529GlnGln: 2.529 ± 0.54
1.897GlnArg: 1.897 ± 0.401
1.084GlnSer: 1.084 ± 0.395
2.258GlnThr: 2.258 ± 0.453
2.258GlnVal: 2.258 ± 0.527
0.903GlnTrp: 0.903 ± 0.27
1.536GlnTyr: 1.536 ± 0.34
0.0GlnXaa: 0.0 ± 0.0
Arg
2.619ArgAla: 2.619 ± 0.452
0.271ArgCys: 0.271 ± 0.151
2.529ArgAsp: 2.529 ± 0.503
2.619ArgGlu: 2.619 ± 0.506
1.807ArgPhe: 1.807 ± 0.583
1.897ArgGly: 1.897 ± 0.44
0.632ArgHis: 0.632 ± 0.249
2.71ArgIle: 2.71 ± 0.453
3.252ArgLys: 3.252 ± 0.628
3.703ArgLeu: 3.703 ± 0.704
1.084ArgMet: 1.084 ± 0.356
2.258ArgAsn: 2.258 ± 0.555
0.723ArgPro: 0.723 ± 0.358
1.265ArgGln: 1.265 ± 0.331
1.716ArgArg: 1.716 ± 0.411
2.077ArgSer: 2.077 ± 0.522
1.716ArgThr: 1.716 ± 0.388
1.536ArgVal: 1.536 ± 0.332
0.542ArgTrp: 0.542 ± 0.257
1.626ArgTyr: 1.626 ± 0.439
0.0ArgXaa: 0.0 ± 0.0
Ser
4.516SerAla: 4.516 ± 0.717
0.361SerCys: 0.361 ± 0.165
5.058SerAsp: 5.058 ± 0.598
4.155SerGlu: 4.155 ± 0.486
2.619SerPhe: 2.619 ± 0.432
5.149SerGly: 5.149 ± 0.749
0.181SerHis: 0.181 ± 0.147
4.155SerIle: 4.155 ± 0.593
5.871SerLys: 5.871 ± 0.683
4.968SerLeu: 4.968 ± 0.554
1.536SerMet: 1.536 ± 0.352
3.703SerAsn: 3.703 ± 0.553
2.258SerPro: 2.258 ± 0.524
2.529SerGln: 2.529 ± 0.46
2.439SerArg: 2.439 ± 0.43
4.787SerSer: 4.787 ± 0.995
3.523SerThr: 3.523 ± 0.484
5.329SerVal: 5.329 ± 0.562
1.084SerTrp: 1.084 ± 0.331
2.71SerTyr: 2.71 ± 0.45
0.0SerXaa: 0.0 ± 0.0
Thr
4.336ThrAla: 4.336 ± 0.682
0.181ThrCys: 0.181 ± 0.186
2.8ThrAsp: 2.8 ± 0.635
5.058ThrGlu: 5.058 ± 0.67
2.8ThrPhe: 2.8 ± 0.494
4.065ThrGly: 4.065 ± 0.731
1.265ThrHis: 1.265 ± 0.333
4.787ThrIle: 4.787 ± 0.518
5.51ThrLys: 5.51 ± 0.693
5.239ThrLeu: 5.239 ± 0.784
0.994ThrMet: 0.994 ± 0.227
2.348ThrAsn: 2.348 ± 0.561
1.897ThrPro: 1.897 ± 0.327
1.716ThrGln: 1.716 ± 0.277
2.529ThrArg: 2.529 ± 0.503
3.432ThrSer: 3.432 ± 0.506
4.607ThrThr: 4.607 ± 1.054
4.336ThrVal: 4.336 ± 0.554
0.813ThrTrp: 0.813 ± 0.307
2.619ThrTyr: 2.619 ± 0.467
0.0ThrXaa: 0.0 ± 0.0
Val
2.8ValAla: 2.8 ± 0.555
0.452ValCys: 0.452 ± 0.199
4.607ValAsp: 4.607 ± 0.711
3.703ValGlu: 3.703 ± 0.705
1.987ValPhe: 1.987 ± 0.353
3.432ValGly: 3.432 ± 0.579
1.174ValHis: 1.174 ± 0.41
4.426ValIle: 4.426 ± 0.546
5.058ValLys: 5.058 ± 0.583
4.155ValLeu: 4.155 ± 0.533
1.716ValMet: 1.716 ± 0.363
2.8ValAsn: 2.8 ± 0.503
1.716ValPro: 1.716 ± 0.385
1.716ValGln: 1.716 ± 0.319
2.168ValArg: 2.168 ± 0.519
4.336ValSer: 4.336 ± 0.801
4.607ValThr: 4.607 ± 0.691
3.703ValVal: 3.703 ± 0.538
0.723ValTrp: 0.723 ± 0.234
1.897ValTyr: 1.897 ± 0.473
0.0ValXaa: 0.0 ± 0.0
Trp
1.174TrpAla: 1.174 ± 0.31
0.0TrpCys: 0.0 ± 0.0
0.813TrpAsp: 0.813 ± 0.22
0.632TrpGlu: 0.632 ± 0.272
0.903TrpPhe: 0.903 ± 0.258
0.542TrpGly: 0.542 ± 0.227
0.542TrpHis: 0.542 ± 0.221
0.994TrpIle: 0.994 ± 0.264
0.723TrpLys: 0.723 ± 0.288
1.536TrpLeu: 1.536 ± 0.39
0.181TrpMet: 0.181 ± 0.144
0.813TrpAsn: 0.813 ± 0.26
0.181TrpPro: 0.181 ± 0.114
0.994TrpGln: 0.994 ± 0.306
0.813TrpArg: 0.813 ± 0.339
1.174TrpSer: 1.174 ± 0.387
1.174TrpThr: 1.174 ± 0.404
0.813TrpVal: 0.813 ± 0.321
0.542TrpTrp: 0.542 ± 0.223
0.542TrpTyr: 0.542 ± 0.204
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.348TyrAla: 2.348 ± 0.554
0.723TyrCys: 0.723 ± 0.314
3.523TyrAsp: 3.523 ± 0.566
2.168TyrGlu: 2.168 ± 0.441
2.71TyrPhe: 2.71 ± 0.611
1.807TyrGly: 1.807 ± 0.407
0.542TyrHis: 0.542 ± 0.212
3.613TyrIle: 3.613 ± 0.631
3.071TyrLys: 3.071 ± 0.51
2.981TyrLeu: 2.981 ± 0.516
1.265TyrMet: 1.265 ± 0.312
1.716TyrAsn: 1.716 ± 0.4
1.174TyrPro: 1.174 ± 0.357
1.536TyrGln: 1.536 ± 0.407
2.168TyrArg: 2.168 ± 0.53
2.619TyrSer: 2.619 ± 0.48
1.716TyrThr: 1.716 ± 0.381
1.807TyrVal: 1.807 ± 0.424
0.632TyrTrp: 0.632 ± 0.227
1.807TyrTyr: 1.807 ± 0.385
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 62 proteins (11072 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski