Amino acid dipepetide frequency for Lactococcus phage 53802

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.311AlaAla: 4.311 ± 0.824
0.176AlaCys: 0.176 ± 0.113
3.608AlaAsp: 3.608 ± 0.663
4.223AlaGlu: 4.223 ± 0.863
2.904AlaPhe: 2.904 ± 0.4
4.048AlaGly: 4.048 ± 0.709
1.144AlaHis: 1.144 ± 0.315
5.191AlaIle: 5.191 ± 0.823
5.191AlaLys: 5.191 ± 0.679
5.807AlaLeu: 5.807 ± 0.795
1.584AlaMet: 1.584 ± 0.262
4.223AlaAsn: 4.223 ± 0.602
1.232AlaPro: 1.232 ± 0.352
2.464AlaGln: 2.464 ± 0.392
2.728AlaArg: 2.728 ± 0.51
4.311AlaSer: 4.311 ± 0.748
3.52AlaThr: 3.52 ± 0.616
3.168AlaVal: 3.168 ± 0.759
0.968AlaTrp: 0.968 ± 0.258
2.728AlaTyr: 2.728 ± 0.574
0.0AlaXaa: 0.0 ± 0.0
Cys
0.088CysAla: 0.088 ± 0.089
0.0CysCys: 0.0 ± 0.0
0.528CysAsp: 0.528 ± 0.212
0.616CysGlu: 0.616 ± 0.206
0.088CysPhe: 0.088 ± 0.099
0.352CysGly: 0.352 ± 0.161
0.0CysHis: 0.0 ± 0.0
0.264CysIle: 0.264 ± 0.128
0.352CysLys: 0.352 ± 0.176
0.176CysLeu: 0.176 ± 0.176
0.0CysMet: 0.0 ± 0.0
0.264CysAsn: 0.264 ± 0.146
0.176CysPro: 0.176 ± 0.12
0.0CysGln: 0.0 ± 0.0
0.352CysArg: 0.352 ± 0.144
0.352CysSer: 0.352 ± 0.177
0.352CysThr: 0.352 ± 0.201
0.176CysVal: 0.176 ± 0.112
0.088CysTrp: 0.088 ± 0.095
0.264CysTyr: 0.264 ± 0.123
0.0CysXaa: 0.0 ± 0.0
Asp
2.728AspAla: 2.728 ± 0.506
0.352AspCys: 0.352 ± 0.171
4.311AspAsp: 4.311 ± 0.635
5.455AspGlu: 5.455 ± 0.933
2.904AspPhe: 2.904 ± 0.481
4.048AspGly: 4.048 ± 0.593
0.44AspHis: 0.44 ± 0.218
5.543AspIle: 5.543 ± 0.628
5.807AspLys: 5.807 ± 0.922
4.839AspLeu: 4.839 ± 0.624
1.408AspMet: 1.408 ± 0.406
3.872AspAsn: 3.872 ± 0.352
1.056AspPro: 1.056 ± 0.301
1.584AspGln: 1.584 ± 0.368
1.672AspArg: 1.672 ± 0.353
2.728AspSer: 2.728 ± 0.567
4.663AspThr: 4.663 ± 0.649
3.96AspVal: 3.96 ± 0.48
1.408AspTrp: 1.408 ± 0.308
3.344AspTyr: 3.344 ± 0.492
0.0AspXaa: 0.0 ± 0.0
Glu
4.575GluAla: 4.575 ± 0.651
0.176GluCys: 0.176 ± 0.104
2.728GluAsp: 2.728 ± 0.526
5.895GluGlu: 5.895 ± 1.091
3.696GluPhe: 3.696 ± 0.479
3.608GluGly: 3.608 ± 0.625
1.408GluHis: 1.408 ± 0.468
5.103GluIle: 5.103 ± 0.716
6.511GluLys: 6.511 ± 0.957
8.887GluLeu: 8.887 ± 0.83
2.024GluMet: 2.024 ± 0.443
3.784GluAsn: 3.784 ± 0.626
2.024GluPro: 2.024 ± 0.504
4.048GluGln: 4.048 ± 0.711
3.168GluArg: 3.168 ± 0.58
3.696GluSer: 3.696 ± 0.606
4.223GluThr: 4.223 ± 0.716
5.543GluVal: 5.543 ± 0.779
0.968GluTrp: 0.968 ± 0.287
2.728GluTyr: 2.728 ± 0.509
0.0GluXaa: 0.0 ± 0.0
Phe
2.112PheAla: 2.112 ± 0.394
0.616PheCys: 0.616 ± 0.232
3.696PheAsp: 3.696 ± 0.48
3.52PheGlu: 3.52 ± 0.667
1.408PhePhe: 1.408 ± 0.359
2.64PheGly: 2.64 ± 0.53
0.968PheHis: 0.968 ± 0.298
3.344PheIle: 3.344 ± 0.482
4.487PheLys: 4.487 ± 0.656
2.816PheLeu: 2.816 ± 0.439
1.496PheMet: 1.496 ± 0.464
3.256PheAsn: 3.256 ± 0.605
0.704PhePro: 0.704 ± 0.301
1.32PheGln: 1.32 ± 0.37
1.496PheArg: 1.496 ± 0.302
3.344PheSer: 3.344 ± 0.514
3.52PheThr: 3.52 ± 0.77
2.2PheVal: 2.2 ± 0.466
0.088PheTrp: 0.088 ± 0.088
1.936PheTyr: 1.936 ± 0.475
0.0PheXaa: 0.0 ± 0.0
Gly
4.399GlyAla: 4.399 ± 0.881
0.264GlyCys: 0.264 ± 0.109
3.872GlyAsp: 3.872 ± 0.687
2.816GlyGlu: 2.816 ± 0.453
2.376GlyPhe: 2.376 ± 0.447
5.279GlyGly: 5.279 ± 1.004
0.352GlyHis: 0.352 ± 0.163
5.191GlyIle: 5.191 ± 0.85
5.807GlyLys: 5.807 ± 0.726
5.103GlyLeu: 5.103 ± 0.697
2.112GlyMet: 2.112 ± 0.512
3.784GlyAsn: 3.784 ± 0.669
0.88GlyPro: 0.88 ± 0.287
2.552GlyGln: 2.552 ± 0.628
2.112GlyArg: 2.112 ± 0.483
3.872GlySer: 3.872 ± 0.528
4.575GlyThr: 4.575 ± 0.772
5.015GlyVal: 5.015 ± 0.849
0.792GlyTrp: 0.792 ± 0.291
3.168GlyTyr: 3.168 ± 0.499
0.0GlyXaa: 0.0 ± 0.0
His
0.968HisAla: 0.968 ± 0.318
0.088HisCys: 0.088 ± 0.097
0.616HisAsp: 0.616 ± 0.265
1.232HisGlu: 1.232 ± 0.365
0.704HisPhe: 0.704 ± 0.29
0.704HisGly: 0.704 ± 0.231
0.264HisHis: 0.264 ± 0.157
0.528HisIle: 0.528 ± 0.251
1.144HisLys: 1.144 ± 0.272
0.704HisLeu: 0.704 ± 0.24
0.528HisMet: 0.528 ± 0.198
0.616HisAsn: 0.616 ± 0.249
0.528HisPro: 0.528 ± 0.163
0.792HisGln: 0.792 ± 0.244
0.352HisArg: 0.352 ± 0.163
0.704HisSer: 0.704 ± 0.26
0.352HisThr: 0.352 ± 0.173
1.144HisVal: 1.144 ± 0.375
0.176HisTrp: 0.176 ± 0.153
1.056HisTyr: 1.056 ± 0.299
0.0HisXaa: 0.0 ± 0.0
Ile
3.872IleAla: 3.872 ± 0.587
0.616IleCys: 0.616 ± 0.237
4.136IleAsp: 4.136 ± 0.607
5.983IleGlu: 5.983 ± 0.809
2.728IlePhe: 2.728 ± 0.612
4.487IleGly: 4.487 ± 0.595
1.056IleHis: 1.056 ± 0.432
4.487IleIle: 4.487 ± 0.688
6.775IleLys: 6.775 ± 0.709
4.048IleLeu: 4.048 ± 0.61
1.496IleMet: 1.496 ± 0.295
5.015IleAsn: 5.015 ± 0.611
1.76IlePro: 1.76 ± 0.4
4.223IleGln: 4.223 ± 0.619
2.64IleArg: 2.64 ± 0.476
5.103IleSer: 5.103 ± 0.731
3.96IleThr: 3.96 ± 0.567
3.608IleVal: 3.608 ± 0.678
0.968IleTrp: 0.968 ± 0.265
2.992IleTyr: 2.992 ± 0.577
0.0IleXaa: 0.0 ± 0.0
Lys
6.071LysAla: 6.071 ± 0.956
0.176LysCys: 0.176 ± 0.14
4.927LysAsp: 4.927 ± 0.622
6.951LysGlu: 6.951 ± 0.828
3.696LysPhe: 3.696 ± 0.526
4.663LysGly: 4.663 ± 0.663
2.024LysHis: 2.024 ± 0.504
5.103LysIle: 5.103 ± 0.609
8.183LysLys: 8.183 ± 1.159
8.271LysLeu: 8.271 ± 1.157
1.76LysMet: 1.76 ± 0.326
6.159LysAsn: 6.159 ± 0.797
2.2LysPro: 2.2 ± 0.342
4.487LysGln: 4.487 ± 0.595
3.696LysArg: 3.696 ± 0.81
5.279LysSer: 5.279 ± 0.761
5.279LysThr: 5.279 ± 0.636
5.367LysVal: 5.367 ± 0.738
0.704LysTrp: 0.704 ± 0.227
3.784LysTyr: 3.784 ± 0.693
0.0LysXaa: 0.0 ± 0.0
Leu
5.631LeuAla: 5.631 ± 0.675
0.088LeuCys: 0.088 ± 0.091
6.071LeuAsp: 6.071 ± 0.656
5.279LeuGlu: 5.279 ± 0.847
2.992LeuPhe: 2.992 ± 0.498
4.839LeuGly: 4.839 ± 0.615
0.968LeuHis: 0.968 ± 0.311
5.191LeuIle: 5.191 ± 0.71
6.951LeuLys: 6.951 ± 0.782
6.159LeuLeu: 6.159 ± 0.745
1.32LeuMet: 1.32 ± 0.335
4.839LeuAsn: 4.839 ± 0.635
3.432LeuPro: 3.432 ± 0.534
3.872LeuGln: 3.872 ± 0.704
1.584LeuArg: 1.584 ± 0.472
6.951LeuSer: 6.951 ± 0.767
5.367LeuThr: 5.367 ± 0.719
3.432LeuVal: 3.432 ± 0.647
0.88LeuTrp: 0.88 ± 0.282
2.64LeuTyr: 2.64 ± 0.347
0.0LeuXaa: 0.0 ± 0.0
Met
1.672MetAla: 1.672 ± 0.368
0.088MetCys: 0.088 ± 0.098
1.496MetAsp: 1.496 ± 0.361
1.408MetGlu: 1.408 ± 0.296
0.792MetPhe: 0.792 ± 0.215
1.144MetGly: 1.144 ± 0.327
0.264MetHis: 0.264 ± 0.152
1.408MetIle: 1.408 ± 0.265
2.376MetLys: 2.376 ± 0.566
0.968MetLeu: 0.968 ± 0.301
0.44MetMet: 0.44 ± 0.222
2.024MetAsn: 2.024 ± 0.479
0.44MetPro: 0.44 ± 0.241
1.32MetGln: 1.32 ± 0.308
1.144MetArg: 1.144 ± 0.328
1.584MetSer: 1.584 ± 0.317
3.168MetThr: 3.168 ± 0.519
0.792MetVal: 0.792 ± 0.244
0.264MetTrp: 0.264 ± 0.163
0.792MetTyr: 0.792 ± 0.276
0.0MetXaa: 0.0 ± 0.0
Asn
3.52AsnAla: 3.52 ± 0.598
0.176AsnCys: 0.176 ± 0.106
3.08AsnAsp: 3.08 ± 0.391
4.136AsnGlu: 4.136 ± 0.53
2.728AsnPhe: 2.728 ± 0.406
5.983AsnGly: 5.983 ± 1.158
0.968AsnHis: 0.968 ± 0.337
4.223AsnIle: 4.223 ± 0.52
5.191AsnLys: 5.191 ± 0.545
4.487AsnLeu: 4.487 ± 0.633
1.32AsnMet: 1.32 ± 0.362
3.344AsnAsn: 3.344 ± 0.594
2.816AsnPro: 2.816 ± 0.532
2.728AsnGln: 2.728 ± 0.392
1.848AsnArg: 1.848 ± 0.463
4.311AsnSer: 4.311 ± 0.542
3.168AsnThr: 3.168 ± 0.428
4.223AsnVal: 4.223 ± 0.666
0.616AsnTrp: 0.616 ± 0.249
2.376AsnTyr: 2.376 ± 0.408
0.0AsnXaa: 0.0 ± 0.0
Pro
1.144ProAla: 1.144 ± 0.329
0.0ProCys: 0.0 ± 0.0
2.112ProAsp: 2.112 ± 0.498
1.584ProGlu: 1.584 ± 0.39
1.584ProPhe: 1.584 ± 0.379
0.616ProGly: 0.616 ± 0.259
0.616ProHis: 0.616 ± 0.218
1.672ProIle: 1.672 ± 0.481
3.168ProLys: 3.168 ± 0.586
2.024ProLeu: 2.024 ± 0.372
0.352ProMet: 0.352 ± 0.184
1.76ProAsn: 1.76 ± 0.532
0.616ProPro: 0.616 ± 0.22
0.792ProGln: 0.792 ± 0.24
0.968ProArg: 0.968 ± 0.342
2.2ProSer: 2.2 ± 0.492
1.76ProThr: 1.76 ± 0.345
2.2ProVal: 2.2 ± 0.379
0.352ProTrp: 0.352 ± 0.16
0.704ProTyr: 0.704 ± 0.221
0.0ProXaa: 0.0 ± 0.0
Gln
4.575GlnAla: 4.575 ± 0.646
0.088GlnCys: 0.088 ± 0.079
1.32GlnAsp: 1.32 ± 0.345
4.575GlnGlu: 4.575 ± 0.597
1.408GlnPhe: 1.408 ± 0.371
2.552GlnGly: 2.552 ± 0.542
0.528GlnHis: 0.528 ± 0.187
2.904GlnIle: 2.904 ± 0.455
3.696GlnLys: 3.696 ± 0.594
3.256GlnLeu: 3.256 ± 0.536
1.232GlnMet: 1.232 ± 0.399
2.376GlnAsn: 2.376 ± 0.476
1.496GlnPro: 1.496 ± 0.57
2.112GlnGln: 2.112 ± 0.438
1.408GlnArg: 1.408 ± 0.356
2.552GlnSer: 2.552 ± 0.594
2.816GlnThr: 2.816 ± 0.443
3.08GlnVal: 3.08 ± 0.593
0.792GlnTrp: 0.792 ± 0.265
1.848GlnTyr: 1.848 ± 0.383
0.0GlnXaa: 0.0 ± 0.0
Arg
1.76ArgAla: 1.76 ± 0.45
0.44ArgCys: 0.44 ± 0.195
2.2ArgAsp: 2.2 ± 0.387
2.2ArgGlu: 2.2 ± 0.519
2.376ArgPhe: 2.376 ± 0.456
2.112ArgGly: 2.112 ± 0.404
0.264ArgHis: 0.264 ± 0.199
2.816ArgIle: 2.816 ± 0.603
2.992ArgLys: 2.992 ± 0.515
2.992ArgLeu: 2.992 ± 0.669
1.144ArgMet: 1.144 ± 0.359
1.584ArgAsn: 1.584 ± 0.378
0.616ArgPro: 0.616 ± 0.326
1.496ArgGln: 1.496 ± 0.321
1.232ArgArg: 1.232 ± 0.329
1.848ArgSer: 1.848 ± 0.293
1.848ArgThr: 1.848 ± 0.306
2.024ArgVal: 2.024 ± 0.313
0.176ArgTrp: 0.176 ± 0.128
2.024ArgTyr: 2.024 ± 0.423
0.0ArgXaa: 0.0 ± 0.0
Ser
4.311SerAla: 4.311 ± 0.654
0.352SerCys: 0.352 ± 0.206
4.927SerAsp: 4.927 ± 0.575
5.455SerGlu: 5.455 ± 0.705
3.344SerPhe: 3.344 ± 0.575
5.279SerGly: 5.279 ± 0.858
0.528SerHis: 0.528 ± 0.225
3.344SerIle: 3.344 ± 0.512
5.279SerLys: 5.279 ± 0.988
5.103SerLeu: 5.103 ± 0.811
1.496SerMet: 1.496 ± 0.381
4.751SerAsn: 4.751 ± 0.74
0.616SerPro: 0.616 ± 0.283
3.168SerGln: 3.168 ± 0.549
1.496SerArg: 1.496 ± 0.26
5.015SerSer: 5.015 ± 0.86
4.223SerThr: 4.223 ± 0.525
4.575SerVal: 4.575 ± 0.535
1.056SerTrp: 1.056 ± 0.287
2.464SerTyr: 2.464 ± 0.437
0.0SerXaa: 0.0 ± 0.0
Thr
4.927ThrAla: 4.927 ± 0.698
0.44ThrCys: 0.44 ± 0.176
4.311ThrAsp: 4.311 ± 0.706
4.399ThrGlu: 4.399 ± 0.601
3.696ThrPhe: 3.696 ± 0.577
5.191ThrGly: 5.191 ± 0.654
0.352ThrHis: 0.352 ± 0.157
4.751ThrIle: 4.751 ± 0.669
4.663ThrLys: 4.663 ± 0.583
4.223ThrLeu: 4.223 ± 0.653
0.968ThrMet: 0.968 ± 0.33
3.256ThrAsn: 3.256 ± 0.586
2.2ThrPro: 2.2 ± 0.515
2.464ThrGln: 2.464 ± 0.439
2.64ThrArg: 2.64 ± 0.541
3.344ThrSer: 3.344 ± 0.461
3.52ThrThr: 3.52 ± 0.537
5.279ThrVal: 5.279 ± 0.885
0.704ThrTrp: 0.704 ± 0.211
1.672ThrTyr: 1.672 ± 0.627
0.0ThrXaa: 0.0 ± 0.0
Val
3.872ValAla: 3.872 ± 0.735
0.0ValCys: 0.0 ± 0.0
4.487ValAsp: 4.487 ± 0.529
5.455ValGlu: 5.455 ± 0.681
3.08ValPhe: 3.08 ± 0.519
3.256ValGly: 3.256 ± 0.534
0.616ValHis: 0.616 ± 0.234
5.191ValIle: 5.191 ± 0.743
5.719ValLys: 5.719 ± 0.802
4.575ValLeu: 4.575 ± 0.641
1.408ValMet: 1.408 ± 0.417
3.96ValAsn: 3.96 ± 0.598
1.76ValPro: 1.76 ± 0.412
2.464ValGln: 2.464 ± 0.486
1.232ValArg: 1.232 ± 0.302
4.927ValSer: 4.927 ± 0.545
3.432ValThr: 3.432 ± 0.73
3.784ValVal: 3.784 ± 0.471
0.704ValTrp: 0.704 ± 0.229
2.552ValTyr: 2.552 ± 0.555
0.0ValXaa: 0.0 ± 0.0
Trp
1.056TrpAla: 1.056 ± 0.292
0.0TrpCys: 0.0 ± 0.0
0.528TrpAsp: 0.528 ± 0.175
0.968TrpGlu: 0.968 ± 0.313
0.616TrpPhe: 0.616 ± 0.219
0.44TrpGly: 0.44 ± 0.189
0.088TrpHis: 0.088 ± 0.079
1.056TrpIle: 1.056 ± 0.232
0.792TrpLys: 0.792 ± 0.234
1.232TrpLeu: 1.232 ± 0.355
0.176TrpMet: 0.176 ± 0.113
0.528TrpAsn: 0.528 ± 0.214
0.176TrpPro: 0.176 ± 0.119
1.144TrpGln: 1.144 ± 0.337
0.528TrpArg: 0.528 ± 0.191
0.792TrpSer: 0.792 ± 0.249
1.056TrpThr: 1.056 ± 0.358
0.704TrpVal: 0.704 ± 0.245
0.088TrpTrp: 0.088 ± 0.088
0.616TrpTyr: 0.616 ± 0.247
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.936TyrAla: 1.936 ± 0.326
0.352TyrCys: 0.352 ± 0.192
3.168TyrAsp: 3.168 ± 0.592
2.64TyrGlu: 2.64 ± 0.516
1.848TyrPhe: 1.848 ± 0.424
3.168TyrGly: 3.168 ± 0.531
0.352TyrHis: 0.352 ± 0.285
2.64TyrIle: 2.64 ± 0.529
3.696TyrLys: 3.696 ± 0.544
2.816TyrLeu: 2.816 ± 0.507
1.232TyrMet: 1.232 ± 0.362
1.848TyrAsn: 1.848 ± 0.413
1.496TyrPro: 1.496 ± 0.387
1.584TyrGln: 1.584 ± 0.335
1.848TyrArg: 1.848 ± 0.421
3.872TyrSer: 3.872 ± 0.572
2.2TyrThr: 2.2 ± 0.472
2.2TyrVal: 2.2 ± 0.38
0.792TyrTrp: 0.792 ± 0.243
1.056TyrTyr: 1.056 ± 0.319
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 58 proteins (11366 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski