Amino acid dipepetide frequency for Lactobacillus prophage Lj928

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.761AlaAla: 4.761 ± 1.185
0.433AlaCys: 0.433 ± 0.249
4.675AlaAsp: 4.675 ± 0.68
4.588AlaGlu: 4.588 ± 0.671
2.251AlaPhe: 2.251 ± 0.505
3.463AlaGly: 3.463 ± 0.723
1.039AlaHis: 1.039 ± 0.41
6.406AlaIle: 6.406 ± 1.199
7.618AlaLys: 7.618 ± 0.991
6.666AlaLeu: 6.666 ± 1.132
1.991AlaMet: 1.991 ± 0.505
4.069AlaAsn: 4.069 ± 0.607
1.212AlaPro: 1.212 ± 0.386
3.376AlaGln: 3.376 ± 0.709
2.857AlaArg: 2.857 ± 0.574
4.848AlaSer: 4.848 ± 1.0
4.155AlaThr: 4.155 ± 0.678
3.723AlaVal: 3.723 ± 0.614
1.039AlaTrp: 1.039 ± 0.276
2.337AlaTyr: 2.337 ± 0.457
0.0AlaXaa: 0.0 ± 0.0
Cys
0.087CysAla: 0.087 ± 0.088
0.087CysCys: 0.087 ± 0.098
0.433CysAsp: 0.433 ± 0.198
0.173CysGlu: 0.173 ± 0.135
0.26CysPhe: 0.26 ± 0.168
0.519CysGly: 0.519 ± 0.285
0.173CysHis: 0.173 ± 0.133
0.346CysIle: 0.346 ± 0.192
0.433CysLys: 0.433 ± 0.255
0.693CysLeu: 0.693 ± 0.294
0.173CysMet: 0.173 ± 0.14
0.26CysAsn: 0.26 ± 0.157
0.087CysPro: 0.087 ± 0.1
0.346CysGln: 0.346 ± 0.161
0.087CysArg: 0.087 ± 0.088
0.433CysSer: 0.433 ± 0.19
0.173CysThr: 0.173 ± 0.117
0.26CysVal: 0.26 ± 0.147
0.26CysTrp: 0.26 ± 0.173
0.173CysTyr: 0.173 ± 0.138
0.0CysXaa: 0.0 ± 0.0
Asp
5.368AspAla: 5.368 ± 0.766
0.173AspCys: 0.173 ± 0.127
4.415AspAsp: 4.415 ± 0.758
4.588AspGlu: 4.588 ± 0.633
2.943AspPhe: 2.943 ± 0.57
4.675AspGly: 4.675 ± 0.748
0.779AspHis: 0.779 ± 0.264
4.502AspIle: 4.502 ± 0.638
5.887AspLys: 5.887 ± 0.781
5.368AspLeu: 5.368 ± 0.719
1.039AspMet: 1.039 ± 0.241
4.502AspAsn: 4.502 ± 0.643
1.905AspPro: 1.905 ± 0.673
2.77AspGln: 2.77 ± 0.538
2.857AspArg: 2.857 ± 0.534
4.242AspSer: 4.242 ± 0.732
4.069AspThr: 4.069 ± 0.472
3.203AspVal: 3.203 ± 0.526
1.385AspTrp: 1.385 ± 0.447
3.636AspTyr: 3.636 ± 0.802
0.0AspXaa: 0.0 ± 0.0
Glu
3.809GluAla: 3.809 ± 0.553
0.346GluCys: 0.346 ± 0.206
3.982GluAsp: 3.982 ± 0.556
4.329GluGlu: 4.329 ± 0.618
1.991GluPhe: 1.991 ± 0.421
2.77GluGly: 2.77 ± 0.429
0.952GluHis: 0.952 ± 0.248
3.809GluIle: 3.809 ± 0.694
6.147GluLys: 6.147 ± 0.603
4.935GluLeu: 4.935 ± 0.789
1.558GluMet: 1.558 ± 0.419
3.203GluAsn: 3.203 ± 0.574
1.472GluPro: 1.472 ± 0.396
2.511GluGln: 2.511 ± 0.486
1.905GluArg: 1.905 ± 0.359
3.549GluSer: 3.549 ± 0.588
3.809GluThr: 3.809 ± 0.55
3.809GluVal: 3.809 ± 0.74
0.433GluTrp: 0.433 ± 0.197
2.77GluTyr: 2.77 ± 0.581
0.0GluXaa: 0.0 ± 0.0
Phe
1.645PheAla: 1.645 ± 0.42
0.693PheCys: 0.693 ± 0.23
4.155PheAsp: 4.155 ± 0.676
2.251PheGlu: 2.251 ± 0.443
1.299PhePhe: 1.299 ± 0.383
2.511PheGly: 2.511 ± 0.447
0.779PheHis: 0.779 ± 0.203
2.684PheIle: 2.684 ± 0.544
3.376PheLys: 3.376 ± 0.577
1.991PheLeu: 1.991 ± 0.409
1.039PheMet: 1.039 ± 0.302
3.29PheAsn: 3.29 ± 0.621
1.212PhePro: 1.212 ± 0.296
0.952PheGln: 0.952 ± 0.234
0.866PheArg: 0.866 ± 0.229
2.424PheSer: 2.424 ± 0.435
2.511PheThr: 2.511 ± 0.528
1.385PheVal: 1.385 ± 0.313
0.26PheTrp: 0.26 ± 0.152
1.039PheTyr: 1.039 ± 0.349
0.0PheXaa: 0.0 ± 0.0
Gly
3.376GlyAla: 3.376 ± 0.842
0.26GlyCys: 0.26 ± 0.299
3.03GlyAsp: 3.03 ± 0.534
2.857GlyGlu: 2.857 ± 0.418
2.337GlyPhe: 2.337 ± 0.48
2.77GlyGly: 2.77 ± 0.6
1.731GlyHis: 1.731 ± 0.504
4.502GlyIle: 4.502 ± 0.756
6.406GlyLys: 6.406 ± 0.825
5.541GlyLeu: 5.541 ± 0.675
1.731GlyMet: 1.731 ± 0.362
2.857GlyAsn: 2.857 ± 0.525
0.779GlyPro: 0.779 ± 0.308
1.818GlyGln: 1.818 ± 0.365
1.905GlyArg: 1.905 ± 0.383
4.242GlySer: 4.242 ± 0.931
4.588GlyThr: 4.588 ± 0.759
3.809GlyVal: 3.809 ± 0.573
0.606GlyTrp: 0.606 ± 0.216
3.636GlyTyr: 3.636 ± 0.716
0.0GlyXaa: 0.0 ± 0.0
His
0.866HisAla: 0.866 ± 0.304
0.26HisCys: 0.26 ± 0.16
0.693HisAsp: 0.693 ± 0.248
1.125HisGlu: 1.125 ± 0.253
0.433HisPhe: 0.433 ± 0.251
1.299HisGly: 1.299 ± 0.38
0.433HisHis: 0.433 ± 0.267
1.039HisIle: 1.039 ± 0.294
1.731HisLys: 1.731 ± 0.379
1.212HisLeu: 1.212 ± 0.364
0.26HisMet: 0.26 ± 0.154
0.952HisAsn: 0.952 ± 0.34
0.779HisPro: 0.779 ± 0.304
0.693HisGln: 0.693 ± 0.246
0.26HisArg: 0.26 ± 0.14
1.385HisSer: 1.385 ± 0.357
1.212HisThr: 1.212 ± 0.334
0.952HisVal: 0.952 ± 0.339
0.26HisTrp: 0.26 ± 0.164
0.952HisTyr: 0.952 ± 0.354
0.0HisXaa: 0.0 ± 0.0
Ile
5.108IleAla: 5.108 ± 0.66
0.173IleCys: 0.173 ± 0.129
5.021IleAsp: 5.021 ± 0.609
4.329IleGlu: 4.329 ± 0.489
2.77IlePhe: 2.77 ± 0.496
4.069IleGly: 4.069 ± 0.465
0.779IleHis: 0.779 ± 0.226
3.203IleIle: 3.203 ± 0.735
5.714IleLys: 5.714 ± 0.693
3.982IleLeu: 3.982 ± 0.859
1.299IleMet: 1.299 ± 0.399
6.06IleAsn: 6.06 ± 0.688
2.078IlePro: 2.078 ± 0.622
2.597IleGln: 2.597 ± 0.552
3.203IleArg: 3.203 ± 0.501
6.406IleSer: 6.406 ± 1.073
5.021IleThr: 5.021 ± 0.753
3.636IleVal: 3.636 ± 0.534
0.433IleTrp: 0.433 ± 0.237
2.078IleTyr: 2.078 ± 0.479
0.0IleXaa: 0.0 ± 0.0
Lys
7.012LysAla: 7.012 ± 0.866
0.26LysCys: 0.26 ± 0.152
5.108LysAsp: 5.108 ± 0.441
6.58LysGlu: 6.58 ± 0.872
2.943LysPhe: 2.943 ± 0.628
4.761LysGly: 4.761 ± 0.839
1.731LysHis: 1.731 ± 0.447
5.887LysIle: 5.887 ± 0.732
8.571LysLys: 8.571 ± 1.269
7.272LysLeu: 7.272 ± 1.069
1.818LysMet: 1.818 ± 0.343
4.502LysAsn: 4.502 ± 0.67
3.203LysPro: 3.203 ± 0.67
5.454LysGln: 5.454 ± 0.938
3.636LysArg: 3.636 ± 0.665
7.099LysSer: 7.099 ± 1.226
6.753LysThr: 6.753 ± 0.689
6.58LysVal: 6.58 ± 0.89
1.125LysTrp: 1.125 ± 0.232
3.723LysTyr: 3.723 ± 0.719
0.0LysXaa: 0.0 ± 0.0
Leu
5.194LeuAla: 5.194 ± 0.728
0.26LeuCys: 0.26 ± 0.139
6.493LeuAsp: 6.493 ± 0.803
4.502LeuGlu: 4.502 ± 0.843
2.684LeuPhe: 2.684 ± 0.677
4.935LeuGly: 4.935 ± 0.719
0.693LeuHis: 0.693 ± 0.241
5.368LeuIle: 5.368 ± 0.757
8.398LeuLys: 8.398 ± 0.778
6.753LeuLeu: 6.753 ± 0.841
1.818LeuMet: 1.818 ± 0.467
5.887LeuAsn: 5.887 ± 0.894
3.203LeuPro: 3.203 ± 0.437
3.463LeuGln: 3.463 ± 0.561
3.376LeuArg: 3.376 ± 0.593
4.935LeuSer: 4.935 ± 0.729
4.588LeuThr: 4.588 ± 0.547
3.636LeuVal: 3.636 ± 0.46
0.606LeuTrp: 0.606 ± 0.208
2.943LeuTyr: 2.943 ± 0.422
0.0LeuXaa: 0.0 ± 0.0
Met
1.818MetAla: 1.818 ± 0.418
0.26MetCys: 0.26 ± 0.155
1.645MetAsp: 1.645 ± 0.374
1.039MetGlu: 1.039 ± 0.237
0.952MetPhe: 0.952 ± 0.287
1.212MetGly: 1.212 ± 0.308
0.346MetHis: 0.346 ± 0.186
1.039MetIle: 1.039 ± 0.401
2.943MetLys: 2.943 ± 0.643
1.818MetLeu: 1.818 ± 0.41
1.212MetMet: 1.212 ± 0.348
1.472MetAsn: 1.472 ± 0.354
0.779MetPro: 0.779 ± 0.222
1.558MetGln: 1.558 ± 0.594
1.125MetArg: 1.125 ± 0.321
1.472MetSer: 1.472 ± 0.335
1.645MetThr: 1.645 ± 0.34
1.125MetVal: 1.125 ± 0.32
0.346MetTrp: 0.346 ± 0.162
0.693MetTyr: 0.693 ± 0.279
0.0MetXaa: 0.0 ± 0.0
Asn
4.155AsnAla: 4.155 ± 0.743
0.346AsnCys: 0.346 ± 0.168
4.935AsnAsp: 4.935 ± 0.639
3.376AsnGlu: 3.376 ± 0.657
2.251AsnPhe: 2.251 ± 0.535
4.761AsnGly: 4.761 ± 0.658
1.212AsnHis: 1.212 ± 0.269
4.502AsnIle: 4.502 ± 0.719
5.887AsnLys: 5.887 ± 0.9
5.541AsnLeu: 5.541 ± 0.585
1.818AsnMet: 1.818 ± 0.321
5.454AsnAsn: 5.454 ± 0.753
2.943AsnPro: 2.943 ± 0.531
3.117AsnGln: 3.117 ± 0.371
2.164AsnArg: 2.164 ± 0.415
3.896AsnSer: 3.896 ± 0.678
2.77AsnThr: 2.77 ± 0.379
2.511AsnVal: 2.511 ± 0.448
0.866AsnTrp: 0.866 ± 0.281
1.905AsnTyr: 1.905 ± 0.377
0.0AsnXaa: 0.0 ± 0.0
Pro
1.905ProAla: 1.905 ± 0.288
0.087ProCys: 0.087 ± 0.085
2.164ProAsp: 2.164 ± 0.649
1.385ProGlu: 1.385 ± 0.436
1.125ProPhe: 1.125 ± 0.28
0.952ProGly: 0.952 ± 0.305
0.693ProHis: 0.693 ± 0.246
1.991ProIle: 1.991 ± 0.396
3.463ProLys: 3.463 ± 0.831
2.251ProLeu: 2.251 ± 0.493
0.346ProMet: 0.346 ± 0.148
2.164ProAsn: 2.164 ± 0.452
0.693ProPro: 0.693 ± 0.295
1.385ProGln: 1.385 ± 0.341
1.125ProArg: 1.125 ± 0.293
2.251ProSer: 2.251 ± 0.582
2.511ProThr: 2.511 ± 0.51
1.299ProVal: 1.299 ± 0.354
0.346ProTrp: 0.346 ± 0.182
1.472ProTyr: 1.472 ± 0.344
0.0ProXaa: 0.0 ± 0.0
Gln
2.77GlnAla: 2.77 ± 0.539
0.26GlnCys: 0.26 ± 0.172
2.164GlnAsp: 2.164 ± 0.432
2.424GlnGlu: 2.424 ± 0.492
1.385GlnPhe: 1.385 ± 0.389
2.857GlnGly: 2.857 ± 0.417
0.779GlnHis: 0.779 ± 0.206
4.415GlnIle: 4.415 ± 0.52
3.723GlnLys: 3.723 ± 0.548
4.502GlnLeu: 4.502 ± 0.542
1.385GlnMet: 1.385 ± 0.295
3.376GlnAsn: 3.376 ± 0.61
1.212GlnPro: 1.212 ± 0.317
3.29GlnGln: 3.29 ± 0.674
1.385GlnArg: 1.385 ± 0.327
2.857GlnSer: 2.857 ± 0.627
2.251GlnThr: 2.251 ± 0.46
2.251GlnVal: 2.251 ± 0.38
0.866GlnTrp: 0.866 ± 0.348
1.818GlnTyr: 1.818 ± 0.318
0.0GlnXaa: 0.0 ± 0.0
Arg
2.511ArgAla: 2.511 ± 0.444
0.173ArgCys: 0.173 ± 0.13
3.549ArgAsp: 3.549 ± 0.563
1.645ArgGlu: 1.645 ± 0.424
1.905ArgPhe: 1.905 ± 0.487
1.818ArgGly: 1.818 ± 0.295
0.433ArgHis: 0.433 ± 0.229
2.511ArgIle: 2.511 ± 0.318
3.809ArgLys: 3.809 ± 0.702
3.29ArgLeu: 3.29 ± 0.545
0.866ArgMet: 0.866 ± 0.262
1.558ArgAsn: 1.558 ± 0.258
0.866ArgPro: 0.866 ± 0.303
2.078ArgGln: 2.078 ± 0.427
0.952ArgArg: 0.952 ± 0.299
3.203ArgSer: 3.203 ± 0.617
1.212ArgThr: 1.212 ± 0.321
2.164ArgVal: 2.164 ± 0.359
0.346ArgTrp: 0.346 ± 0.187
1.212ArgTyr: 1.212 ± 0.314
0.0ArgXaa: 0.0 ± 0.0
Ser
6.147SerAla: 6.147 ± 1.221
0.173SerCys: 0.173 ± 0.133
3.982SerAsp: 3.982 ± 0.547
3.809SerGlu: 3.809 ± 0.464
2.511SerPhe: 2.511 ± 0.437
5.281SerGly: 5.281 ± 1.086
0.952SerHis: 0.952 ± 0.388
4.848SerIle: 4.848 ± 0.792
5.8SerLys: 5.8 ± 0.735
5.021SerLeu: 5.021 ± 0.713
1.645SerMet: 1.645 ± 0.56
5.714SerAsn: 5.714 ± 0.684
1.818SerPro: 1.818 ± 0.348
2.684SerGln: 2.684 ± 0.473
2.597SerArg: 2.597 ± 0.382
7.618SerSer: 7.618 ± 1.402
4.502SerThr: 4.502 ± 1.22
3.549SerVal: 3.549 ± 0.621
1.212SerTrp: 1.212 ± 0.264
3.203SerTyr: 3.203 ± 0.539
0.0SerXaa: 0.0 ± 0.0
Thr
6.147ThrAla: 6.147 ± 1.194
0.26ThrCys: 0.26 ± 0.211
4.155ThrAsp: 4.155 ± 0.677
3.117ThrGlu: 3.117 ± 0.593
2.337ThrPhe: 2.337 ± 0.557
3.982ThrGly: 3.982 ± 0.659
1.299ThrHis: 1.299 ± 0.363
5.021ThrIle: 5.021 ± 0.584
4.675ThrLys: 4.675 ± 0.704
4.675ThrLeu: 4.675 ± 0.532
1.472ThrMet: 1.472 ± 0.358
2.943ThrAsn: 2.943 ± 0.452
2.337ThrPro: 2.337 ± 0.608
2.77ThrGln: 2.77 ± 0.465
2.337ThrArg: 2.337 ± 0.49
3.463ThrSer: 3.463 ± 0.526
3.723ThrThr: 3.723 ± 0.671
4.502ThrVal: 4.502 ± 0.561
1.039ThrTrp: 1.039 ± 0.433
1.991ThrTyr: 1.991 ± 0.566
0.0ThrXaa: 0.0 ± 0.0
Val
4.935ValAla: 4.935 ± 0.801
0.087ValCys: 0.087 ± 0.1
4.502ValAsp: 4.502 ± 0.679
3.117ValGlu: 3.117 ± 0.528
1.905ValPhe: 1.905 ± 0.475
2.943ValGly: 2.943 ± 0.526
1.125ValHis: 1.125 ± 0.381
3.29ValIle: 3.29 ± 0.555
5.021ValLys: 5.021 ± 0.75
3.463ValLeu: 3.463 ± 0.526
1.818ValMet: 1.818 ± 0.397
3.636ValAsn: 3.636 ± 0.496
1.212ValPro: 1.212 ± 0.267
2.164ValGln: 2.164 ± 0.358
1.472ValArg: 1.472 ± 0.348
4.675ValSer: 4.675 ± 0.563
4.069ValThr: 4.069 ± 0.615
3.809ValVal: 3.809 ± 0.608
0.346ValTrp: 0.346 ± 0.173
1.905ValTyr: 1.905 ± 0.379
0.0ValXaa: 0.0 ± 0.0
Trp
0.866TrpAla: 0.866 ± 0.25
0.0TrpCys: 0.0 ± 0.0
0.606TrpAsp: 0.606 ± 0.239
0.866TrpGlu: 0.866 ± 0.283
0.346TrpPhe: 0.346 ± 0.167
0.866TrpGly: 0.866 ± 0.337
0.173TrpHis: 0.173 ± 0.129
0.693TrpIle: 0.693 ± 0.251
1.125TrpLys: 1.125 ± 0.275
1.299TrpLeu: 1.299 ± 0.296
0.087TrpMet: 0.087 ± 0.106
0.693TrpAsn: 0.693 ± 0.279
0.26TrpPro: 0.26 ± 0.152
0.779TrpGln: 0.779 ± 0.272
0.346TrpArg: 0.346 ± 0.144
0.779TrpSer: 0.779 ± 0.185
0.866TrpThr: 0.866 ± 0.266
0.866TrpVal: 0.866 ± 0.259
0.087TrpTrp: 0.087 ± 0.093
0.693TrpTyr: 0.693 ± 0.333
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.03TyrAla: 3.03 ± 0.58
0.866TyrCys: 0.866 ± 0.377
2.597TyrAsp: 2.597 ± 0.498
1.991TyrGlu: 1.991 ± 0.487
1.731TyrPhe: 1.731 ± 0.356
2.337TyrGly: 2.337 ± 0.581
0.693TyrHis: 0.693 ± 0.226
1.905TyrIle: 1.905 ± 0.542
3.29TyrLys: 3.29 ± 0.813
3.463TyrLeu: 3.463 ± 0.611
1.039TyrMet: 1.039 ± 0.3
1.818TyrAsn: 1.818 ± 0.39
1.558TyrPro: 1.558 ± 0.422
2.164TyrGln: 2.164 ± 0.421
1.731TyrArg: 1.731 ± 0.396
3.29TyrSer: 3.29 ± 0.534
1.818TyrThr: 1.818 ± 0.44
2.424TyrVal: 2.424 ± 0.453
0.433TyrTrp: 0.433 ± 0.225
1.125TyrTyr: 1.125 ± 0.443
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 50 proteins (11552 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski