Amino acid dipeptide frequency for Lactococcus phage 66901

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
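As a rough illustration of what such a calculation could look like, here is a minimal Python sketch. The function name `dipeptide_per_mille` and the toy sequences are hypothetical, and the normalisation (dividing by the total number of counted pairs) is an assumption; the exact procedure used for this table is not stated on this page. The sketch uses one-letter residue codes, whereas the table uses three-letter codes.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Pairs are overlapping and are counted within each protein only,
    so no pair spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of counted dipeptides.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not taken from this proteome.
toy_proteins = ["MKKLA", "AKK"]
freqs = dipeptide_per_mille(toy_proteins)
```

Here "MKKLA" contributes the pairs MK, KK, KL and LA, and "AKK" contributes AK and KK, so KK accounts for 2 of the 6 counted pairs.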

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
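The arithmetic behind the expected value, and a simple over/under comparison against it, can be sketched as follows. The `classify` helper is hypothetical, and comparing against the uniform 1/400 expectation is an assumption about how the colouring is decided; the two example values are taken from the table (GluLeu and AlaAla).

```python
N_STANDARD = 20  # standard amino acids; Xaa would make it 21

# Uniform expectation for one of the 20 * 20 = 400 dipeptides.
expected_fraction = 1 / N_STANDARD ** 2
expected_percent = 100 * expected_fraction
expected_per_mille = 1000 * expected_fraction


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over"   # would be shown in red
    if observed_per_mille < expected:
        return "under"  # would be shown in blue
    return "as expected"


glu_leu = classify(9.432)  # GluLeu value from the table
ala_ala = classify(0.597)  # AlaAla value from the table
```

Note that the table values are in per mille, so they have to be compared with 2.5‰ rather than with 0.25.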

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
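For readers who prefer fractions or percentages, the unit conversion is a one-liner. The helper names below are hypothetical; the example value is the AlaLeu entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent."""
    return value / 10.0


ala_leu_fraction = per_mille_to_fraction(6.208)  # AlaLeu from the table
ala_leu_percent = per_mille_to_percent(6.208)
```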

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
0.597AlaAla: 0.597 ± 0.375
0.358AlaCys: 0.358 ± 0.28
2.746AlaAsp: 2.746 ± 0.585
5.253AlaGlu: 5.253 ± 0.981
4.179AlaPhe: 4.179 ± 1.282
4.537AlaGly: 4.537 ± 1.047
0.597AlaHis: 0.597 ± 0.298
4.298AlaIle: 4.298 ± 0.75
5.731AlaLys: 5.731 ± 0.959
6.208AlaLeu: 6.208 ± 0.875
2.507AlaMet: 2.507 ± 0.698
3.82AlaAsn: 3.82 ± 0.905
0.716AlaPro: 0.716 ± 0.309
2.03AlaGln: 2.03 ± 0.543
1.791AlaArg: 1.791 ± 0.478
3.582AlaSer: 3.582 ± 0.8
3.701AlaThr: 3.701 ± 0.918
4.059AlaVal: 4.059 ± 1.087
1.671AlaTrp: 1.671 ± 0.722
2.149AlaTyr: 2.149 ± 0.48
0.0AlaXaa: 0.0 ± 0.0
Cys
0.119CysAla: 0.119 ± 0.125
0.239CysCys: 0.239 ± 0.182
0.358CysAsp: 0.358 ± 0.196
0.239CysGlu: 0.239 ± 0.189
0.239CysPhe: 0.239 ± 0.167
0.597CysGly: 0.597 ± 0.33
0.358CysHis: 0.358 ± 0.244
0.358CysIle: 0.358 ± 0.199
0.836CysLys: 0.836 ± 0.396
0.716CysLeu: 0.716 ± 0.338
0.119CysMet: 0.119 ± 0.126
0.597CysAsn: 0.597 ± 0.286
0.119CysPro: 0.119 ± 0.125
0.239CysGln: 0.239 ± 0.17
0.597CysArg: 0.597 ± 0.268
0.119CysSer: 0.119 ± 0.119
0.358CysThr: 0.358 ± 0.192
0.358CysVal: 0.358 ± 0.23
0.119CysTrp: 0.119 ± 0.125
0.239CysTyr: 0.239 ± 0.173
0.0CysXaa: 0.0 ± 0.0
Asp
1.552AspAla: 1.552 ± 0.52
0.239AspCys: 0.239 ± 0.169
3.223AspAsp: 3.223 ± 0.741
3.701AspGlu: 3.701 ± 0.8
3.94AspPhe: 3.94 ± 0.607
3.701AspGly: 3.701 ± 0.687
0.478AspHis: 0.478 ± 0.266
4.059AspIle: 4.059 ± 0.662
5.492AspLys: 5.492 ± 0.689
5.969AspLeu: 5.969 ± 0.986
0.955AspMet: 0.955 ± 0.323
4.417AspAsn: 4.417 ± 0.833
1.552AspPro: 1.552 ± 0.473
0.597AspGln: 0.597 ± 0.286
1.91AspArg: 1.91 ± 0.626
2.865AspSer: 2.865 ± 0.577
3.582AspThr: 3.582 ± 0.712
3.462AspVal: 3.462 ± 0.686
0.955AspTrp: 0.955 ± 0.305
2.865AspTyr: 2.865 ± 0.635
0.0AspXaa: 0.0 ± 0.0
Glu
3.582GluAla: 3.582 ± 0.814
0.597GluCys: 0.597 ± 0.33
2.985GluAsp: 2.985 ± 0.483
4.656GluGlu: 4.656 ± 0.806
3.582GluPhe: 3.582 ± 0.53
2.865GluGly: 2.865 ± 0.548
1.313GluHis: 1.313 ± 0.405
6.328GluIle: 6.328 ± 0.851
5.85GluLys: 5.85 ± 1.28
9.432GluLeu: 9.432 ± 1.703
2.985GluMet: 2.985 ± 0.625
4.776GluAsn: 4.776 ± 0.832
1.074GluPro: 1.074 ± 0.357
3.462GluGln: 3.462 ± 0.739
2.507GluArg: 2.507 ± 0.553
3.462GluSer: 3.462 ± 0.729
4.895GluThr: 4.895 ± 0.819
3.82GluVal: 3.82 ± 0.634
1.433GluTrp: 1.433 ± 0.414
2.865GluTyr: 2.865 ± 0.753
0.0GluXaa: 0.0 ± 0.0
Phe
3.223PheAla: 3.223 ± 0.899
0.239PheCys: 0.239 ± 0.159
3.462PheAsp: 3.462 ± 0.69
2.627PheGlu: 2.627 ± 0.558
1.91PhePhe: 1.91 ± 0.768
1.91PheGly: 1.91 ± 0.521
0.358PheHis: 0.358 ± 0.2
3.582PheIle: 3.582 ± 0.722
4.059PheLys: 4.059 ± 0.678
2.985PheLeu: 2.985 ± 0.535
1.194PheMet: 1.194 ± 0.32
2.985PheAsn: 2.985 ± 0.727
0.716PhePro: 0.716 ± 0.311
1.194PheGln: 1.194 ± 0.443
1.194PheArg: 1.194 ± 0.286
4.059PheSer: 4.059 ± 0.984
3.223PheThr: 3.223 ± 0.518
2.388PheVal: 2.388 ± 0.467
0.358PheTrp: 0.358 ± 0.196
1.91PheTyr: 1.91 ± 0.38
0.0PheXaa: 0.0 ± 0.0
Gly
4.298GlyAla: 4.298 ± 1.224
0.239GlyCys: 0.239 ± 0.148
2.865GlyAsp: 2.865 ± 0.727
4.298GlyGlu: 4.298 ± 0.813
2.268GlyPhe: 2.268 ± 0.633
4.656GlyGly: 4.656 ± 1.026
0.836GlyHis: 0.836 ± 0.326
3.94GlyIle: 3.94 ± 1.606
6.566GlyLys: 6.566 ± 0.742
6.208GlyLeu: 6.208 ± 1.287
1.074GlyMet: 1.074 ± 0.367
2.985GlyAsn: 2.985 ± 0.658
0.239GlyPro: 0.239 ± 0.177
2.03GlyGln: 2.03 ± 0.454
2.03GlyArg: 2.03 ± 0.497
4.895GlySer: 4.895 ± 1.062
3.82GlyThr: 3.82 ± 0.637
5.611GlyVal: 5.611 ± 1.203
1.313GlyTrp: 1.313 ± 0.384
3.223GlyTyr: 3.223 ± 0.592
0.0GlyXaa: 0.0 ± 0.0
His
0.836HisAla: 0.836 ± 0.324
0.478HisCys: 0.478 ± 0.313
0.836HisAsp: 0.836 ± 0.365
0.358HisGlu: 0.358 ± 0.24
0.358HisPhe: 0.358 ± 0.213
1.313HisGly: 1.313 ± 0.426
0.0HisHis: 0.0 ± 0.0
0.955HisIle: 0.955 ± 0.299
0.597HisLys: 0.597 ± 0.264
1.313HisLeu: 1.313 ± 0.426
0.0HisMet: 0.0 ± 0.0
1.671HisAsn: 1.671 ± 0.491
0.119HisPro: 0.119 ± 0.126
0.358HisGln: 0.358 ± 0.202
0.119HisArg: 0.119 ± 0.131
0.239HisSer: 0.239 ± 0.168
0.955HisThr: 0.955 ± 0.321
1.194HisVal: 1.194 ± 0.559
0.239HisTrp: 0.239 ± 0.171
0.597HisTyr: 0.597 ± 0.283
0.0HisXaa: 0.0 ± 0.0
Ile
4.417IleAla: 4.417 ± 0.689
0.239IleCys: 0.239 ± 0.168
4.059IleAsp: 4.059 ± 0.638
6.089IleGlu: 6.089 ± 1.067
3.104IlePhe: 3.104 ± 0.619
3.462IleGly: 3.462 ± 0.947
0.836IleHis: 0.836 ± 0.331
4.417IleIle: 4.417 ± 0.548
6.805IleLys: 6.805 ± 0.908
5.85IleLeu: 5.85 ± 1.102
0.955IleMet: 0.955 ± 0.346
5.372IleAsn: 5.372 ± 0.565
2.03IlePro: 2.03 ± 0.596
2.746IleGln: 2.746 ± 0.557
2.268IleArg: 2.268 ± 0.457
4.417IleSer: 4.417 ± 0.964
4.895IleThr: 4.895 ± 0.595
4.656IleVal: 4.656 ± 0.742
1.194IleTrp: 1.194 ± 0.478
2.388IleTyr: 2.388 ± 0.407
0.0IleXaa: 0.0 ± 0.0
Lys
6.208LysAla: 6.208 ± 1.114
0.239LysCys: 0.239 ± 0.179
4.776LysAsp: 4.776 ± 0.781
7.76LysGlu: 7.76 ± 1.443
1.791LysPhe: 1.791 ± 0.416
5.731LysGly: 5.731 ± 1.126
1.194LysHis: 1.194 ± 0.465
5.969LysIle: 5.969 ± 0.87
9.312LysLys: 9.312 ± 1.415
7.163LysLeu: 7.163 ± 0.776
2.627LysMet: 2.627 ± 0.51
5.372LysAsn: 5.372 ± 0.702
2.746LysPro: 2.746 ± 0.84
3.701LysGln: 3.701 ± 0.849
3.582LysArg: 3.582 ± 0.72
4.656LysSer: 4.656 ± 1.133
5.731LysThr: 5.731 ± 0.871
6.686LysVal: 6.686 ± 0.924
1.194LysTrp: 1.194 ± 0.319
3.462LysTyr: 3.462 ± 0.781
0.0LysXaa: 0.0 ± 0.0
Leu
6.089LeuAla: 6.089 ± 0.897
0.478LeuCys: 0.478 ± 0.246
4.895LeuAsp: 4.895 ± 0.734
5.85LeuGlu: 5.85 ± 0.923
3.94LeuPhe: 3.94 ± 0.703
4.656LeuGly: 4.656 ± 1.021
1.313LeuHis: 1.313 ± 0.537
7.521LeuIle: 7.521 ± 1.019
8.118LeuLys: 8.118 ± 1.193
6.208LeuLeu: 6.208 ± 1.303
1.433LeuMet: 1.433 ± 0.473
5.492LeuAsn: 5.492 ± 0.987
3.343LeuPro: 3.343 ± 0.683
3.343LeuGln: 3.343 ± 0.561
2.388LeuArg: 2.388 ± 0.564
5.014LeuSer: 5.014 ± 0.738
6.208LeuThr: 6.208 ± 0.748
5.492LeuVal: 5.492 ± 0.698
1.433LeuTrp: 1.433 ± 0.444
5.014LeuTyr: 5.014 ± 0.977
0.0LeuXaa: 0.0 ± 0.0
Met
1.671MetAla: 1.671 ± 0.649
0.119MetCys: 0.119 ± 0.13
1.313MetAsp: 1.313 ± 0.399
2.149MetGlu: 2.149 ± 0.586
0.478MetPhe: 0.478 ± 0.289
0.955MetGly: 0.955 ± 0.301
0.358MetHis: 0.358 ± 0.215
2.149MetIle: 2.149 ± 0.587
2.388MetLys: 2.388 ± 0.537
1.91MetLeu: 1.91 ± 0.455
0.358MetMet: 0.358 ± 0.179
1.791MetAsn: 1.791 ± 0.641
0.358MetPro: 0.358 ± 0.211
1.433MetGln: 1.433 ± 0.39
0.239MetArg: 0.239 ± 0.191
1.791MetSer: 1.791 ± 0.419
1.313MetThr: 1.313 ± 0.483
1.671MetVal: 1.671 ± 0.366
0.119MetTrp: 0.119 ± 0.126
1.194MetTyr: 1.194 ± 0.367
0.0MetXaa: 0.0 ± 0.0
Asn
5.372AsnAla: 5.372 ± 1.118
0.358AsnCys: 0.358 ± 0.214
3.94AsnAsp: 3.94 ± 0.851
5.253AsnGlu: 5.253 ± 0.783
1.91AsnPhe: 1.91 ± 0.57
6.328AsnGly: 6.328 ± 0.834
0.836AsnHis: 0.836 ± 0.262
4.776AsnIle: 4.776 ± 0.767
5.85AsnLys: 5.85 ± 1.231
5.85AsnLeu: 5.85 ± 0.898
1.552AsnMet: 1.552 ± 0.453
3.701AsnAsn: 3.701 ± 0.678
2.268AsnPro: 2.268 ± 0.607
2.03AsnGln: 2.03 ± 0.555
1.91AsnArg: 1.91 ± 0.425
4.298AsnSer: 4.298 ± 0.585
4.417AsnThr: 4.417 ± 0.77
3.582AsnVal: 3.582 ± 0.716
1.433AsnTrp: 1.433 ± 0.39
2.507AsnTyr: 2.507 ± 0.642
0.0AsnXaa: 0.0 ± 0.0
Pro
1.194ProAla: 1.194 ± 0.435
0.119ProCys: 0.119 ± 0.125
1.91ProAsp: 1.91 ± 0.464
1.433ProGlu: 1.433 ± 0.474
1.433ProPhe: 1.433 ± 0.409
0.239ProGly: 0.239 ± 0.164
0.119ProHis: 0.119 ± 0.113
1.91ProIle: 1.91 ± 0.632
2.149ProLys: 2.149 ± 0.514
2.268ProLeu: 2.268 ± 0.574
0.597ProMet: 0.597 ± 0.25
1.91ProAsn: 1.91 ± 0.688
0.597ProPro: 0.597 ± 0.278
0.478ProGln: 0.478 ± 0.258
0.358ProArg: 0.358 ± 0.202
1.194ProSer: 1.194 ± 0.469
2.507ProThr: 2.507 ± 0.656
1.791ProVal: 1.791 ± 0.414
0.119ProTrp: 0.119 ± 0.111
0.955ProTyr: 0.955 ± 0.347
0.0ProXaa: 0.0 ± 0.0
Gln
3.104GlnAla: 3.104 ± 0.626
0.0GlnCys: 0.0 ± 0.0
2.268GlnAsp: 2.268 ± 0.676
2.149GlnGlu: 2.149 ± 0.528
1.194GlnPhe: 1.194 ± 0.402
2.746GlnGly: 2.746 ± 0.625
0.478GlnHis: 0.478 ± 0.224
1.313GlnIle: 1.313 ± 0.375
2.746GlnLys: 2.746 ± 0.601
3.343GlnLeu: 3.343 ± 0.727
1.194GlnMet: 1.194 ± 0.299
2.388GlnAsn: 2.388 ± 0.432
1.313GlnPro: 1.313 ± 0.414
1.313GlnGln: 1.313 ± 0.469
1.433GlnArg: 1.433 ± 0.471
2.149GlnSer: 2.149 ± 0.373
2.746GlnThr: 2.746 ± 0.483
2.388GlnVal: 2.388 ± 0.571
0.597GlnTrp: 0.597 ± 0.24
1.074GlnTyr: 1.074 ± 0.315
0.0GlnXaa: 0.0 ± 0.0
Arg
1.791ArgAla: 1.791 ± 0.749
0.358ArgCys: 0.358 ± 0.198
1.194ArgAsp: 1.194 ± 0.41
2.149ArgGlu: 2.149 ± 0.461
0.955ArgPhe: 0.955 ± 0.304
2.149ArgGly: 2.149 ± 0.48
0.955ArgHis: 0.955 ± 0.336
1.91ArgIle: 1.91 ± 0.405
4.179ArgLys: 4.179 ± 0.869
3.701ArgLeu: 3.701 ± 0.668
0.597ArgMet: 0.597 ± 0.382
2.746ArgAsn: 2.746 ± 0.624
0.597ArgPro: 0.597 ± 0.262
1.433ArgGln: 1.433 ± 0.498
1.433ArgArg: 1.433 ± 0.379
2.149ArgSer: 2.149 ± 0.474
2.03ArgThr: 2.03 ± 0.457
1.791ArgVal: 1.791 ± 0.537
0.239ArgTrp: 0.239 ± 0.163
1.91ArgTyr: 1.91 ± 0.587
0.0ArgXaa: 0.0 ± 0.0
Ser
5.014SerAla: 5.014 ± 1.504
0.955SerCys: 0.955 ± 0.421
3.223SerAsp: 3.223 ± 0.563
3.701SerGlu: 3.701 ± 0.553
2.985SerPhe: 2.985 ± 0.635
5.731SerGly: 5.731 ± 1.373
0.597SerHis: 0.597 ± 0.271
4.298SerIle: 4.298 ± 0.818
4.417SerLys: 4.417 ± 0.821
5.492SerLeu: 5.492 ± 0.889
1.671SerMet: 1.671 ± 0.401
4.059SerAsn: 4.059 ± 0.704
0.836SerPro: 0.836 ± 0.305
2.268SerGln: 2.268 ± 0.492
2.865SerArg: 2.865 ± 0.475
4.656SerSer: 4.656 ± 0.759
3.223SerThr: 3.223 ± 0.707
3.343SerVal: 3.343 ± 0.699
0.358SerTrp: 0.358 ± 0.194
2.627SerTyr: 2.627 ± 0.663
0.0SerXaa: 0.0 ± 0.0
Thr
5.134ThrAla: 5.134 ± 0.725
0.119ThrCys: 0.119 ± 0.136
3.94ThrAsp: 3.94 ± 0.87
5.85ThrGlu: 5.85 ± 0.656
2.865ThrPhe: 2.865 ± 0.586
4.417ThrGly: 4.417 ± 0.635
0.358ThrHis: 0.358 ± 0.211
3.701ThrIle: 3.701 ± 0.69
4.895ThrLys: 4.895 ± 0.793
5.492ThrLeu: 5.492 ± 0.767
1.313ThrMet: 1.313 ± 0.345
4.895ThrAsn: 4.895 ± 0.755
1.552ThrPro: 1.552 ± 0.351
2.627ThrGln: 2.627 ± 0.536
2.268ThrArg: 2.268 ± 0.415
5.134ThrSer: 5.134 ± 0.635
4.656ThrThr: 4.656 ± 0.996
4.537ThrVal: 4.537 ± 1.037
1.194ThrTrp: 1.194 ± 0.362
2.149ThrTyr: 2.149 ± 0.502
0.0ThrXaa: 0.0 ± 0.0
Val
4.179ValAla: 4.179 ± 0.709
0.716ValCys: 0.716 ± 0.338
4.179ValAsp: 4.179 ± 0.85
4.298ValGlu: 4.298 ± 0.696
3.223ValPhe: 3.223 ± 0.568
3.462ValGly: 3.462 ± 0.509
0.478ValHis: 0.478 ± 0.201
4.298ValIle: 4.298 ± 0.513
6.208ValLys: 6.208 ± 0.925
3.94ValLeu: 3.94 ± 0.656
1.791ValMet: 1.791 ± 0.461
3.462ValAsn: 3.462 ± 0.73
1.671ValPro: 1.671 ± 0.5
2.268ValGln: 2.268 ± 0.523
3.223ValArg: 3.223 ± 0.739
4.298ValSer: 4.298 ± 1.085
5.014ValThr: 5.014 ± 0.831
3.582ValVal: 3.582 ± 1.007
0.478ValTrp: 0.478 ± 0.227
2.865ValTyr: 2.865 ± 0.685
0.0ValXaa: 0.0 ± 0.0
Trp
0.836TrpAla: 0.836 ± 0.275
0.358TrpCys: 0.358 ± 0.216
1.194TrpAsp: 1.194 ± 0.423
0.955TrpGlu: 0.955 ± 0.356
0.836TrpPhe: 0.836 ± 0.352
0.836TrpGly: 0.836 ± 0.296
0.119TrpHis: 0.119 ± 0.155
0.716TrpIle: 0.716 ± 0.307
0.836TrpLys: 0.836 ± 0.307
0.836TrpLeu: 0.836 ± 0.393
0.239TrpMet: 0.239 ± 0.159
1.552TrpAsn: 1.552 ± 0.431
0.0TrpPro: 0.0 ± 0.0
0.955TrpGln: 0.955 ± 0.344
0.597TrpArg: 0.597 ± 0.319
1.074TrpSer: 1.074 ± 0.34
0.836TrpThr: 0.836 ± 0.267
0.716TrpVal: 0.716 ± 0.337
0.239TrpTrp: 0.239 ± 0.139
1.194TrpTyr: 1.194 ± 0.361
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.313TyrAla: 1.313 ± 0.479
0.478TyrCys: 0.478 ± 0.316
2.149TyrAsp: 2.149 ± 0.668
3.701TyrGlu: 3.701 ± 0.785
2.388TyrPhe: 2.388 ± 0.45
3.104TyrGly: 3.104 ± 0.583
0.955TyrHis: 0.955 ± 0.316
3.701TyrIle: 3.701 ± 0.722
2.865TyrLys: 2.865 ± 0.716
3.343TyrLeu: 3.343 ± 0.814
0.478TyrMet: 0.478 ± 0.26
4.298TyrAsn: 4.298 ± 0.598
1.433TyrPro: 1.433 ± 0.443
1.433TyrGln: 1.433 ± 0.437
1.552TyrArg: 1.552 ± 0.427
2.268TyrSer: 2.268 ± 0.591
2.985TyrThr: 2.985 ± 0.78
2.507TyrVal: 2.507 ± 0.551
0.239TyrTrp: 0.239 ± 0.202
2.149TyrTyr: 2.149 ± 0.521
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (8377 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
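One way to picture a protein-level bootstrap is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. This is only a sketch under assumptions: the function name `bootstrap_sd`, the use of the standard deviation as the reported error, the fixed seed and the toy sequences are all hypothetical, and the database's actual implementation may differ.

```python
import random
import statistics
from collections import Counter


def bootstrap_sd(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of a dipeptide frequency (per mille) over
    n_rounds bootstrap resamples drawn at the protein level."""
    rng = random.Random(seed)
    # Count overlapping pairs once per protein, then resample the counts.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1))
        for seq in proteins
    ]
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Hypothetical toy sequences (one-letter codes), not from this proteome.
toy = ["MKKLA", "AKKG", "LLKK"]
sd_kk = bootstrap_sd(toy, "KK")
sd_ww = bootstrap_sd(toy, "WW")  # never observed in the toy set
```

A dipeptide that never occurs in any protein has the same estimate (zero) in every resample, so its spread is zero, which is consistent with the "0.0 ± 0.0" entries in the table.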

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski