Amino acid dipepetide frequency for Lactococcus phage CHPC1183

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.328AlaAla: 0.328 ± 0.182
0.164AlaCys: 0.164 ± 0.169
4.1AlaAsp: 4.1 ± 0.681
5.248AlaGlu: 5.248 ± 1.209
2.46AlaPhe: 2.46 ± 0.534
2.624AlaGly: 2.624 ± 0.851
0.164AlaHis: 0.164 ± 0.159
4.428AlaIle: 4.428 ± 0.884
6.232AlaLys: 6.232 ± 1.113
6.888AlaLeu: 6.888 ± 1.23
1.968AlaMet: 1.968 ± 0.534
5.084AlaAsn: 5.084 ± 0.854
1.64AlaPro: 1.64 ± 0.498
3.116AlaGln: 3.116 ± 0.971
1.476AlaArg: 1.476 ± 0.482
3.772AlaSer: 3.772 ± 0.784
3.608AlaThr: 3.608 ± 0.994
4.1AlaVal: 4.1 ± 1.3
0.82AlaTrp: 0.82 ± 0.357
3.772AlaTyr: 3.772 ± 0.907
0.0AlaXaa: 0.0 ± 0.0
Cys
0.492CysAla: 0.492 ± 0.279
0.328CysCys: 0.328 ± 0.238
0.328CysAsp: 0.328 ± 0.245
0.328CysGlu: 0.328 ± 0.229
0.328CysPhe: 0.328 ± 0.238
0.492CysGly: 0.492 ± 0.269
0.328CysHis: 0.328 ± 0.358
0.164CysIle: 0.164 ± 0.166
1.312CysLys: 1.312 ± 0.481
0.984CysLeu: 0.984 ± 0.539
0.0CysMet: 0.0 ± 0.0
0.492CysAsn: 0.492 ± 0.366
0.328CysPro: 0.328 ± 0.203
0.328CysGln: 0.328 ± 0.253
0.656CysArg: 0.656 ± 0.414
0.656CysSer: 0.656 ± 0.254
0.656CysThr: 0.656 ± 0.334
0.492CysVal: 0.492 ± 0.25
0.328CysTrp: 0.328 ± 0.203
0.82CysTyr: 0.82 ± 0.363
0.0CysXaa: 0.0 ± 0.0
Asp
3.444AspAla: 3.444 ± 0.844
0.328AspCys: 0.328 ± 0.209
4.1AspAsp: 4.1 ± 0.946
5.904AspGlu: 5.904 ± 1.02
4.92AspPhe: 4.92 ± 0.735
4.428AspGly: 4.428 ± 0.884
0.164AspHis: 0.164 ± 0.171
3.608AspIle: 3.608 ± 0.898
5.412AspLys: 5.412 ± 0.959
5.576AspLeu: 5.576 ± 0.841
1.968AspMet: 1.968 ± 0.654
3.608AspAsn: 3.608 ± 0.864
1.312AspPro: 1.312 ± 0.311
0.82AspGln: 0.82 ± 0.46
1.804AspArg: 1.804 ± 0.479
2.624AspSer: 2.624 ± 0.483
3.936AspThr: 3.936 ± 0.619
2.624AspVal: 2.624 ± 0.709
1.148AspTrp: 1.148 ± 0.489
3.936AspTyr: 3.936 ± 0.981
0.0AspXaa: 0.0 ± 0.0
Glu
3.936GluAla: 3.936 ± 0.701
1.312GluCys: 1.312 ± 0.471
3.772GluAsp: 3.772 ± 0.674
6.724GluGlu: 6.724 ± 1.722
4.264GluPhe: 4.264 ± 0.991
3.444GluGly: 3.444 ± 0.672
1.476GluHis: 1.476 ± 0.612
4.428GluIle: 4.428 ± 0.706
4.756GluLys: 4.756 ± 0.91
7.543GluLeu: 7.543 ± 1.485
3.772GluMet: 3.772 ± 0.729
3.116GluAsn: 3.116 ± 0.595
0.984GluPro: 0.984 ± 0.439
3.772GluGln: 3.772 ± 0.806
4.1GluArg: 4.1 ± 1.023
1.804GluSer: 1.804 ± 0.467
5.904GluThr: 5.904 ± 0.726
4.592GluVal: 4.592 ± 0.874
0.984GluTrp: 0.984 ± 0.349
2.952GluTyr: 2.952 ± 1.051
0.0GluXaa: 0.0 ± 0.0
Phe
2.296PheAla: 2.296 ± 0.511
0.82PheCys: 0.82 ± 0.295
3.28PheAsp: 3.28 ± 0.659
2.952PheGlu: 2.952 ± 0.69
1.148PhePhe: 1.148 ± 0.389
3.608PheGly: 3.608 ± 0.526
0.492PheHis: 0.492 ± 0.225
4.1PheIle: 4.1 ± 1.091
5.248PheLys: 5.248 ± 1.274
2.296PheLeu: 2.296 ± 0.483
1.148PheMet: 1.148 ± 0.449
4.1PheAsn: 4.1 ± 0.835
1.804PhePro: 1.804 ± 0.533
0.656PheGln: 0.656 ± 0.251
1.64PheArg: 1.64 ± 0.457
2.46PheSer: 2.46 ± 0.618
2.952PheThr: 2.952 ± 0.657
2.132PheVal: 2.132 ± 0.665
0.984PheTrp: 0.984 ± 0.335
2.132PheTyr: 2.132 ± 0.501
0.0PheXaa: 0.0 ± 0.0
Gly
4.264GlyAla: 4.264 ± 0.91
0.328GlyCys: 0.328 ± 0.216
3.936GlyAsp: 3.936 ± 0.818
2.952GlyGlu: 2.952 ± 0.589
4.1GlyPhe: 4.1 ± 1.235
5.412GlyGly: 5.412 ± 1.054
0.656GlyHis: 0.656 ± 0.261
4.1GlyIle: 4.1 ± 0.794
5.412GlyLys: 5.412 ± 0.94
6.724GlyLeu: 6.724 ± 0.984
2.624GlyMet: 2.624 ± 0.852
2.624GlyAsn: 2.624 ± 0.796
0.656GlyPro: 0.656 ± 0.235
2.624GlyGln: 2.624 ± 0.637
3.444GlyArg: 3.444 ± 0.616
5.248GlySer: 5.248 ± 0.928
5.248GlyThr: 5.248 ± 0.978
4.92GlyVal: 4.92 ± 0.913
1.148GlyTrp: 1.148 ± 0.505
3.608GlyTyr: 3.608 ± 0.714
0.0GlyXaa: 0.0 ± 0.0
His
1.148HisAla: 1.148 ± 0.407
0.164HisCys: 0.164 ± 0.159
0.82HisAsp: 0.82 ± 0.308
0.656HisGlu: 0.656 ± 0.4
0.82HisPhe: 0.82 ± 0.35
0.328HisGly: 0.328 ± 0.221
0.164HisHis: 0.164 ± 0.159
0.984HisIle: 0.984 ± 0.314
0.984HisLys: 0.984 ± 0.548
1.148HisLeu: 1.148 ± 0.454
0.164HisMet: 0.164 ± 0.159
0.656HisAsn: 0.656 ± 0.399
0.0HisPro: 0.0 ± 0.0
0.328HisGln: 0.328 ± 0.238
0.328HisArg: 0.328 ± 0.254
0.984HisSer: 0.984 ± 0.362
0.164HisThr: 0.164 ± 0.16
0.82HisVal: 0.82 ± 0.441
0.492HisTrp: 0.492 ± 0.243
0.984HisTyr: 0.984 ± 0.366
0.0HisXaa: 0.0 ± 0.0
Ile
3.116IleAla: 3.116 ± 0.604
0.164IleCys: 0.164 ± 0.135
6.396IleAsp: 6.396 ± 0.952
5.74IleGlu: 5.74 ± 0.714
1.968IlePhe: 1.968 ± 0.639
3.936IleGly: 3.936 ± 0.937
0.984IleHis: 0.984 ± 0.482
4.428IleIle: 4.428 ± 1.066
5.904IleLys: 5.904 ± 1.112
3.772IleLeu: 3.772 ± 0.879
1.64IleMet: 1.64 ± 0.513
5.576IleAsn: 5.576 ± 0.922
1.64IlePro: 1.64 ± 0.378
1.476IleGln: 1.476 ± 0.43
1.476IleArg: 1.476 ± 0.467
2.46IleSer: 2.46 ± 0.534
4.92IleThr: 4.92 ± 0.908
2.624IleVal: 2.624 ± 0.695
0.82IleTrp: 0.82 ± 0.283
1.968IleTyr: 1.968 ± 0.529
0.0IleXaa: 0.0 ± 0.0
Lys
8.691LysAla: 8.691 ± 0.965
0.984LysCys: 0.984 ± 0.387
5.084LysAsp: 5.084 ± 1.217
8.199LysGlu: 8.199 ± 1.346
3.608LysPhe: 3.608 ± 0.616
7.379LysGly: 7.379 ± 1.074
1.148LysHis: 1.148 ± 0.51
4.1LysIle: 4.1 ± 0.884
8.363LysLys: 8.363 ± 1.588
6.068LysLeu: 6.068 ± 0.955
3.772LysMet: 3.772 ± 0.752
4.756LysAsn: 4.756 ± 0.816
2.788LysPro: 2.788 ± 0.734
4.92LysGln: 4.92 ± 0.8
2.952LysArg: 2.952 ± 0.683
2.952LysSer: 2.952 ± 0.63
4.592LysThr: 4.592 ± 0.919
6.396LysVal: 6.396 ± 0.95
1.312LysTrp: 1.312 ± 0.44
3.444LysTyr: 3.444 ± 0.858
0.0LysXaa: 0.0 ± 0.0
Leu
4.264LeuAla: 4.264 ± 1.102
0.984LeuCys: 0.984 ± 0.413
3.936LeuAsp: 3.936 ± 0.809
6.724LeuGlu: 6.724 ± 1.131
4.1LeuPhe: 4.1 ± 0.649
6.56LeuGly: 6.56 ± 1.381
1.312LeuHis: 1.312 ± 0.45
5.904LeuIle: 5.904 ± 1.157
7.871LeuLys: 7.871 ± 0.971
6.232LeuLeu: 6.232 ± 1.07
2.46LeuMet: 2.46 ± 0.816
5.412LeuAsn: 5.412 ± 0.962
2.624LeuPro: 2.624 ± 0.544
3.444LeuGln: 3.444 ± 0.758
2.46LeuArg: 2.46 ± 0.669
4.428LeuSer: 4.428 ± 0.82
6.068LeuThr: 6.068 ± 0.846
4.756LeuVal: 4.756 ± 0.828
0.656LeuTrp: 0.656 ± 0.356
2.624LeuTyr: 2.624 ± 0.717
0.0LeuXaa: 0.0 ± 0.0
Met
3.608MetAla: 3.608 ± 0.666
0.164MetCys: 0.164 ± 0.165
1.148MetAsp: 1.148 ± 0.37
2.788MetGlu: 2.788 ± 0.741
1.148MetPhe: 1.148 ± 0.44
1.64MetGly: 1.64 ± 0.541
0.0MetHis: 0.0 ± 0.0
2.624MetIle: 2.624 ± 0.693
2.788MetLys: 2.788 ± 0.492
2.788MetLeu: 2.788 ± 0.76
0.164MetMet: 0.164 ± 0.128
1.64MetAsn: 1.64 ± 0.534
0.82MetPro: 0.82 ± 0.286
0.984MetGln: 0.984 ± 0.368
1.312MetArg: 1.312 ± 0.629
1.804MetSer: 1.804 ± 0.534
1.804MetThr: 1.804 ± 0.605
1.476MetVal: 1.476 ± 0.545
0.164MetTrp: 0.164 ± 0.185
1.148MetTyr: 1.148 ± 0.593
0.0MetXaa: 0.0 ± 0.0
Asn
5.248AsnAla: 5.248 ± 1.095
0.984AsnCys: 0.984 ± 0.438
3.608AsnAsp: 3.608 ± 1.037
4.92AsnGlu: 4.92 ± 0.751
2.624AsnPhe: 2.624 ± 0.564
4.592AsnGly: 4.592 ± 0.598
0.328AsnHis: 0.328 ± 0.234
3.772AsnIle: 3.772 ± 0.824
5.74AsnLys: 5.74 ± 0.908
4.592AsnLeu: 4.592 ± 0.848
1.804AsnMet: 1.804 ± 0.482
3.772AsnAsn: 3.772 ± 0.716
2.46AsnPro: 2.46 ± 0.805
2.296AsnGln: 2.296 ± 0.647
1.804AsnArg: 1.804 ± 0.583
2.624AsnSer: 2.624 ± 0.599
4.1AsnThr: 4.1 ± 0.909
3.608AsnVal: 3.608 ± 0.508
0.984AsnTrp: 0.984 ± 0.365
2.624AsnTyr: 2.624 ± 0.549
0.0AsnXaa: 0.0 ± 0.0
Pro
0.984ProAla: 0.984 ± 0.374
0.164ProCys: 0.164 ± 0.164
1.968ProAsp: 1.968 ± 0.479
1.312ProGlu: 1.312 ± 0.35
1.312ProPhe: 1.312 ± 0.48
0.328ProGly: 0.328 ± 0.209
0.164ProHis: 0.164 ± 0.171
1.476ProIle: 1.476 ± 0.536
2.788ProLys: 2.788 ± 0.579
2.624ProLeu: 2.624 ± 0.505
0.328ProMet: 0.328 ± 0.274
2.624ProAsn: 2.624 ± 0.798
1.312ProPro: 1.312 ± 0.421
1.804ProGln: 1.804 ± 0.621
0.984ProArg: 0.984 ± 0.353
1.476ProSer: 1.476 ± 0.481
1.64ProThr: 1.64 ± 0.471
1.312ProVal: 1.312 ± 0.454
0.164ProTrp: 0.164 ± 0.164
1.64ProTyr: 1.64 ± 0.454
0.0ProXaa: 0.0 ± 0.0
Gln
3.772GlnAla: 3.772 ± 0.853
0.656GlnCys: 0.656 ± 0.28
1.804GlnAsp: 1.804 ± 0.584
1.476GlnGlu: 1.476 ± 0.427
1.64GlnPhe: 1.64 ± 0.626
3.444GlnGly: 3.444 ± 0.604
0.328GlnHis: 0.328 ± 0.208
2.952GlnIle: 2.952 ± 0.527
3.772GlnLys: 3.772 ± 0.935
3.116GlnLeu: 3.116 ± 0.667
1.476GlnMet: 1.476 ± 0.553
2.296GlnAsn: 2.296 ± 0.749
0.656GlnPro: 0.656 ± 0.289
1.312GlnGln: 1.312 ± 0.378
1.804GlnArg: 1.804 ± 0.553
1.64GlnSer: 1.64 ± 0.506
2.132GlnThr: 2.132 ± 0.582
1.968GlnVal: 1.968 ± 0.567
0.328GlnTrp: 0.328 ± 0.22
1.476GlnTyr: 1.476 ± 0.512
0.0GlnXaa: 0.0 ± 0.0
Arg
1.476ArgAla: 1.476 ± 0.391
0.492ArgCys: 0.492 ± 0.289
1.64ArgAsp: 1.64 ± 0.445
4.264ArgGlu: 4.264 ± 1.133
1.64ArgPhe: 1.64 ± 0.541
2.788ArgGly: 2.788 ± 0.517
0.164ArgHis: 0.164 ± 0.164
0.82ArgIle: 0.82 ± 0.411
3.936ArgLys: 3.936 ± 0.674
2.624ArgLeu: 2.624 ± 0.471
1.312ArgMet: 1.312 ± 0.37
1.968ArgAsn: 1.968 ± 0.692
1.148ArgPro: 1.148 ± 0.474
0.984ArgGln: 0.984 ± 0.306
1.312ArgArg: 1.312 ± 0.52
2.296ArgSer: 2.296 ± 0.678
1.968ArgThr: 1.968 ± 0.495
2.788ArgVal: 2.788 ± 0.841
0.656ArgTrp: 0.656 ± 0.348
1.312ArgTyr: 1.312 ± 0.45
0.0ArgXaa: 0.0 ± 0.0
Ser
3.116SerAla: 3.116 ± 0.869
0.0SerCys: 0.0 ± 0.0
3.116SerAsp: 3.116 ± 0.627
2.788SerGlu: 2.788 ± 0.755
1.64SerPhe: 1.64 ± 0.598
4.756SerGly: 4.756 ± 1.018
0.984SerHis: 0.984 ± 0.327
3.444SerIle: 3.444 ± 0.696
4.1SerLys: 4.1 ± 0.724
4.592SerLeu: 4.592 ± 0.732
1.148SerMet: 1.148 ± 0.355
2.952SerAsn: 2.952 ± 0.643
0.656SerPro: 0.656 ± 0.32
1.968SerGln: 1.968 ± 0.694
0.984SerArg: 0.984 ± 0.355
3.444SerSer: 3.444 ± 0.748
4.264SerThr: 4.264 ± 0.854
3.772SerVal: 3.772 ± 0.528
0.82SerTrp: 0.82 ± 0.386
2.788SerTyr: 2.788 ± 0.661
0.0SerXaa: 0.0 ± 0.0
Thr
4.1ThrAla: 4.1 ± 0.861
0.164ThrCys: 0.164 ± 0.18
5.084ThrAsp: 5.084 ± 1.014
3.608ThrGlu: 3.608 ± 0.858
2.788ThrPhe: 2.788 ± 0.695
6.396ThrGly: 6.396 ± 1.277
1.148ThrHis: 1.148 ± 0.385
4.264ThrIle: 4.264 ± 0.879
6.724ThrLys: 6.724 ± 1.01
7.215ThrLeu: 7.215 ± 1.051
1.148ThrMet: 1.148 ± 0.479
3.116ThrAsn: 3.116 ± 0.53
2.132ThrPro: 2.132 ± 0.482
2.624ThrGln: 2.624 ± 0.63
1.64ThrArg: 1.64 ± 0.432
3.608ThrSer: 3.608 ± 0.902
3.444ThrThr: 3.444 ± 0.596
4.1ThrVal: 4.1 ± 0.768
0.656ThrTrp: 0.656 ± 0.314
2.788ThrTyr: 2.788 ± 0.817
0.0ThrXaa: 0.0 ± 0.0
Val
2.624ValAla: 2.624 ± 0.675
0.328ValCys: 0.328 ± 0.197
5.084ValAsp: 5.084 ± 0.695
3.444ValGlu: 3.444 ± 0.719
2.46ValPhe: 2.46 ± 0.619
2.624ValGly: 2.624 ± 0.872
0.656ValHis: 0.656 ± 0.291
2.624ValIle: 2.624 ± 0.823
5.248ValLys: 5.248 ± 1.058
3.608ValLeu: 3.608 ± 0.757
1.312ValMet: 1.312 ± 0.52
3.936ValAsn: 3.936 ± 0.896
2.296ValPro: 2.296 ± 0.578
2.46ValGln: 2.46 ± 0.758
3.28ValArg: 3.28 ± 0.789
4.428ValSer: 4.428 ± 0.907
4.264ValThr: 4.264 ± 0.799
2.132ValVal: 2.132 ± 0.703
1.312ValTrp: 1.312 ± 0.381
3.444ValTyr: 3.444 ± 0.724
0.0ValXaa: 0.0 ± 0.0
Trp
1.476TrpAla: 1.476 ± 0.381
0.492TrpCys: 0.492 ± 0.271
0.492TrpAsp: 0.492 ± 0.273
0.492TrpGlu: 0.492 ± 0.25
0.984TrpPhe: 0.984 ± 0.344
1.312TrpGly: 1.312 ± 0.392
0.164TrpHis: 0.164 ± 0.164
0.328TrpIle: 0.328 ± 0.214
0.82TrpLys: 0.82 ± 0.323
2.132TrpLeu: 2.132 ± 0.574
0.164TrpMet: 0.164 ± 0.163
1.148TrpAsn: 1.148 ± 0.614
0.0TrpPro: 0.0 ± 0.0
0.82TrpGln: 0.82 ± 0.394
0.492TrpArg: 0.492 ± 0.276
0.164TrpSer: 0.164 ± 0.135
1.804TrpThr: 1.804 ± 0.577
0.82TrpVal: 0.82 ± 0.397
0.328TrpTrp: 0.328 ± 0.231
0.328TrpTyr: 0.328 ± 0.233
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.608TyrAla: 3.608 ± 0.781
0.82TyrCys: 0.82 ± 0.287
1.968TyrAsp: 1.968 ± 0.386
3.28TyrGlu: 3.28 ± 0.723
2.296TyrPhe: 2.296 ± 0.681
3.936TyrGly: 3.936 ± 0.818
1.312TyrHis: 1.312 ± 0.483
2.46TyrIle: 2.46 ± 0.666
4.1TyrLys: 4.1 ± 0.711
2.296TyrLeu: 2.296 ± 0.611
1.476TyrMet: 1.476 ± 0.48
3.608TyrAsn: 3.608 ± 0.825
1.148TyrPro: 1.148 ± 0.46
1.312TyrGln: 1.312 ± 0.493
1.64TyrArg: 1.64 ± 0.523
2.46TyrSer: 2.46 ± 0.568
3.28TyrThr: 3.28 ± 0.609
2.132TyrVal: 2.132 ± 0.53
0.656TyrTrp: 0.656 ± 0.331
2.46TyrTyr: 2.46 ± 0.766
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 40 proteins (6099 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski