Amino acid dipepetide frequency for Synechococcus phage S-CBP3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.979AlaAla: 10.979 ± 1.327
0.318AlaCys: 0.318 ± 0.142
6.444AlaAsp: 6.444 ± 0.822
5.648AlaGlu: 5.648 ± 0.848
4.057AlaPhe: 4.057 ± 0.783
6.444AlaGly: 6.444 ± 0.849
1.193AlaHis: 1.193 ± 0.321
4.375AlaIle: 4.375 ± 0.561
6.126AlaLys: 6.126 ± 0.927
8.035AlaLeu: 8.035 ± 0.912
3.66AlaMet: 3.66 ± 0.681
5.091AlaAsn: 5.091 ± 0.795
2.864AlaPro: 2.864 ± 0.507
4.375AlaGln: 4.375 ± 0.646
4.773AlaArg: 4.773 ± 0.823
6.364AlaSer: 6.364 ± 0.855
4.773AlaThr: 4.773 ± 0.615
4.853AlaVal: 4.853 ± 0.62
1.273AlaTrp: 1.273 ± 0.327
3.58AlaTyr: 3.58 ± 0.418
0.0AlaXaa: 0.0 ± 0.0
Cys
0.557CysAla: 0.557 ± 0.222
0.159CysCys: 0.159 ± 0.112
0.398CysAsp: 0.398 ± 0.172
0.398CysGlu: 0.398 ± 0.194
0.318CysPhe: 0.318 ± 0.163
0.477CysGly: 0.477 ± 0.193
0.239CysHis: 0.239 ± 0.12
0.159CysIle: 0.159 ± 0.093
0.318CysLys: 0.318 ± 0.263
0.716CysLeu: 0.716 ± 0.254
0.08CysMet: 0.08 ± 0.074
0.398CysAsn: 0.398 ± 0.174
0.159CysPro: 0.159 ± 0.108
0.398CysGln: 0.398 ± 0.214
0.318CysArg: 0.318 ± 0.205
0.636CysSer: 0.636 ± 0.27
0.477CysThr: 0.477 ± 0.224
0.398CysVal: 0.398 ± 0.183
0.0CysTrp: 0.0 ± 0.0
0.398CysTyr: 0.398 ± 0.183
0.0CysXaa: 0.0 ± 0.0
Asp
6.046AspAla: 6.046 ± 0.707
0.398AspCys: 0.398 ± 0.185
4.057AspAsp: 4.057 ± 0.73
4.535AspGlu: 4.535 ± 0.711
2.466AspPhe: 2.466 ± 0.483
5.171AspGly: 5.171 ± 0.933
1.114AspHis: 1.114 ± 0.319
3.66AspIle: 3.66 ± 0.468
2.546AspLys: 2.546 ± 0.521
5.807AspLeu: 5.807 ± 0.68
1.273AspMet: 1.273 ± 0.235
2.466AspAsn: 2.466 ± 0.43
3.103AspPro: 3.103 ± 0.594
1.909AspGln: 1.909 ± 0.345
2.546AspArg: 2.546 ± 0.44
4.694AspSer: 4.694 ± 0.793
3.262AspThr: 3.262 ± 0.619
3.819AspVal: 3.819 ± 0.505
0.716AspTrp: 0.716 ± 0.233
2.705AspTyr: 2.705 ± 0.552
0.0AspXaa: 0.0 ± 0.0
Glu
6.603GluAla: 6.603 ± 0.749
0.318GluCys: 0.318 ± 0.154
2.944GluAsp: 2.944 ± 0.536
6.762GluGlu: 6.762 ± 1.34
2.546GluPhe: 2.546 ± 0.468
3.66GluGly: 3.66 ± 0.61
1.512GluHis: 1.512 ± 0.366
3.66GluIle: 3.66 ± 0.566
3.182GluLys: 3.182 ± 0.587
6.205GluLeu: 6.205 ± 0.631
1.273GluMet: 1.273 ± 0.382
3.262GluAsn: 3.262 ± 0.524
2.068GluPro: 2.068 ± 0.455
3.341GluGln: 3.341 ± 0.638
3.103GluArg: 3.103 ± 0.64
3.5GluSer: 3.5 ± 0.571
3.58GluThr: 3.58 ± 0.568
3.5GluVal: 3.5 ± 0.503
1.273GluTrp: 1.273 ± 0.378
2.705GluTyr: 2.705 ± 0.466
0.0GluXaa: 0.0 ± 0.0
Phe
2.148PheAla: 2.148 ± 0.524
0.159PheCys: 0.159 ± 0.107
2.784PheAsp: 2.784 ± 0.59
1.671PheGlu: 1.671 ± 0.292
1.591PhePhe: 1.591 ± 0.452
2.068PheGly: 2.068 ± 0.573
0.716PheHis: 0.716 ± 0.231
2.148PheIle: 2.148 ± 0.521
2.228PheLys: 2.228 ± 0.38
2.546PheLeu: 2.546 ± 0.51
1.512PheMet: 1.512 ± 0.406
2.148PheAsn: 2.148 ± 0.53
1.432PhePro: 1.432 ± 0.254
1.591PheGln: 1.591 ± 0.469
1.989PheArg: 1.989 ± 0.344
2.625PheSer: 2.625 ± 0.412
2.387PheThr: 2.387 ± 0.484
1.83PheVal: 1.83 ± 0.395
0.239PheTrp: 0.239 ± 0.156
1.512PheTyr: 1.512 ± 0.448
0.0PheXaa: 0.0 ± 0.0
Gly
5.728GlyAla: 5.728 ± 0.942
0.557GlyCys: 0.557 ± 0.189
4.694GlyAsp: 4.694 ± 0.678
3.819GlyGlu: 3.819 ± 0.635
2.466GlyPhe: 2.466 ± 0.519
3.819GlyGly: 3.819 ± 0.624
1.114GlyHis: 1.114 ± 0.339
4.296GlyIle: 4.296 ± 0.703
4.614GlyLys: 4.614 ± 0.644
6.205GlyLeu: 6.205 ± 0.748
1.909GlyMet: 1.909 ± 0.464
3.341GlyAsn: 3.341 ± 0.645
1.75GlyPro: 1.75 ± 0.368
3.66GlyGln: 3.66 ± 0.683
3.898GlyArg: 3.898 ± 0.558
5.091GlySer: 5.091 ± 0.883
5.489GlyThr: 5.489 ± 1.227
5.489GlyVal: 5.489 ± 0.644
1.114GlyTrp: 1.114 ± 0.309
2.625GlyTyr: 2.625 ± 0.589
0.0GlyXaa: 0.0 ± 0.0
His
0.796HisAla: 0.796 ± 0.277
0.159HisCys: 0.159 ± 0.106
0.955HisAsp: 0.955 ± 0.351
1.193HisGlu: 1.193 ± 0.305
0.239HisPhe: 0.239 ± 0.237
1.352HisGly: 1.352 ± 0.325
0.239HisHis: 0.239 ± 0.139
0.796HisIle: 0.796 ± 0.302
0.716HisLys: 0.716 ± 0.269
1.671HisLeu: 1.671 ± 0.524
0.636HisMet: 0.636 ± 0.228
0.716HisAsn: 0.716 ± 0.221
0.875HisPro: 0.875 ± 0.224
0.955HisGln: 0.955 ± 0.297
1.034HisArg: 1.034 ± 0.284
1.193HisSer: 1.193 ± 0.3
0.716HisThr: 0.716 ± 0.236
1.193HisVal: 1.193 ± 0.306
0.398HisTrp: 0.398 ± 0.188
0.875HisTyr: 0.875 ± 0.262
0.0HisXaa: 0.0 ± 0.0
Ile
5.967IleAla: 5.967 ± 0.772
0.477IleCys: 0.477 ± 0.217
3.421IleAsp: 3.421 ± 0.559
3.023IleGlu: 3.023 ± 0.463
1.432IlePhe: 1.432 ± 0.337
3.978IleGly: 3.978 ± 0.657
0.875IleHis: 0.875 ± 0.336
2.148IleIle: 2.148 ± 0.407
3.182IleLys: 3.182 ± 0.412
3.421IleLeu: 3.421 ± 0.433
0.955IleMet: 0.955 ± 0.317
2.625IleAsn: 2.625 ± 0.456
1.75IlePro: 1.75 ± 0.293
1.989IleGln: 1.989 ± 0.382
3.421IleArg: 3.421 ± 0.497
2.944IleSer: 2.944 ± 0.601
3.023IleThr: 3.023 ± 0.519
3.182IleVal: 3.182 ± 0.449
0.636IleTrp: 0.636 ± 0.22
1.432IleTyr: 1.432 ± 0.337
0.0IleXaa: 0.0 ± 0.0
Lys
6.126LysAla: 6.126 ± 0.973
0.477LysCys: 0.477 ± 0.209
3.898LysAsp: 3.898 ± 0.552
3.66LysGlu: 3.66 ± 0.588
1.83LysPhe: 1.83 ± 0.357
3.819LysGly: 3.819 ± 0.612
0.636LysHis: 0.636 ± 0.22
2.466LysIle: 2.466 ± 0.338
2.864LysLys: 2.864 ± 0.619
5.012LysLeu: 5.012 ± 0.674
1.671LysMet: 1.671 ± 0.371
2.944LysAsn: 2.944 ± 0.559
2.705LysPro: 2.705 ± 0.818
1.909LysGln: 1.909 ± 0.445
3.341LysArg: 3.341 ± 0.511
2.307LysSer: 2.307 ± 0.455
3.023LysThr: 3.023 ± 0.447
4.216LysVal: 4.216 ± 0.672
0.716LysTrp: 0.716 ± 0.228
1.512LysTyr: 1.512 ± 0.45
0.0LysXaa: 0.0 ± 0.0
Leu
8.194LeuAla: 8.194 ± 0.714
0.318LeuCys: 0.318 ± 0.159
5.41LeuAsp: 5.41 ± 0.644
5.41LeuGlu: 5.41 ± 0.688
2.784LeuPhe: 2.784 ± 0.579
5.648LeuGly: 5.648 ± 0.556
1.591LeuHis: 1.591 ± 0.393
3.819LeuIle: 3.819 ± 0.65
5.41LeuLys: 5.41 ± 0.69
6.762LeuLeu: 6.762 ± 0.72
1.671LeuMet: 1.671 ± 0.278
4.375LeuAsn: 4.375 ± 0.749
3.819LeuPro: 3.819 ± 0.591
5.171LeuGln: 5.171 ± 0.753
5.807LeuArg: 5.807 ± 0.598
5.489LeuSer: 5.489 ± 0.732
5.012LeuThr: 5.012 ± 0.806
4.932LeuVal: 4.932 ± 0.756
0.557LeuTrp: 0.557 ± 0.218
2.864LeuTyr: 2.864 ± 0.507
0.0LeuXaa: 0.0 ± 0.0
Met
2.944MetAla: 2.944 ± 0.451
0.398MetCys: 0.398 ± 0.167
1.114MetAsp: 1.114 ± 0.377
1.75MetGlu: 1.75 ± 0.453
0.398MetPhe: 0.398 ± 0.212
1.512MetGly: 1.512 ± 0.378
0.477MetHis: 0.477 ± 0.265
0.796MetIle: 0.796 ± 0.235
1.193MetLys: 1.193 ± 0.319
2.864MetLeu: 2.864 ± 0.706
0.796MetMet: 0.796 ± 0.246
1.273MetAsn: 1.273 ± 0.32
0.875MetPro: 0.875 ± 0.232
1.273MetGln: 1.273 ± 0.404
1.034MetArg: 1.034 ± 0.25
2.546MetSer: 2.546 ± 0.477
1.352MetThr: 1.352 ± 0.31
0.796MetVal: 0.796 ± 0.237
0.398MetTrp: 0.398 ± 0.159
1.114MetTyr: 1.114 ± 0.364
0.0MetXaa: 0.0 ± 0.0
Asn
4.614AsnAla: 4.614 ± 0.697
0.398AsnCys: 0.398 ± 0.151
3.103AsnAsp: 3.103 ± 0.486
2.466AsnGlu: 2.466 ± 0.527
2.148AsnPhe: 2.148 ± 0.3
4.455AsnGly: 4.455 ± 0.586
0.716AsnHis: 0.716 ± 0.229
2.148AsnIle: 2.148 ± 0.424
2.546AsnLys: 2.546 ± 0.53
3.898AsnLeu: 3.898 ± 0.522
1.273AsnMet: 1.273 ± 0.248
3.58AsnAsn: 3.58 ± 0.587
3.739AsnPro: 3.739 ± 0.583
1.75AsnGln: 1.75 ± 0.375
3.66AsnArg: 3.66 ± 0.56
2.705AsnSer: 2.705 ± 0.323
3.341AsnThr: 3.341 ± 0.528
2.705AsnVal: 2.705 ± 0.543
0.875AsnTrp: 0.875 ± 0.274
2.546AsnTyr: 2.546 ± 0.525
0.0AsnXaa: 0.0 ± 0.0
Pro
3.739ProAla: 3.739 ± 0.828
0.318ProCys: 0.318 ± 0.151
2.784ProAsp: 2.784 ± 0.433
2.705ProGlu: 2.705 ± 0.46
1.671ProPhe: 1.671 ± 0.434
3.103ProGly: 3.103 ± 0.663
0.477ProHis: 0.477 ± 0.216
2.148ProIle: 2.148 ± 0.405
2.148ProLys: 2.148 ± 0.402
2.705ProLeu: 2.705 ± 0.391
0.636ProMet: 0.636 ± 0.242
2.148ProAsn: 2.148 ± 0.336
1.591ProPro: 1.591 ± 0.28
1.75ProGln: 1.75 ± 0.418
1.432ProArg: 1.432 ± 0.228
3.262ProSer: 3.262 ± 0.472
2.546ProThr: 2.546 ± 0.585
2.784ProVal: 2.784 ± 0.615
0.955ProTrp: 0.955 ± 0.278
1.273ProTyr: 1.273 ± 0.292
0.0ProXaa: 0.0 ± 0.0
Gln
5.012GlnAla: 5.012 ± 0.897
0.08GlnCys: 0.08 ± 0.076
2.307GlnAsp: 2.307 ± 0.328
3.103GlnGlu: 3.103 ± 0.602
1.193GlnPhe: 1.193 ± 0.414
3.023GlnGly: 3.023 ± 0.443
0.875GlnHis: 0.875 ± 0.261
2.546GlnIle: 2.546 ± 0.466
2.705GlnLys: 2.705 ± 0.605
3.978GlnLeu: 3.978 ± 0.491
1.273GlnMet: 1.273 ± 0.326
1.83GlnAsn: 1.83 ± 0.389
1.114GlnPro: 1.114 ± 0.248
3.341GlnGln: 3.341 ± 0.511
2.068GlnArg: 2.068 ± 0.418
3.103GlnSer: 3.103 ± 0.435
3.023GlnThr: 3.023 ± 0.443
3.341GlnVal: 3.341 ± 0.556
0.318GlnTrp: 0.318 ± 0.135
2.228GlnTyr: 2.228 ± 0.348
0.0GlnXaa: 0.0 ± 0.0
Arg
4.853ArgAla: 4.853 ± 0.712
0.318ArgCys: 0.318 ± 0.177
1.909ArgAsp: 1.909 ± 0.448
3.739ArgGlu: 3.739 ± 0.751
1.83ArgPhe: 1.83 ± 0.353
3.341ArgGly: 3.341 ± 0.574
1.034ArgHis: 1.034 ± 0.308
2.784ArgIle: 2.784 ± 0.513
2.705ArgLys: 2.705 ± 0.577
6.205ArgLeu: 6.205 ± 0.962
1.432ArgMet: 1.432 ± 0.337
2.625ArgAsn: 2.625 ± 0.371
2.387ArgPro: 2.387 ± 0.528
3.103ArgGln: 3.103 ± 0.618
2.864ArgArg: 2.864 ± 0.488
3.819ArgSer: 3.819 ± 0.385
2.784ArgThr: 2.784 ± 0.448
2.307ArgVal: 2.307 ± 0.49
0.477ArgTrp: 0.477 ± 0.221
1.989ArgTyr: 1.989 ± 0.375
0.0ArgXaa: 0.0 ± 0.0
Ser
5.887SerAla: 5.887 ± 0.661
0.398SerCys: 0.398 ± 0.183
5.091SerAsp: 5.091 ± 0.848
3.66SerGlu: 3.66 ± 0.502
2.228SerPhe: 2.228 ± 0.331
6.285SerGly: 6.285 ± 1.072
0.875SerHis: 0.875 ± 0.221
3.421SerIle: 3.421 ± 0.526
3.5SerLys: 3.5 ± 0.589
5.41SerLeu: 5.41 ± 0.777
1.193SerMet: 1.193 ± 0.348
4.137SerAsn: 4.137 ± 0.674
2.148SerPro: 2.148 ± 0.492
2.387SerGln: 2.387 ± 0.507
3.421SerArg: 3.421 ± 0.459
4.535SerSer: 4.535 ± 0.959
4.694SerThr: 4.694 ± 0.928
3.58SerVal: 3.58 ± 0.537
0.875SerTrp: 0.875 ± 0.265
2.625SerTyr: 2.625 ± 0.56
0.0SerXaa: 0.0 ± 0.0
Thr
5.41ThrAla: 5.41 ± 0.96
0.398ThrCys: 0.398 ± 0.22
3.182ThrAsp: 3.182 ± 0.594
4.296ThrGlu: 4.296 ± 0.712
2.546ThrPhe: 2.546 ± 0.407
4.932ThrGly: 4.932 ± 0.806
0.557ThrHis: 0.557 ± 0.199
3.58ThrIle: 3.58 ± 0.779
2.784ThrLys: 2.784 ± 0.554
4.375ThrLeu: 4.375 ± 0.542
1.034ThrMet: 1.034 ± 0.214
3.182ThrAsn: 3.182 ± 0.575
3.819ThrPro: 3.819 ± 0.492
2.228ThrGln: 2.228 ± 0.362
2.148ThrArg: 2.148 ± 0.334
4.694ThrSer: 4.694 ± 0.872
5.807ThrThr: 5.807 ± 1.197
3.58ThrVal: 3.58 ± 0.781
0.875ThrTrp: 0.875 ± 0.26
2.705ThrTyr: 2.705 ± 0.477
0.0ThrXaa: 0.0 ± 0.0
Val
5.171ValAla: 5.171 ± 0.579
0.875ValCys: 0.875 ± 0.216
4.455ValAsp: 4.455 ± 0.472
3.5ValGlu: 3.5 ± 0.626
1.75ValPhe: 1.75 ± 0.492
5.251ValGly: 5.251 ± 0.647
1.273ValHis: 1.273 ± 0.301
2.705ValIle: 2.705 ± 0.483
3.341ValLys: 3.341 ± 0.42
4.455ValLeu: 4.455 ± 0.702
1.75ValMet: 1.75 ± 0.378
3.5ValAsn: 3.5 ± 0.612
2.466ValPro: 2.466 ± 0.351
2.466ValGln: 2.466 ± 0.627
2.546ValArg: 2.546 ± 0.463
3.739ValSer: 3.739 ± 0.624
3.978ValThr: 3.978 ± 0.917
4.853ValVal: 4.853 ± 0.729
0.796ValTrp: 0.796 ± 0.297
1.671ValTyr: 1.671 ± 0.313
0.0ValXaa: 0.0 ± 0.0
Trp
1.114TrpAla: 1.114 ± 0.337
0.239TrpCys: 0.239 ± 0.122
0.796TrpAsp: 0.796 ± 0.217
1.193TrpGlu: 1.193 ± 0.309
0.875TrpPhe: 0.875 ± 0.258
0.239TrpGly: 0.239 ± 0.125
0.239TrpHis: 0.239 ± 0.122
0.557TrpIle: 0.557 ± 0.196
0.477TrpLys: 0.477 ± 0.162
1.432TrpLeu: 1.432 ± 0.336
0.159TrpMet: 0.159 ± 0.12
0.716TrpAsn: 0.716 ± 0.244
0.318TrpPro: 0.318 ± 0.181
0.716TrpGln: 0.716 ± 0.191
0.796TrpArg: 0.796 ± 0.255
0.875TrpSer: 0.875 ± 0.331
0.398TrpThr: 0.398 ± 0.164
0.875TrpVal: 0.875 ± 0.217
0.239TrpTrp: 0.239 ± 0.129
0.557TrpTyr: 0.557 ± 0.195
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.341TyrAla: 3.341 ± 0.638
0.239TyrCys: 0.239 ± 0.149
2.546TyrAsp: 2.546 ± 0.585
2.466TyrGlu: 2.466 ± 0.498
0.955TyrPhe: 0.955 ± 0.266
3.103TyrGly: 3.103 ± 0.57
0.955TyrHis: 0.955 ± 0.25
1.671TyrIle: 1.671 ± 0.305
2.546TyrLys: 2.546 ± 0.563
3.341TyrLeu: 3.341 ± 0.563
0.636TyrMet: 0.636 ± 0.251
2.307TyrAsn: 2.307 ± 0.369
1.193TyrPro: 1.193 ± 0.376
2.148TyrGln: 2.148 ± 0.388
2.387TyrArg: 2.387 ± 0.571
2.228TyrSer: 2.228 ± 0.442
2.466TyrThr: 2.466 ± 0.503
2.307TyrVal: 2.307 ± 0.39
0.08TyrTrp: 0.08 ± 0.063
1.273TyrTyr: 1.273 ± 0.375
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (12571 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski