Amino acid dipepetide frequency for Streptococcus phage IPP28

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.165AlaAla: 4.165 ± 1.239
0.484AlaCys: 0.484 ± 0.236
4.65AlaAsp: 4.65 ± 0.693
6.297AlaGlu: 6.297 ± 0.734
3.003AlaPhe: 3.003 ± 0.44
3.681AlaGly: 3.681 ± 0.636
0.291AlaHis: 0.291 ± 0.15
5.328AlaIle: 5.328 ± 1.146
4.65AlaLys: 4.65 ± 0.619
7.943AlaLeu: 7.943 ± 1.027
1.744AlaMet: 1.744 ± 0.351
4.165AlaAsn: 4.165 ± 0.942
1.55AlaPro: 1.55 ± 0.334
3.003AlaGln: 3.003 ± 0.901
2.906AlaArg: 2.906 ± 0.393
3.487AlaSer: 3.487 ± 0.636
4.069AlaThr: 4.069 ± 0.521
3.972AlaVal: 3.972 ± 0.64
0.969AlaTrp: 0.969 ± 0.251
1.647AlaTyr: 1.647 ± 0.417
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.097CysAsp: 0.097 ± 0.093
0.678CysGlu: 0.678 ± 0.361
0.291CysPhe: 0.291 ± 0.161
0.194CysGly: 0.194 ± 0.107
0.097CysHis: 0.097 ± 0.092
0.678CysIle: 0.678 ± 0.379
0.387CysLys: 0.387 ± 0.165
0.581CysLeu: 0.581 ± 0.247
0.097CysMet: 0.097 ± 0.105
0.097CysAsn: 0.097 ± 0.122
0.097CysPro: 0.097 ± 0.083
0.291CysGln: 0.291 ± 0.171
0.484CysArg: 0.484 ± 0.249
0.291CysSer: 0.291 ± 0.216
0.291CysThr: 0.291 ± 0.198
0.194CysVal: 0.194 ± 0.133
0.0CysTrp: 0.0 ± 0.0
0.581CysTyr: 0.581 ± 0.21
0.0CysXaa: 0.0 ± 0.0
Asp
3.487AspAla: 3.487 ± 0.509
0.194AspCys: 0.194 ± 0.132
3.778AspAsp: 3.778 ± 0.73
3.972AspGlu: 3.972 ± 0.914
2.422AspPhe: 2.422 ± 0.586
5.134AspGly: 5.134 ± 0.889
0.678AspHis: 0.678 ± 0.218
5.619AspIle: 5.619 ± 0.561
4.65AspLys: 4.65 ± 0.791
5.619AspLeu: 5.619 ± 1.037
1.259AspMet: 1.259 ± 0.329
3.39AspAsn: 3.39 ± 0.671
1.55AspPro: 1.55 ± 0.376
1.55AspGln: 1.55 ± 0.334
1.937AspArg: 1.937 ± 0.436
3.681AspSer: 3.681 ± 0.56
2.325AspThr: 2.325 ± 0.441
3.487AspVal: 3.487 ± 0.668
0.775AspTrp: 0.775 ± 0.201
3.1AspTyr: 3.1 ± 0.62
0.0AspXaa: 0.0 ± 0.0
Glu
6.49GluAla: 6.49 ± 1.292
0.484GluCys: 0.484 ± 0.202
3.487GluAsp: 3.487 ± 0.689
6.975GluGlu: 6.975 ± 1.118
3.778GluPhe: 3.778 ± 0.731
3.39GluGly: 3.39 ± 0.509
0.484GluHis: 0.484 ± 0.294
7.362GluIle: 7.362 ± 0.986
8.137GluLys: 8.137 ± 1.134
9.203GluLeu: 9.203 ± 1.0
3.1GluMet: 3.1 ± 0.802
4.456GluAsn: 4.456 ± 0.616
1.356GluPro: 1.356 ± 0.548
4.069GluGln: 4.069 ± 0.811
3.778GluArg: 3.778 ± 0.81
4.069GluSer: 4.069 ± 0.483
4.165GluThr: 4.165 ± 0.637
4.262GluVal: 4.262 ± 0.612
1.259GluTrp: 1.259 ± 0.323
2.325GluTyr: 2.325 ± 0.494
0.0GluXaa: 0.0 ± 0.0
Phe
2.519PheAla: 2.519 ± 0.53
0.097PheCys: 0.097 ± 0.118
3.875PheAsp: 3.875 ± 0.673
2.809PheGlu: 2.809 ± 0.572
1.937PhePhe: 1.937 ± 0.614
3.003PheGly: 3.003 ± 0.506
0.581PheHis: 0.581 ± 0.266
3.003PheIle: 3.003 ± 0.66
4.359PheLys: 4.359 ± 0.483
3.1PheLeu: 3.1 ± 0.699
1.356PheMet: 1.356 ± 0.527
2.809PheAsn: 2.809 ± 0.467
0.872PhePro: 0.872 ± 0.328
1.55PheGln: 1.55 ± 0.434
1.744PheArg: 1.744 ± 0.388
2.809PheSer: 2.809 ± 0.562
2.422PheThr: 2.422 ± 0.418
2.422PheVal: 2.422 ± 0.505
0.581PheTrp: 0.581 ± 0.233
2.034PheTyr: 2.034 ± 0.348
0.0PheXaa: 0.0 ± 0.0
Gly
3.875GlyAla: 3.875 ± 0.565
0.097GlyCys: 0.097 ± 0.12
3.875GlyAsp: 3.875 ± 0.737
3.39GlyGlu: 3.39 ± 0.498
2.325GlyPhe: 2.325 ± 0.478
3.681GlyGly: 3.681 ± 0.608
0.581GlyHis: 0.581 ± 0.213
4.456GlyIle: 4.456 ± 0.606
4.456GlyLys: 4.456 ± 0.63
5.231GlyLeu: 5.231 ± 0.609
1.55GlyMet: 1.55 ± 0.438
3.39GlyAsn: 3.39 ± 0.62
0.969GlyPro: 0.969 ± 0.265
2.422GlyGln: 2.422 ± 0.727
3.584GlyArg: 3.584 ± 0.58
3.39GlySer: 3.39 ± 0.824
3.003GlyThr: 3.003 ± 0.662
4.069GlyVal: 4.069 ± 0.507
1.453GlyTrp: 1.453 ± 0.552
2.422GlyTyr: 2.422 ± 0.45
0.0GlyXaa: 0.0 ± 0.0
His
0.484HisAla: 0.484 ± 0.2
0.291HisCys: 0.291 ± 0.182
0.872HisAsp: 0.872 ± 0.312
1.259HisGlu: 1.259 ± 0.333
1.356HisPhe: 1.356 ± 0.36
0.581HisGly: 0.581 ± 0.231
0.484HisHis: 0.484 ± 0.219
1.744HisIle: 1.744 ± 0.371
0.484HisLys: 0.484 ± 0.207
1.162HisLeu: 1.162 ± 0.276
0.484HisMet: 0.484 ± 0.178
0.678HisAsn: 0.678 ± 0.247
0.291HisPro: 0.291 ± 0.154
0.581HisGln: 0.581 ± 0.274
0.194HisArg: 0.194 ± 0.131
1.356HisSer: 1.356 ± 0.489
0.775HisThr: 0.775 ± 0.359
1.162HisVal: 1.162 ± 0.362
0.097HisTrp: 0.097 ± 0.111
0.387HisTyr: 0.387 ± 0.19
0.0HisXaa: 0.0 ± 0.0
Ile
6.103IleAla: 6.103 ± 0.554
0.484IleCys: 0.484 ± 0.285
3.39IleAsp: 3.39 ± 0.489
5.812IleGlu: 5.812 ± 0.853
3.487IlePhe: 3.487 ± 0.835
3.39IleGly: 3.39 ± 0.608
0.872IleHis: 0.872 ± 0.253
4.553IleIle: 4.553 ± 0.924
6.393IleLys: 6.393 ± 0.671
5.328IleLeu: 5.328 ± 0.967
1.259IleMet: 1.259 ± 0.297
3.584IleAsn: 3.584 ± 0.524
2.131IlePro: 2.131 ± 0.43
2.712IleGln: 2.712 ± 0.304
3.972IleArg: 3.972 ± 0.643
5.522IleSer: 5.522 ± 0.99
4.262IleThr: 4.262 ± 0.572
4.359IleVal: 4.359 ± 0.894
0.775IleTrp: 0.775 ± 0.22
1.841IleTyr: 1.841 ± 0.498
0.0IleXaa: 0.0 ± 0.0
Lys
4.747LysAla: 4.747 ± 0.854
0.291LysCys: 0.291 ± 0.18
5.812LysAsp: 5.812 ± 0.931
7.847LysGlu: 7.847 ± 1.093
2.906LysPhe: 2.906 ± 0.588
4.456LysGly: 4.456 ± 0.695
2.131LysHis: 2.131 ± 0.511
4.747LysIle: 4.747 ± 0.808
5.619LysLys: 5.619 ± 1.09
5.619LysLeu: 5.619 ± 0.655
2.131LysMet: 2.131 ± 0.479
5.425LysAsn: 5.425 ± 0.597
3.197LysPro: 3.197 ± 0.659
3.875LysGln: 3.875 ± 0.591
4.94LysArg: 4.94 ± 0.927
4.165LysSer: 4.165 ± 0.571
5.715LysThr: 5.715 ± 0.664
4.844LysVal: 4.844 ± 0.878
1.259LysTrp: 1.259 ± 0.364
3.584LysTyr: 3.584 ± 0.823
0.0LysXaa: 0.0 ± 0.0
Leu
6.2LeuAla: 6.2 ± 0.832
0.291LeuCys: 0.291 ± 0.168
6.393LeuAsp: 6.393 ± 0.567
7.556LeuGlu: 7.556 ± 0.802
3.778LeuPhe: 3.778 ± 0.504
6.2LeuGly: 6.2 ± 0.623
1.259LeuHis: 1.259 ± 0.355
4.069LeuIle: 4.069 ± 0.72
8.04LeuLys: 8.04 ± 0.786
6.393LeuLeu: 6.393 ± 1.183
3.003LeuMet: 3.003 ± 0.774
4.359LeuAsn: 4.359 ± 0.617
2.712LeuPro: 2.712 ± 0.702
2.809LeuGln: 2.809 ± 0.864
4.65LeuArg: 4.65 ± 0.678
6.006LeuSer: 6.006 ± 0.811
5.715LeuThr: 5.715 ± 0.611
4.359LeuVal: 4.359 ± 0.631
0.581LeuTrp: 0.581 ± 0.204
2.906LeuTyr: 2.906 ± 0.461
0.0LeuXaa: 0.0 ± 0.0
Met
2.034MetAla: 2.034 ± 0.432
0.097MetCys: 0.097 ± 0.118
1.066MetAsp: 1.066 ± 0.327
2.228MetGlu: 2.228 ± 0.527
1.453MetPhe: 1.453 ± 0.56
1.162MetGly: 1.162 ± 0.276
0.0MetHis: 0.0 ± 0.0
1.453MetIle: 1.453 ± 0.424
1.937MetLys: 1.937 ± 0.568
2.034MetLeu: 2.034 ± 0.476
0.678MetMet: 0.678 ± 0.266
2.228MetAsn: 2.228 ± 0.539
0.291MetPro: 0.291 ± 0.136
0.969MetGln: 0.969 ± 0.322
1.55MetArg: 1.55 ± 0.471
1.744MetSer: 1.744 ± 0.482
2.034MetThr: 2.034 ± 0.418
1.744MetVal: 1.744 ± 0.389
0.097MetTrp: 0.097 ± 0.092
0.484MetTyr: 0.484 ± 0.187
0.0MetXaa: 0.0 ± 0.0
Asn
4.262AsnAla: 4.262 ± 0.566
0.194AsnCys: 0.194 ± 0.11
2.712AsnAsp: 2.712 ± 0.444
4.94AsnGlu: 4.94 ± 0.534
2.616AsnPhe: 2.616 ± 0.531
4.262AsnGly: 4.262 ± 0.715
1.259AsnHis: 1.259 ± 0.356
3.681AsnIle: 3.681 ± 0.539
5.134AsnLys: 5.134 ± 0.631
4.65AsnLeu: 4.65 ± 0.724
0.775AsnMet: 0.775 ± 0.398
1.55AsnAsn: 1.55 ± 0.353
2.325AsnPro: 2.325 ± 0.435
3.681AsnGln: 3.681 ± 0.672
2.325AsnArg: 2.325 ± 0.615
2.809AsnSer: 2.809 ± 0.681
2.616AsnThr: 2.616 ± 0.412
4.262AsnVal: 4.262 ± 0.713
0.969AsnTrp: 0.969 ± 0.307
0.969AsnTyr: 0.969 ± 0.331
0.0AsnXaa: 0.0 ± 0.0
Pro
1.744ProAla: 1.744 ± 0.432
0.291ProCys: 0.291 ± 0.189
2.131ProAsp: 2.131 ± 0.479
2.034ProGlu: 2.034 ± 0.448
1.066ProPhe: 1.066 ± 0.431
0.678ProGly: 0.678 ± 0.269
0.581ProHis: 0.581 ± 0.197
2.034ProIle: 2.034 ± 0.559
2.519ProLys: 2.519 ± 0.553
2.712ProLeu: 2.712 ± 0.558
0.291ProMet: 0.291 ± 0.16
0.775ProAsn: 0.775 ± 0.326
0.291ProPro: 0.291 ± 0.214
1.453ProGln: 1.453 ± 0.511
0.872ProArg: 0.872 ± 0.285
1.453ProSer: 1.453 ± 0.356
1.841ProThr: 1.841 ± 0.505
2.325ProVal: 2.325 ± 0.33
0.291ProTrp: 0.291 ± 0.161
0.775ProTyr: 0.775 ± 0.385
0.0ProXaa: 0.0 ± 0.0
Gln
4.165GlnAla: 4.165 ± 0.907
0.194GlnCys: 0.194 ± 0.15
1.647GlnAsp: 1.647 ± 0.468
4.262GlnGlu: 4.262 ± 0.752
1.744GlnPhe: 1.744 ± 0.387
1.744GlnGly: 1.744 ± 0.38
0.581GlnHis: 0.581 ± 0.333
2.422GlnIle: 2.422 ± 0.48
2.906GlnLys: 2.906 ± 0.492
2.906GlnLeu: 2.906 ± 0.536
1.356GlnMet: 1.356 ± 0.426
2.519GlnAsn: 2.519 ± 0.423
1.259GlnPro: 1.259 ± 0.422
1.453GlnGln: 1.453 ± 0.434
2.422GlnArg: 2.422 ± 0.644
3.487GlnSer: 3.487 ± 0.522
2.906GlnThr: 2.906 ± 0.575
2.422GlnVal: 2.422 ± 0.584
0.387GlnTrp: 0.387 ± 0.219
0.969GlnTyr: 0.969 ± 0.321
0.0GlnXaa: 0.0 ± 0.0
Arg
3.1ArgAla: 3.1 ± 0.757
0.291ArgCys: 0.291 ± 0.147
1.841ArgAsp: 1.841 ± 0.416
4.069ArgGlu: 4.069 ± 0.695
1.841ArgPhe: 1.841 ± 0.42
1.937ArgGly: 1.937 ± 0.447
0.872ArgHis: 0.872 ± 0.331
3.681ArgIle: 3.681 ± 0.793
4.359ArgLys: 4.359 ± 0.841
3.972ArgLeu: 3.972 ± 0.746
1.453ArgMet: 1.453 ± 0.278
3.778ArgAsn: 3.778 ± 0.497
1.647ArgPro: 1.647 ± 0.39
1.744ArgGln: 1.744 ± 0.496
2.422ArgArg: 2.422 ± 0.495
1.55ArgSer: 1.55 ± 0.303
2.616ArgThr: 2.616 ± 0.803
2.616ArgVal: 2.616 ± 0.508
1.162ArgTrp: 1.162 ± 0.306
2.228ArgTyr: 2.228 ± 0.587
0.0ArgXaa: 0.0 ± 0.0
Ser
5.231SerAla: 5.231 ± 0.827
0.484SerCys: 0.484 ± 0.229
2.809SerAsp: 2.809 ± 0.779
4.844SerGlu: 4.844 ± 0.798
2.906SerPhe: 2.906 ± 0.553
3.681SerGly: 3.681 ± 0.92
0.775SerHis: 0.775 ± 0.354
3.584SerIle: 3.584 ± 0.699
4.65SerLys: 4.65 ± 0.831
5.328SerLeu: 5.328 ± 0.71
1.356SerMet: 1.356 ± 0.484
3.39SerAsn: 3.39 ± 0.547
1.841SerPro: 1.841 ± 0.341
2.519SerGln: 2.519 ± 0.797
2.616SerArg: 2.616 ± 0.782
3.681SerSer: 3.681 ± 0.719
3.003SerThr: 3.003 ± 0.78
3.681SerVal: 3.681 ± 0.945
0.678SerTrp: 0.678 ± 0.21
2.325SerTyr: 2.325 ± 0.369
0.0SerXaa: 0.0 ± 0.0
Thr
4.069ThrAla: 4.069 ± 1.084
0.097ThrCys: 0.097 ± 0.092
4.456ThrAsp: 4.456 ± 0.699
3.584ThrGlu: 3.584 ± 0.544
2.809ThrPhe: 2.809 ± 0.741
4.069ThrGly: 4.069 ± 0.864
0.872ThrHis: 0.872 ± 0.27
4.844ThrIle: 4.844 ± 0.654
5.037ThrLys: 5.037 ± 0.538
5.522ThrLeu: 5.522 ± 0.681
0.581ThrMet: 0.581 ± 0.238
2.616ThrAsn: 2.616 ± 0.473
0.775ThrPro: 0.775 ± 0.22
2.228ThrGln: 2.228 ± 0.68
2.228ThrArg: 2.228 ± 0.395
3.681ThrSer: 3.681 ± 0.925
4.456ThrThr: 4.456 ± 1.143
5.425ThrVal: 5.425 ± 0.805
0.775ThrTrp: 0.775 ± 0.28
1.937ThrTyr: 1.937 ± 0.477
0.0ThrXaa: 0.0 ± 0.0
Val
3.39ValAla: 3.39 ± 0.687
0.484ValCys: 0.484 ± 0.234
3.294ValAsp: 3.294 ± 0.477
6.006ValGlu: 6.006 ± 0.841
2.034ValPhe: 2.034 ± 0.49
4.359ValGly: 4.359 ± 0.639
1.066ValHis: 1.066 ± 0.286
3.584ValIle: 3.584 ± 0.624
6.587ValLys: 6.587 ± 0.644
4.844ValLeu: 4.844 ± 0.87
1.162ValMet: 1.162 ± 0.325
3.294ValAsn: 3.294 ± 0.47
1.259ValPro: 1.259 ± 0.387
2.616ValGln: 2.616 ± 0.474
1.937ValArg: 1.937 ± 0.585
3.875ValSer: 3.875 ± 0.58
5.715ValThr: 5.715 ± 0.816
4.94ValVal: 4.94 ± 0.799
1.162ValTrp: 1.162 ± 0.33
1.453ValTyr: 1.453 ± 0.559
0.0ValXaa: 0.0 ± 0.0
Trp
0.581TrpAla: 0.581 ± 0.271
0.097TrpCys: 0.097 ± 0.092
0.484TrpAsp: 0.484 ± 0.235
1.162TrpGlu: 1.162 ± 0.427
0.484TrpPhe: 0.484 ± 0.236
0.775TrpGly: 0.775 ± 0.282
0.484TrpHis: 0.484 ± 0.23
0.969TrpIle: 0.969 ± 0.331
0.678TrpLys: 0.678 ± 0.246
1.259TrpLeu: 1.259 ± 0.313
0.484TrpMet: 0.484 ± 0.19
1.647TrpAsn: 1.647 ± 0.399
0.097TrpPro: 0.097 ± 0.08
0.872TrpGln: 0.872 ± 0.244
0.678TrpArg: 0.678 ± 0.269
0.872TrpSer: 0.872 ± 0.305
0.678TrpThr: 0.678 ± 0.203
0.678TrpVal: 0.678 ± 0.241
0.194TrpTrp: 0.194 ± 0.096
0.775TrpTyr: 0.775 ± 0.677
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.453TyrAla: 1.453 ± 0.348
0.484TyrCys: 0.484 ± 0.216
1.937TyrAsp: 1.937 ± 0.499
2.906TyrGlu: 2.906 ± 0.478
1.647TyrPhe: 1.647 ± 0.599
1.841TyrGly: 1.841 ± 0.417
0.775TyrHis: 0.775 ± 0.228
2.422TyrIle: 2.422 ± 0.47
2.325TyrLys: 2.325 ± 0.619
3.875TyrLeu: 3.875 ± 0.639
1.066TyrMet: 1.066 ± 0.422
1.841TyrAsn: 1.841 ± 0.42
1.647TyrPro: 1.647 ± 0.345
1.356TyrGln: 1.356 ± 0.576
1.937TyrArg: 1.937 ± 0.535
1.453TyrSer: 1.453 ± 0.339
1.453TyrThr: 1.453 ± 0.252
1.841TyrVal: 1.841 ± 0.385
0.484TyrTrp: 0.484 ± 0.189
2.131TyrTyr: 2.131 ± 0.898
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 52 proteins (10324 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski