Amino acid dipepetide frequency for Streptococcus phage Javan126

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.794AlaAla: 3.794 ± 1.084
0.37AlaCys: 0.37 ± 0.195
3.609AlaAsp: 3.609 ± 0.462
5.09AlaGlu: 5.09 ± 0.753
2.128AlaPhe: 2.128 ± 0.528
3.887AlaGly: 3.887 ± 0.941
0.74AlaHis: 0.74 ± 0.287
5.182AlaIle: 5.182 ± 0.742
6.293AlaLys: 6.293 ± 1.025
6.57AlaLeu: 6.57 ± 1.402
1.481AlaMet: 1.481 ± 0.379
4.72AlaAsn: 4.72 ± 0.876
1.573AlaPro: 1.573 ± 0.458
3.517AlaGln: 3.517 ± 0.863
2.776AlaArg: 2.776 ± 0.472
5.182AlaSer: 5.182 ± 1.296
3.424AlaThr: 3.424 ± 0.554
3.609AlaVal: 3.609 ± 0.614
0.37AlaTrp: 0.37 ± 0.251
1.943AlaTyr: 1.943 ± 0.38
0.0AlaXaa: 0.0 ± 0.0
Cys
0.185CysAla: 0.185 ± 0.119
0.0CysCys: 0.0 ± 0.0
0.185CysAsp: 0.185 ± 0.151
0.555CysGlu: 0.555 ± 0.269
0.185CysPhe: 0.185 ± 0.136
0.37CysGly: 0.37 ± 0.163
0.093CysHis: 0.093 ± 0.096
0.74CysIle: 0.74 ± 0.26
0.463CysLys: 0.463 ± 0.227
0.093CysLeu: 0.093 ± 0.085
0.185CysMet: 0.185 ± 0.154
0.093CysAsn: 0.093 ± 0.088
0.093CysPro: 0.093 ± 0.093
0.0CysGln: 0.0 ± 0.0
0.185CysArg: 0.185 ± 0.129
0.093CysSer: 0.093 ± 0.094
0.0CysThr: 0.0 ± 0.0
0.37CysVal: 0.37 ± 0.172
0.185CysTrp: 0.185 ± 0.132
0.185CysTyr: 0.185 ± 0.203
0.0CysXaa: 0.0 ± 0.0
Asp
3.517AspAla: 3.517 ± 0.486
0.37AspCys: 0.37 ± 0.18
4.257AspAsp: 4.257 ± 0.737
5.552AspGlu: 5.552 ± 0.611
3.146AspPhe: 3.146 ± 0.535
5.09AspGly: 5.09 ± 0.945
0.555AspHis: 0.555 ± 0.225
5.552AspIle: 5.552 ± 0.833
6.663AspLys: 6.663 ± 0.678
6.756AspLeu: 6.756 ± 0.8
1.851AspMet: 1.851 ± 0.406
3.794AspAsn: 3.794 ± 0.447
1.573AspPro: 1.573 ± 0.346
1.943AspGln: 1.943 ± 0.361
1.758AspArg: 1.758 ± 0.451
3.794AspSer: 3.794 ± 0.443
2.961AspThr: 2.961 ± 0.49
3.609AspVal: 3.609 ± 0.523
1.388AspTrp: 1.388 ± 0.277
3.609AspTyr: 3.609 ± 0.709
0.0AspXaa: 0.0 ± 0.0
Glu
5.83GluAla: 5.83 ± 1.194
0.463GluCys: 0.463 ± 0.224
3.424GluAsp: 3.424 ± 0.555
6.663GluGlu: 6.663 ± 1.094
3.146GluPhe: 3.146 ± 0.571
3.424GluGly: 3.424 ± 0.65
0.833GluHis: 0.833 ± 0.276
7.218GluIle: 7.218 ± 0.873
6.2GluLys: 6.2 ± 0.755
8.421GluLeu: 8.421 ± 0.93
2.221GluMet: 2.221 ± 0.462
3.517GluAsn: 3.517 ± 0.695
2.314GluPro: 2.314 ± 0.537
3.702GluGln: 3.702 ± 0.889
3.331GluArg: 3.331 ± 0.473
4.997GluSer: 4.997 ± 0.612
4.072GluThr: 4.072 ± 0.514
5.645GluVal: 5.645 ± 1.193
0.925GluTrp: 0.925 ± 0.29
2.499GluTyr: 2.499 ± 0.48
0.0GluXaa: 0.0 ± 0.0
Phe
1.758PheAla: 1.758 ± 0.336
0.093PheCys: 0.093 ± 0.118
3.609PheAsp: 3.609 ± 0.585
3.424PheGlu: 3.424 ± 0.768
1.573PhePhe: 1.573 ± 0.387
2.776PheGly: 2.776 ± 0.515
0.555PheHis: 0.555 ± 0.278
3.054PheIle: 3.054 ± 0.527
4.072PheLys: 4.072 ± 0.559
2.406PheLeu: 2.406 ± 0.418
1.11PheMet: 1.11 ± 0.305
2.776PheAsn: 2.776 ± 0.468
0.925PhePro: 0.925 ± 0.322
1.11PheGln: 1.11 ± 0.319
1.758PheArg: 1.758 ± 0.369
1.666PheSer: 1.666 ± 0.378
2.221PheThr: 2.221 ± 0.41
3.054PheVal: 3.054 ± 0.572
0.278PheTrp: 0.278 ± 0.154
1.018PheTyr: 1.018 ± 0.312
0.0PheXaa: 0.0 ± 0.0
Gly
3.146GlyAla: 3.146 ± 0.73
0.278GlyCys: 0.278 ± 0.142
4.164GlyAsp: 4.164 ± 0.795
4.442GlyGlu: 4.442 ± 0.735
2.314GlyPhe: 2.314 ± 0.525
3.609GlyGly: 3.609 ± 0.492
0.833GlyHis: 0.833 ± 0.199
4.349GlyIle: 4.349 ± 0.528
4.72GlyLys: 4.72 ± 0.609
6.756GlyLeu: 6.756 ± 0.908
2.036GlyMet: 2.036 ± 0.359
3.887GlyAsn: 3.887 ± 0.52
1.388GlyPro: 1.388 ± 0.484
2.684GlyGln: 2.684 ± 0.694
3.239GlyArg: 3.239 ± 0.536
1.573GlySer: 1.573 ± 0.476
3.517GlyThr: 3.517 ± 0.724
2.961GlyVal: 2.961 ± 0.666
1.018GlyTrp: 1.018 ± 0.3
2.776GlyTyr: 2.776 ± 0.6
0.0GlyXaa: 0.0 ± 0.0
His
0.648HisAla: 0.648 ± 0.274
0.0HisCys: 0.0 ± 0.0
0.833HisAsp: 0.833 ± 0.352
1.11HisGlu: 1.11 ± 0.293
0.555HisPhe: 0.555 ± 0.227
1.018HisGly: 1.018 ± 0.209
0.185HisHis: 0.185 ± 0.122
0.555HisIle: 0.555 ± 0.23
0.833HisLys: 0.833 ± 0.278
0.925HisLeu: 0.925 ± 0.284
0.463HisMet: 0.463 ± 0.232
0.74HisAsn: 0.74 ± 0.23
0.74HisPro: 0.74 ± 0.219
0.74HisGln: 0.74 ± 0.313
0.74HisArg: 0.74 ± 0.367
0.833HisSer: 0.833 ± 0.333
0.74HisThr: 0.74 ± 0.287
0.648HisVal: 0.648 ± 0.25
0.185HisTrp: 0.185 ± 0.128
0.37HisTyr: 0.37 ± 0.178
0.0HisXaa: 0.0 ± 0.0
Ile
5.182IleAla: 5.182 ± 0.74
0.37IleCys: 0.37 ± 0.178
6.663IleAsp: 6.663 ± 0.81
6.57IleGlu: 6.57 ± 0.893
2.869IlePhe: 2.869 ± 0.693
4.072IleGly: 4.072 ± 0.521
0.74IleHis: 0.74 ± 0.255
3.424IleIle: 3.424 ± 0.648
8.051IleLys: 8.051 ± 0.995
4.257IleLeu: 4.257 ± 0.738
0.648IleMet: 0.648 ± 0.263
4.349IleAsn: 4.349 ± 0.737
2.406IlePro: 2.406 ± 0.512
1.573IleGln: 1.573 ± 0.397
2.776IleArg: 2.776 ± 0.489
4.072IleSer: 4.072 ± 0.602
4.164IleThr: 4.164 ± 0.671
3.887IleVal: 3.887 ± 0.554
0.37IleTrp: 0.37 ± 0.166
3.054IleTyr: 3.054 ± 0.565
0.0IleXaa: 0.0 ± 0.0
Lys
6.848LysAla: 6.848 ± 1.117
0.463LysCys: 0.463 ± 0.184
6.941LysAsp: 6.941 ± 0.607
7.496LysGlu: 7.496 ± 0.95
3.517LysPhe: 3.517 ± 0.62
5.275LysGly: 5.275 ± 0.962
0.833LysHis: 0.833 ± 0.305
6.756LysIle: 6.756 ± 0.777
7.126LysLys: 7.126 ± 0.815
8.791LysLeu: 8.791 ± 0.902
2.406LysMet: 2.406 ± 0.42
6.663LysAsn: 6.663 ± 0.746
2.406LysPro: 2.406 ± 0.521
3.609LysGln: 3.609 ± 0.633
4.349LysArg: 4.349 ± 0.614
4.349LysSer: 4.349 ± 0.608
5.09LysThr: 5.09 ± 0.718
6.293LysVal: 6.293 ± 0.883
1.018LysTrp: 1.018 ± 0.255
2.314LysTyr: 2.314 ± 0.479
0.0LysXaa: 0.0 ± 0.0
Leu
5.645LeuAla: 5.645 ± 1.004
0.093LeuCys: 0.093 ± 0.096
6.478LeuAsp: 6.478 ± 0.743
6.756LeuGlu: 6.756 ± 0.955
2.684LeuPhe: 2.684 ± 0.589
5.09LeuGly: 5.09 ± 0.788
1.11LeuHis: 1.11 ± 0.357
4.072LeuIle: 4.072 ± 0.53
10.827LeuLys: 10.827 ± 0.993
7.033LeuLeu: 7.033 ± 0.806
1.943LeuMet: 1.943 ± 0.5
5.275LeuAsn: 5.275 ± 0.819
3.517LeuPro: 3.517 ± 0.486
3.424LeuGln: 3.424 ± 0.626
3.054LeuArg: 3.054 ± 0.688
7.496LeuSer: 7.496 ± 0.759
5.83LeuThr: 5.83 ± 0.82
3.979LeuVal: 3.979 ± 0.48
0.648LeuTrp: 0.648 ± 0.196
2.036LeuTyr: 2.036 ± 0.472
0.0LeuXaa: 0.0 ± 0.0
Met
1.758MetAla: 1.758 ± 0.329
0.093MetCys: 0.093 ± 0.093
1.388MetAsp: 1.388 ± 0.35
1.943MetGlu: 1.943 ± 0.424
0.925MetPhe: 0.925 ± 0.27
0.833MetGly: 0.833 ± 0.262
0.0MetHis: 0.0 ± 0.0
2.591MetIle: 2.591 ± 0.336
1.481MetLys: 1.481 ± 0.333
2.128MetLeu: 2.128 ± 0.521
0.74MetMet: 0.74 ± 0.335
0.925MetAsn: 0.925 ± 0.265
0.555MetPro: 0.555 ± 0.222
0.463MetGln: 0.463 ± 0.199
1.481MetArg: 1.481 ± 0.413
1.573MetSer: 1.573 ± 0.443
2.869MetThr: 2.869 ± 0.553
1.758MetVal: 1.758 ± 0.376
0.37MetTrp: 0.37 ± 0.202
1.018MetTyr: 1.018 ± 0.343
0.0MetXaa: 0.0 ± 0.0
Asn
3.239AsnAla: 3.239 ± 0.774
0.555AsnCys: 0.555 ± 0.207
3.794AsnAsp: 3.794 ± 0.477
2.869AsnGlu: 2.869 ± 0.51
2.314AsnPhe: 2.314 ± 0.453
4.627AsnGly: 4.627 ± 0.567
1.11AsnHis: 1.11 ± 0.339
3.794AsnIle: 3.794 ± 0.695
4.072AsnLys: 4.072 ± 0.555
4.905AsnLeu: 4.905 ± 0.545
1.851AsnMet: 1.851 ± 0.439
3.239AsnAsn: 3.239 ± 0.666
2.684AsnPro: 2.684 ± 0.437
3.424AsnGln: 3.424 ± 0.619
3.146AsnArg: 3.146 ± 0.51
3.239AsnSer: 3.239 ± 0.742
3.424AsnThr: 3.424 ± 0.518
3.517AsnVal: 3.517 ± 0.82
0.833AsnTrp: 0.833 ± 0.266
2.591AsnTyr: 2.591 ± 0.529
0.0AsnXaa: 0.0 ± 0.0
Pro
1.481ProAla: 1.481 ± 0.338
0.185ProCys: 0.185 ± 0.131
2.221ProAsp: 2.221 ± 0.49
2.406ProGlu: 2.406 ± 0.354
1.296ProPhe: 1.296 ± 0.291
0.925ProGly: 0.925 ± 0.208
0.37ProHis: 0.37 ± 0.194
2.314ProIle: 2.314 ± 0.395
3.239ProLys: 3.239 ± 0.502
2.036ProLeu: 2.036 ± 0.398
0.555ProMet: 0.555 ± 0.282
1.758ProAsn: 1.758 ± 0.471
0.74ProPro: 0.74 ± 0.198
1.296ProGln: 1.296 ± 0.347
1.296ProArg: 1.296 ± 0.346
1.943ProSer: 1.943 ± 0.42
2.036ProThr: 2.036 ± 0.396
1.666ProVal: 1.666 ± 0.289
0.185ProTrp: 0.185 ± 0.121
0.925ProTyr: 0.925 ± 0.281
0.0ProXaa: 0.0 ± 0.0
Gln
2.128GlnAla: 2.128 ± 0.495
0.185GlnCys: 0.185 ± 0.132
1.758GlnAsp: 1.758 ± 0.423
3.146GlnGlu: 3.146 ± 0.518
1.388GlnPhe: 1.388 ± 0.397
1.573GlnGly: 1.573 ± 0.387
0.555GlnHis: 0.555 ± 0.202
2.036GlnIle: 2.036 ± 0.31
4.905GlnLys: 4.905 ± 0.656
4.72GlnLeu: 4.72 ± 0.777
0.925GlnMet: 0.925 ± 0.382
2.591GlnAsn: 2.591 ± 0.534
1.018GlnPro: 1.018 ± 0.312
1.018GlnGln: 1.018 ± 0.334
2.128GlnArg: 2.128 ± 0.472
3.146GlnSer: 3.146 ± 0.548
2.406GlnThr: 2.406 ± 0.419
2.406GlnVal: 2.406 ± 0.44
0.185GlnTrp: 0.185 ± 0.122
1.481GlnTyr: 1.481 ± 0.34
0.0GlnXaa: 0.0 ± 0.0
Arg
2.961ArgAla: 2.961 ± 0.677
0.093ArgCys: 0.093 ± 0.101
3.146ArgAsp: 3.146 ± 0.541
3.239ArgGlu: 3.239 ± 0.568
1.481ArgPhe: 1.481 ± 0.293
2.314ArgGly: 2.314 ± 0.47
1.018ArgHis: 1.018 ± 0.365
3.146ArgIle: 3.146 ± 0.768
3.331ArgLys: 3.331 ± 0.648
3.794ArgLeu: 3.794 ± 0.499
1.203ArgMet: 1.203 ± 0.303
2.406ArgAsn: 2.406 ± 0.542
0.74ArgPro: 0.74 ± 0.213
2.221ArgGln: 2.221 ± 0.376
1.943ArgArg: 1.943 ± 0.534
1.851ArgSer: 1.851 ± 0.343
2.036ArgThr: 2.036 ± 0.417
2.776ArgVal: 2.776 ± 0.358
0.925ArgTrp: 0.925 ± 0.306
2.499ArgTyr: 2.499 ± 0.433
0.0ArgXaa: 0.0 ± 0.0
Ser
4.627SerAla: 4.627 ± 0.705
0.185SerCys: 0.185 ± 0.13
4.627SerAsp: 4.627 ± 0.864
4.997SerGlu: 4.997 ± 0.665
2.684SerPhe: 2.684 ± 0.495
3.979SerGly: 3.979 ± 0.714
1.11SerHis: 1.11 ± 0.356
3.979SerIle: 3.979 ± 0.555
5.182SerLys: 5.182 ± 0.773
3.887SerLeu: 3.887 ± 0.569
1.11SerMet: 1.11 ± 0.34
3.979SerAsn: 3.979 ± 0.777
2.036SerPro: 2.036 ± 0.406
2.591SerGln: 2.591 ± 0.47
2.036SerArg: 2.036 ± 0.412
3.054SerSer: 3.054 ± 0.667
3.054SerThr: 3.054 ± 0.632
3.609SerVal: 3.609 ± 0.603
0.463SerTrp: 0.463 ± 0.202
1.851SerTyr: 1.851 ± 0.423
0.0SerXaa: 0.0 ± 0.0
Thr
5.09ThrAla: 5.09 ± 1.118
0.185ThrCys: 0.185 ± 0.17
2.961ThrAsp: 2.961 ± 0.462
3.702ThrGlu: 3.702 ± 0.681
2.776ThrPhe: 2.776 ± 0.504
4.627ThrGly: 4.627 ± 0.6
0.37ThrHis: 0.37 ± 0.156
3.887ThrIle: 3.887 ± 0.525
6.2ThrLys: 6.2 ± 0.664
4.812ThrLeu: 4.812 ± 0.533
1.018ThrMet: 1.018 ± 0.274
2.684ThrAsn: 2.684 ± 0.446
1.851ThrPro: 1.851 ± 0.392
1.943ThrGln: 1.943 ± 0.297
1.758ThrArg: 1.758 ± 0.436
3.979ThrSer: 3.979 ± 0.548
3.702ThrThr: 3.702 ± 0.785
3.331ThrVal: 3.331 ± 0.533
0.555ThrTrp: 0.555 ± 0.241
2.314ThrTyr: 2.314 ± 0.586
0.0ThrXaa: 0.0 ± 0.0
Val
5.552ValAla: 5.552 ± 0.924
0.185ValCys: 0.185 ± 0.173
4.535ValAsp: 4.535 ± 0.64
5.09ValGlu: 5.09 ± 0.776
2.036ValPhe: 2.036 ± 0.366
3.146ValGly: 3.146 ± 0.429
0.555ValHis: 0.555 ± 0.235
3.424ValIle: 3.424 ± 0.623
4.997ValLys: 4.997 ± 0.541
4.72ValLeu: 4.72 ± 0.575
1.666ValMet: 1.666 ± 0.431
2.869ValAsn: 2.869 ± 0.385
1.11ValPro: 1.11 ± 0.321
1.758ValGln: 1.758 ± 0.283
3.239ValArg: 3.239 ± 0.503
3.702ValSer: 3.702 ± 0.438
4.535ValThr: 4.535 ± 0.825
4.535ValVal: 4.535 ± 0.64
0.74ValTrp: 0.74 ± 0.265
2.406ValTyr: 2.406 ± 0.748
0.0ValXaa: 0.0 ± 0.0
Trp
0.648TrpAla: 0.648 ± 0.219
0.093TrpCys: 0.093 ± 0.096
0.463TrpAsp: 0.463 ± 0.226
0.925TrpGlu: 0.925 ± 0.174
0.555TrpPhe: 0.555 ± 0.255
0.833TrpGly: 0.833 ± 0.203
0.37TrpHis: 0.37 ± 0.165
1.018TrpIle: 1.018 ± 0.32
0.925TrpLys: 0.925 ± 0.341
1.11TrpLeu: 1.11 ± 0.291
0.37TrpMet: 0.37 ± 0.201
0.74TrpAsn: 0.74 ± 0.293
0.093TrpPro: 0.093 ± 0.103
0.555TrpGln: 0.555 ± 0.173
0.37TrpArg: 0.37 ± 0.163
0.555TrpSer: 0.555 ± 0.216
0.555TrpThr: 0.555 ± 0.207
0.37TrpVal: 0.37 ± 0.159
0.093TrpTrp: 0.093 ± 0.096
0.648TrpTyr: 0.648 ± 0.251
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.314TyrAla: 2.314 ± 0.474
0.0TyrCys: 0.0 ± 0.0
2.684TyrAsp: 2.684 ± 0.423
2.869TyrGlu: 2.869 ± 0.443
1.666TyrPhe: 1.666 ± 0.367
2.221TyrGly: 2.221 ± 0.469
0.925TyrHis: 0.925 ± 0.302
2.406TyrIle: 2.406 ± 0.793
2.961TyrLys: 2.961 ± 0.592
2.869TyrLeu: 2.869 ± 0.61
1.018TyrMet: 1.018 ± 0.342
2.221TyrAsn: 2.221 ± 0.489
1.11TyrPro: 1.11 ± 0.31
2.128TyrGln: 2.128 ± 0.604
1.758TyrArg: 1.758 ± 0.41
1.943TyrSer: 1.943 ± 0.406
1.11TyrThr: 1.11 ± 0.39
2.776TyrVal: 2.776 ± 0.645
0.555TyrTrp: 0.555 ± 0.202
1.851TyrTyr: 1.851 ± 0.406
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (10807 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski