Amino acid dipeptide frequency for Streptococcus phage Javan420

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
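The counting described above can be sketched in Python. This is a minimal illustration, not the database's own code: the function names, the one-letter input alphabet, the mapping of every non-standard residue to X (Xaa) and the normalisation by the total number of dipeptides are all assumptions, and the toy sequences are made up.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes
UNIFORM_PER_MILLE = 1000 / 400     # 2.5 per mille = 0.25 %, the uniform expectation


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


toy_proteins = ["MKKLA", "MKEL"]  # hypothetical sequences, not from the phage
freqs = dipeptide_frequencies(toy_proteins)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 1), classify(value))
```

With real data, `classify` applied to the table values reproduces the over/under split against the 2.5‰ threshold, for example "over" for AlaAla (2.588) and "under" for AlaCys (0.685).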

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
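As a small illustration of the unit conversion, the sketch below turns a per mille value from the table into a fraction and into percent. The helper names are hypothetical and only serve this example.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value (as printed in the table) to a plain fraction."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 % equals 10 per mille)."""
    return value / 10


ala_ala = 2.588  # AlaAla value from the table, in per mille
print(per_mille_to_fraction(ala_ala))  # fraction of all dipeptides
print(per_mille_to_percent(ala_ala))   # the same value in percent
```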

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.588 ± 0.61
AlaCys: 0.685 ± 0.231
AlaAsp: 5.252 ± 0.666
AlaGlu: 6.165 ± 0.558
AlaPhe: 2.664 ± 0.502
AlaGly: 4.795 ± 1.034
AlaHis: 0.609 ± 0.253
AlaIle: 6.241 ± 0.873
AlaLys: 6.013 ± 0.677
AlaLeu: 5.024 ± 0.959
AlaMet: 2.055 ± 0.461
AlaAsn: 4.186 ± 0.541
AlaPro: 1.218 ± 0.286
AlaGln: 2.74 ± 0.42
AlaArg: 2.436 ± 0.46
AlaSer: 4.034 ± 0.656
AlaThr: 3.501 ± 0.519
AlaVal: 4.947 ± 0.743
AlaTrp: 1.446 ± 0.483
AlaTyr: 2.36 ± 0.43
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.076 ± 0.076
CysCys: 0.0 ± 0.0
CysAsp: 0.381 ± 0.192
CysGlu: 0.913 ± 0.255
CysPhe: 0.381 ± 0.187
CysGly: 0.761 ± 0.314
CysHis: 0.152 ± 0.104
CysIle: 0.457 ± 0.197
CysLys: 0.533 ± 0.212
CysLeu: 0.685 ± 0.231
CysMet: 0.152 ± 0.117
CysAsn: 0.152 ± 0.104
CysPro: 0.076 ± 0.076
CysGln: 0.152 ± 0.092
CysArg: 0.304 ± 0.134
CysSer: 0.533 ± 0.196
CysThr: 0.076 ± 0.073
CysVal: 0.152 ± 0.134
CysTrp: 0.076 ± 0.073
CysTyr: 0.152 ± 0.112
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.1 ± 0.608
AspCys: 0.457 ± 0.174
AspAsp: 3.577 ± 0.527
AspGlu: 4.947 ± 0.824
AspPhe: 3.273 ± 0.501
AspGly: 4.034 ± 0.565
AspHis: 1.142 ± 0.294
AspIle: 4.491 ± 0.473
AspLys: 6.165 ± 0.775
AspLeu: 6.318 ± 0.849
AspMet: 1.37 ± 0.373
AspAsn: 3.349 ± 0.731
AspPro: 1.37 ± 0.279
AspGln: 0.761 ± 0.255
AspArg: 3.121 ± 0.515
AspSer: 3.349 ± 0.568
AspThr: 3.577 ± 0.625
AspVal: 3.73 ± 0.553
AspTrp: 1.218 ± 0.344
AspTyr: 4.415 ± 0.792
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.709 ± 0.699
GluCys: 0.381 ± 0.196
GluAsp: 3.045 ± 0.544
GluGlu: 6.013 ± 0.823
GluPhe: 3.121 ± 0.527
GluGly: 3.958 ± 0.583
GluHis: 0.989 ± 0.237
GluIle: 6.47 ± 0.723
GluLys: 7.079 ± 0.904
GluLeu: 7.612 ± 0.865
GluMet: 1.903 ± 0.361
GluAsn: 3.958 ± 0.473
GluPro: 2.055 ± 0.403
GluGln: 3.577 ± 0.578
GluArg: 3.121 ± 0.557
GluSer: 5.176 ± 0.564
GluThr: 4.871 ± 0.759
GluVal: 5.252 ± 0.578
GluTrp: 0.304 ± 0.157
GluTyr: 3.882 ± 0.533
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.903 ± 0.441
PheCys: 0.152 ± 0.11
PheAsp: 4.034 ± 0.675
PheGlu: 3.882 ± 0.539
PhePhe: 1.675 ± 0.35
PheGly: 2.816 ± 0.451
PheHis: 0.685 ± 0.229
PheIle: 3.197 ± 0.508
PheLys: 3.73 ± 0.5
PheLeu: 3.045 ± 0.624
PheMet: 0.685 ± 0.208
PheAsn: 2.131 ± 0.371
PhePro: 0.761 ± 0.258
PheGln: 0.989 ± 0.301
PheArg: 1.446 ± 0.305
PheSer: 2.664 ± 0.592
PheThr: 2.664 ± 0.466
PheVal: 2.512 ± 0.338
PheTrp: 0.152 ± 0.107
PheTyr: 1.903 ± 0.375
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.719 ± 0.635
GlyCys: 0.533 ± 0.235
GlyAsp: 4.871 ± 0.702
GlyGlu: 4.034 ± 0.697
GlyPhe: 2.892 ± 0.589
GlyGly: 3.806 ± 0.476
GlyHis: 0.989 ± 0.229
GlyIle: 3.349 ± 0.643
GlyLys: 5.024 ± 0.608
GlyLeu: 5.709 ± 0.968
GlyMet: 1.294 ± 0.334
GlyAsn: 3.806 ± 0.561
GlyPro: 1.218 ± 0.379
GlyGln: 1.675 ± 0.399
GlyArg: 2.207 ± 0.408
GlySer: 2.588 ± 0.366
GlyThr: 3.045 ± 0.425
GlyVal: 4.871 ± 0.721
GlyTrp: 0.685 ± 0.264
GlyTyr: 2.968 ± 0.514
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.609 ± 0.228
HisCys: 0.152 ± 0.092
HisAsp: 0.761 ± 0.275
HisGlu: 1.066 ± 0.3
HisPhe: 0.761 ± 0.277
HisGly: 0.457 ± 0.165
HisHis: 0.152 ± 0.107
HisIle: 0.533 ± 0.23
HisLys: 1.294 ± 0.294
HisLeu: 1.142 ± 0.278
HisMet: 0.152 ± 0.102
HisAsn: 0.609 ± 0.217
HisPro: 0.228 ± 0.116
HisGln: 0.761 ± 0.255
HisArg: 0.381 ± 0.173
HisSer: 0.533 ± 0.188
HisThr: 0.989 ± 0.282
HisVal: 1.142 ± 0.294
HisTrp: 0.076 ± 0.073
HisTyr: 0.381 ± 0.149
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.328 ± 0.533
IleCys: 0.685 ± 0.226
IleAsp: 5.785 ± 0.631
IleGlu: 5.937 ± 0.668
IlePhe: 1.827 ± 0.327
IleGly: 4.491 ± 0.69
IleHis: 0.685 ± 0.298
IleIle: 4.491 ± 0.802
IleLys: 9.134 ± 0.903
IleLeu: 5.633 ± 0.785
IleMet: 1.37 ± 0.374
IleAsn: 3.73 ± 0.494
IlePro: 1.903 ± 0.383
IleGln: 2.055 ± 0.411
IleArg: 2.664 ± 0.529
IleSer: 4.034 ± 0.638
IleThr: 4.947 ± 0.682
IleVal: 4.339 ± 0.759
IleTrp: 0.837 ± 0.413
IleTyr: 2.816 ± 0.486
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.85 ± 0.853
LysCys: 0.457 ± 0.215
LysAsp: 4.795 ± 0.706
LysGlu: 7.231 ± 1.02
LysPhe: 2.512 ± 0.449
LysGly: 5.633 ± 0.779
LysHis: 1.066 ± 0.25
LysIle: 6.698 ± 0.869
LysLys: 7.916 ± 1.216
LysLeu: 8.297 ± 0.784
LysMet: 3.273 ± 0.561
LysAsn: 6.013 ± 0.67
LysPro: 2.74 ± 0.456
LysGln: 3.654 ± 0.509
LysArg: 4.262 ± 0.771
LysSer: 6.241 ± 0.695
LysThr: 5.709 ± 0.714
LysVal: 6.85 ± 0.827
LysTrp: 0.761 ± 0.238
LysTyr: 2.436 ± 0.483
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.079 ± 1.065
LeuCys: 0.228 ± 0.124
LeuAsp: 6.318 ± 0.833
LeuGlu: 6.241 ± 0.669
LeuPhe: 2.664 ± 0.404
LeuGly: 4.871 ± 0.558
LeuHis: 1.142 ± 0.333
LeuIle: 5.785 ± 0.691
LeuLys: 8.905 ± 0.734
LeuLeu: 5.556 ± 0.63
LeuMet: 1.522 ± 0.281
LeuAsn: 4.643 ± 0.601
LeuPro: 2.436 ± 0.387
LeuGln: 2.436 ± 0.404
LeuArg: 4.262 ± 0.683
LeuSer: 5.785 ± 0.811
LeuThr: 5.556 ± 0.592
LeuVal: 3.349 ± 0.54
LeuTrp: 0.989 ± 0.327
LeuTyr: 2.74 ± 0.448
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.816 ± 0.493
MetCys: 0.076 ± 0.073
MetAsp: 0.913 ± 0.251
MetGlu: 1.37 ± 0.373
MetPhe: 0.989 ± 0.243
MetGly: 0.989 ± 0.27
MetHis: 0.152 ± 0.105
MetIle: 1.827 ± 0.298
MetLys: 1.675 ± 0.337
MetLeu: 1.522 ± 0.331
MetMet: 0.381 ± 0.18
MetAsn: 1.37 ± 0.359
MetPro: 1.142 ± 0.32
MetGln: 1.37 ± 0.304
MetArg: 1.218 ± 0.264
MetSer: 1.979 ± 0.382
MetThr: 1.294 ± 0.28
MetVal: 0.837 ± 0.386
MetTrp: 0.381 ± 0.177
MetTyr: 0.533 ± 0.169
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.871 ± 0.927
AsnCys: 0.228 ± 0.13
AsnAsp: 2.36 ± 0.429
AsnGlu: 3.958 ± 0.547
AsnPhe: 2.664 ± 0.449
AsnGly: 3.882 ± 0.511
AsnHis: 0.533 ± 0.255
AsnIle: 3.654 ± 0.495
AsnLys: 4.643 ± 0.628
AsnLeu: 5.328 ± 0.525
AsnMet: 1.446 ± 0.331
AsnAsn: 3.501 ± 0.56
AsnPro: 1.675 ± 0.426
AsnGln: 2.588 ± 0.445
AsnArg: 1.827 ± 0.326
AsnSer: 4.643 ± 0.665
AsnThr: 2.892 ± 0.618
AsnVal: 3.045 ± 0.563
AsnTrp: 1.142 ± 0.271
AsnTyr: 1.903 ± 0.427
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.446 ± 0.352
ProCys: 0.228 ± 0.124
ProAsp: 2.588 ± 0.484
ProGlu: 2.436 ± 0.47
ProPhe: 1.066 ± 0.274
ProGly: 0.989 ± 0.303
ProHis: 0.533 ± 0.18
ProIle: 1.675 ± 0.349
ProLys: 2.664 ± 0.437
ProLeu: 1.446 ± 0.335
ProMet: 0.152 ± 0.112
ProAsn: 1.37 ± 0.3
ProPro: 0.457 ± 0.152
ProGln: 1.066 ± 0.279
ProArg: 1.446 ± 0.337
ProSer: 1.598 ± 0.398
ProThr: 1.446 ± 0.276
ProVal: 1.675 ± 0.401
ProTrp: 0.0 ± 0.0
ProTyr: 1.142 ± 0.278
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.045 ± 0.604
GlnCys: 0.304 ± 0.164
GlnAsp: 1.598 ± 0.344
GlnGlu: 2.74 ± 0.467
GlnPhe: 1.294 ± 0.299
GlnGly: 1.675 ± 0.395
GlnHis: 0.152 ± 0.099
GlnIle: 2.74 ± 0.444
GlnLys: 3.958 ± 0.583
GlnLeu: 3.577 ± 0.5
GlnMet: 0.837 ± 0.277
GlnAsn: 2.055 ± 0.397
GlnPro: 0.761 ± 0.255
GlnGln: 1.903 ± 0.423
GlnArg: 0.989 ± 0.238
GlnSer: 3.197 ± 0.512
GlnThr: 1.827 ± 0.338
GlnVal: 2.283 ± 0.331
GlnTrp: 0.304 ± 0.127
GlnTyr: 0.837 ± 0.217
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.207 ± 0.345
ArgCys: 0.457 ± 0.188
ArgAsp: 2.283 ± 0.51
ArgGlu: 3.577 ± 0.485
ArgPhe: 1.598 ± 0.306
ArgGly: 2.512 ± 0.497
ArgHis: 0.609 ± 0.248
ArgIle: 2.968 ± 0.515
ArgLys: 4.415 ± 0.549
ArgLeu: 4.186 ± 0.584
ArgMet: 0.761 ± 0.219
ArgAsn: 2.207 ± 0.4
ArgPro: 1.066 ± 0.28
ArgGln: 1.903 ± 0.454
ArgArg: 2.436 ± 0.503
ArgSer: 1.751 ± 0.336
ArgThr: 1.446 ± 0.29
ArgVal: 2.664 ± 0.392
ArgTrp: 0.457 ± 0.187
ArgTyr: 1.751 ± 0.504
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.186 ± 0.638
SerCys: 0.228 ± 0.157
SerAsp: 4.947 ± 0.631
SerGlu: 6.089 ± 0.653
SerPhe: 3.273 ± 0.507
SerGly: 4.795 ± 1.025
SerHis: 1.066 ± 0.226
SerIle: 4.186 ± 0.52
SerLys: 5.785 ± 0.666
SerLeu: 4.262 ± 0.553
SerMet: 1.294 ± 0.375
SerAsn: 3.73 ± 0.591
SerPro: 0.837 ± 0.255
SerGln: 1.294 ± 0.308
SerArg: 2.664 ± 0.435
SerSer: 3.501 ± 0.603
SerThr: 3.654 ± 0.508
SerVal: 4.491 ± 0.637
SerTrp: 0.837 ± 0.205
SerTyr: 1.827 ± 0.428
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.73 ± 0.516
ThrCys: 0.152 ± 0.096
ThrAsp: 3.349 ± 0.441
ThrGlu: 4.11 ± 0.592
ThrPhe: 2.283 ± 0.409
ThrGly: 3.73 ± 0.467
ThrHis: 0.457 ± 0.163
ThrIle: 5.252 ± 0.705
ThrLys: 5.861 ± 0.853
ThrLeu: 4.871 ± 0.578
ThrMet: 1.218 ± 0.269
ThrAsn: 3.501 ± 0.478
ThrPro: 2.207 ± 0.481
ThrGln: 2.207 ± 0.374
ThrArg: 1.827 ± 0.357
ThrSer: 3.197 ± 0.509
ThrThr: 4.719 ± 0.628
ThrVal: 3.045 ± 0.527
ThrTrp: 0.761 ± 0.252
ThrTyr: 2.436 ± 0.542
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.034 ± 0.573
ValCys: 0.304 ± 0.171
ValAsp: 4.643 ± 0.588
ValGlu: 4.415 ± 0.477
ValPhe: 3.121 ± 0.566
ValGly: 3.349 ± 0.582
ValHis: 0.457 ± 0.165
ValIle: 4.795 ± 0.694
ValLys: 4.491 ± 0.487
ValLeu: 4.339 ± 0.638
ValMet: 1.827 ± 0.383
ValAsn: 3.425 ± 0.447
ValPro: 1.751 ± 0.432
ValGln: 1.903 ± 0.314
ValArg: 2.664 ± 0.521
ValSer: 5.404 ± 0.66
ValThr: 4.262 ± 0.639
ValVal: 4.795 ± 0.786
ValTrp: 1.522 ± 0.47
ValTyr: 1.751 ± 0.345
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.457 ± 0.207
TrpCys: 0.228 ± 0.122
TrpAsp: 0.761 ± 0.296
TrpGlu: 1.142 ± 0.546
TrpPhe: 0.837 ± 0.231
TrpGly: 0.533 ± 0.168
TrpHis: 0.304 ± 0.119
TrpIle: 0.913 ± 0.31
TrpLys: 0.989 ± 0.251
TrpLeu: 0.761 ± 0.239
TrpMet: 0.304 ± 0.149
TrpAsn: 1.066 ± 0.366
TrpPro: 0.228 ± 0.122
TrpGln: 0.685 ± 0.204
TrpArg: 0.609 ± 0.224
TrpSer: 0.685 ± 0.203
TrpThr: 0.761 ± 0.199
TrpVal: 0.533 ± 0.208
TrpTrp: 0.152 ± 0.102
TrpTyr: 0.381 ± 0.144
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.283 ± 0.454
TyrCys: 0.228 ± 0.12
TyrAsp: 3.501 ± 0.506
TyrGlu: 2.207 ± 0.39
TyrPhe: 2.131 ± 0.446
TyrGly: 2.207 ± 0.475
TyrHis: 0.304 ± 0.14
TyrIle: 2.968 ± 0.536
TyrLys: 3.121 ± 0.543
TyrLeu: 3.197 ± 0.646
TyrMet: 0.761 ± 0.23
TyrAsn: 1.979 ± 0.31
TyrPro: 1.294 ± 0.366
TyrGln: 2.36 ± 0.445
TyrArg: 1.37 ± 0.39
TyrSer: 2.055 ± 0.394
TyrThr: 1.675 ± 0.339
TyrVal: 2.74 ± 0.406
TyrTrp: 0.228 ± 0.114
TyrTyr: 2.131 ± 0.488
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 73 proteins (13139 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
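Bootstrapping at the protein level can be sketched as follows. This is only an illustration under stated assumptions, not the procedure used by the database: whole proteins are resampled with replacement, the dipeptide frequencies are recomputed for each replicate, and the sample standard deviation across replicates is taken as the error. The function names, the seed and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins):
    """Dipeptide frequencies in per mille, counting pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {p: 1000 * n / total for p, n in counts.items()} if total else {}


def bootstrap_errors(proteins, n_boot=100, seed=0):
    """Estimate per-dipeptide errors by resampling whole proteins (n_boot >= 2)."""
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(dipeptide_per_mille(sample))
    pairs = set().union(*replicates)
    # Assumption: the error is the standard deviation across replicates;
    # a dipeptide missing from a replicate counts as frequency 0.
    return {
        pair: statistics.stdev([rep.get(pair, 0.0) for rep in replicates])
        for pair in pairs
    }


toy_proteins = ["MKKLA", "MKEL", "AAKK"]  # made-up sequences
errors = bootstrap_errors(toy_proteins, n_boot=100, seed=1)
for pair in sorted(errors):
    print(pair, round(errors[pair], 1))
```

Resampling proteins rather than individual dipeptides keeps the pairs of each protein together, so the estimate reflects variation between proteins.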

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski