Amino acid dipepetide frequency for Streptococcus satellite phage Javan303

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.597AlaAla: 0.597 ± 0.445
0.0AlaCys: 0.0 ± 0.0
1.492AlaAsp: 1.492 ± 0.465
3.879AlaGlu: 3.879 ± 1.162
2.387AlaPhe: 2.387 ± 0.654
1.194AlaGly: 1.194 ± 0.486
0.597AlaHis: 0.597 ± 0.445
2.387AlaIle: 2.387 ± 0.851
4.775AlaLys: 4.775 ± 0.941
2.984AlaLeu: 2.984 ± 1.163
2.089AlaMet: 2.089 ± 0.852
3.581AlaAsn: 3.581 ± 0.681
1.194AlaPro: 1.194 ± 0.557
0.895AlaGln: 0.895 ± 0.522
2.387AlaArg: 2.387 ± 1.153
1.791AlaSer: 1.791 ± 0.857
4.178AlaThr: 4.178 ± 0.791
2.089AlaVal: 2.089 ± 0.912
0.0AlaTrp: 0.0 ± 0.0
2.686AlaTyr: 2.686 ± 0.9
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.597CysAsp: 0.597 ± 0.357
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.895CysGly: 0.895 ± 0.479
0.0CysHis: 0.0 ± 0.0
0.298CysIle: 0.298 ± 0.273
0.298CysLys: 0.298 ± 0.322
0.895CysLeu: 0.895 ± 0.577
0.298CysMet: 0.298 ± 0.307
0.0CysAsn: 0.0 ± 0.0
0.597CysPro: 0.597 ± 0.388
0.298CysGln: 0.298 ± 0.285
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.597CysThr: 0.597 ± 0.431
0.298CysVal: 0.298 ± 0.262
0.0CysTrp: 0.0 ± 0.0
0.298CysTyr: 0.298 ± 0.285
0.0CysXaa: 0.0 ± 0.0
Asp
0.895AspAla: 0.895 ± 0.556
0.597AspCys: 0.597 ± 0.376
5.073AspAsp: 5.073 ± 1.47
2.089AspGlu: 2.089 ± 0.491
5.372AspPhe: 5.372 ± 1.29
3.879AspGly: 3.879 ± 0.817
0.298AspHis: 0.298 ± 0.278
5.968AspIle: 5.968 ± 1.237
6.864AspLys: 6.864 ± 1.403
7.46AspLeu: 7.46 ± 1.261
0.895AspMet: 0.895 ± 0.451
3.283AspAsn: 3.283 ± 0.775
0.298AspPro: 0.298 ± 0.322
0.895AspGln: 0.895 ± 0.382
2.387AspArg: 2.387 ± 0.67
4.476AspSer: 4.476 ± 1.119
3.879AspThr: 3.879 ± 1.239
3.581AspVal: 3.581 ± 0.56
0.298AspTrp: 0.298 ± 0.262
3.283AspTyr: 3.283 ± 0.926
0.0AspXaa: 0.0 ± 0.0
Glu
4.178GluAla: 4.178 ± 1.258
0.895GluCys: 0.895 ± 0.478
5.968GluAsp: 5.968 ± 1.704
4.775GluGlu: 4.775 ± 0.952
3.581GluPhe: 3.581 ± 0.921
2.089GluGly: 2.089 ± 0.771
1.194GluHis: 1.194 ± 0.487
7.759GluIle: 7.759 ± 1.136
10.445GluLys: 10.445 ± 1.811
10.445GluLeu: 10.445 ± 1.724
1.492GluMet: 1.492 ± 0.506
6.267GluAsn: 6.267 ± 1.139
0.597GluPro: 0.597 ± 0.391
5.073GluGln: 5.073 ± 1.019
3.879GluArg: 3.879 ± 1.074
4.775GluSer: 4.775 ± 1.077
4.476GluThr: 4.476 ± 1.053
2.984GluVal: 2.984 ± 1.058
0.895GluTrp: 0.895 ± 0.465
4.178GluTyr: 4.178 ± 0.968
0.0GluXaa: 0.0 ± 0.0
Phe
0.597PheAla: 0.597 ± 0.371
0.298PheCys: 0.298 ± 0.262
4.178PheAsp: 4.178 ± 0.832
5.372PheGlu: 5.372 ± 1.14
4.476PhePhe: 4.476 ± 1.322
2.387PheGly: 2.387 ± 0.596
1.492PheHis: 1.492 ± 0.575
2.089PheIle: 2.089 ± 0.693
4.775PheLys: 4.775 ± 1.322
5.968PheLeu: 5.968 ± 0.992
0.298PheMet: 0.298 ± 0.407
2.387PheAsn: 2.387 ± 0.861
1.194PhePro: 1.194 ± 0.641
1.492PheGln: 1.492 ± 0.86
1.492PheArg: 1.492 ± 0.6
5.073PheSer: 5.073 ± 1.21
2.089PheThr: 2.089 ± 0.876
4.178PheVal: 4.178 ± 1.057
0.298PheTrp: 0.298 ± 0.278
1.791PheTyr: 1.791 ± 0.559
0.0PheXaa: 0.0 ± 0.0
Gly
1.791GlyAla: 1.791 ± 0.725
0.597GlyCys: 0.597 ± 0.364
1.791GlyAsp: 1.791 ± 0.937
2.089GlyGlu: 2.089 ± 0.662
2.387GlyPhe: 2.387 ± 0.836
1.492GlyGly: 1.492 ± 1.171
0.597GlyHis: 0.597 ± 0.367
2.984GlyIle: 2.984 ± 0.68
5.073GlyLys: 5.073 ± 1.803
4.178GlyLeu: 4.178 ± 1.208
0.895GlyMet: 0.895 ± 0.466
2.387GlyAsn: 2.387 ± 0.738
0.0GlyPro: 0.0 ± 0.0
1.791GlyGln: 1.791 ± 0.506
1.492GlyArg: 1.492 ± 0.536
2.984GlySer: 2.984 ± 1.379
3.879GlyThr: 3.879 ± 0.858
2.387GlyVal: 2.387 ± 0.81
0.597GlyTrp: 0.597 ± 0.399
3.283GlyTyr: 3.283 ± 0.832
0.0GlyXaa: 0.0 ± 0.0
His
2.686HisAla: 2.686 ± 1.149
0.0HisCys: 0.0 ± 0.0
0.597HisAsp: 0.597 ± 0.378
0.298HisGlu: 0.298 ± 0.343
0.597HisPhe: 0.597 ± 0.336
0.0HisGly: 0.0 ± 0.0
0.298HisHis: 0.298 ± 0.312
1.492HisIle: 1.492 ± 0.713
1.492HisLys: 1.492 ± 0.576
1.194HisLeu: 1.194 ± 0.58
0.0HisMet: 0.0 ± 0.0
0.597HisAsn: 0.597 ± 0.428
0.597HisPro: 0.597 ± 0.336
2.089HisGln: 2.089 ± 0.585
0.298HisArg: 0.298 ± 0.295
0.895HisSer: 0.895 ± 0.571
2.686HisThr: 2.686 ± 0.855
0.298HisVal: 0.298 ± 0.309
0.0HisTrp: 0.0 ± 0.0
0.895HisTyr: 0.895 ± 0.445
0.0HisXaa: 0.0 ± 0.0
Ile
1.492IleAla: 1.492 ± 0.487
0.0IleCys: 0.0 ± 0.0
4.178IleAsp: 4.178 ± 1.325
8.057IleGlu: 8.057 ± 2.189
3.879IlePhe: 3.879 ± 1.176
2.984IleGly: 2.984 ± 0.729
1.194IleHis: 1.194 ± 0.49
5.67IleIle: 5.67 ± 1.017
7.759IleLys: 7.759 ± 1.425
9.549IleLeu: 9.549 ± 2.17
0.895IleMet: 0.895 ± 0.447
6.864IleAsn: 6.864 ± 1.373
2.387IlePro: 2.387 ± 0.671
3.879IleGln: 3.879 ± 1.069
1.791IleArg: 1.791 ± 0.493
5.372IleSer: 5.372 ± 1.212
2.686IleThr: 2.686 ± 0.792
2.089IleVal: 2.089 ± 0.76
0.0IleTrp: 0.0 ± 0.0
2.686IleTyr: 2.686 ± 0.737
0.0IleXaa: 0.0 ± 0.0
Lys
4.775LysAla: 4.775 ± 0.857
0.597LysCys: 0.597 ± 0.382
5.67LysAsp: 5.67 ± 1.002
11.34LysGlu: 11.34 ± 1.128
3.581LysPhe: 3.581 ± 1.165
5.67LysGly: 5.67 ± 1.434
2.387LysHis: 2.387 ± 0.777
8.953LysIle: 8.953 ± 1.427
8.356LysLys: 8.356 ± 1.836
6.864LysLeu: 6.864 ± 1.138
2.984LysMet: 2.984 ± 1.1
4.775LysAsn: 4.775 ± 1.353
1.492LysPro: 1.492 ± 0.542
5.372LysGln: 5.372 ± 1.313
6.267LysArg: 6.267 ± 0.933
7.759LysSer: 7.759 ± 1.437
7.162LysThr: 7.162 ± 1.63
5.968LysVal: 5.968 ± 1.122
0.597LysTrp: 0.597 ± 0.384
2.686LysTyr: 2.686 ± 0.687
0.0LysXaa: 0.0 ± 0.0
Leu
5.073LeuAla: 5.073 ± 1.226
0.895LeuCys: 0.895 ± 0.6
8.356LeuAsp: 8.356 ± 1.192
10.146LeuGlu: 10.146 ± 1.586
5.968LeuPhe: 5.968 ± 1.634
5.372LeuGly: 5.372 ± 1.096
2.387LeuHis: 2.387 ± 0.98
6.565LeuIle: 6.565 ± 1.598
9.251LeuLys: 9.251 ± 1.336
7.46LeuLeu: 7.46 ± 1.383
2.089LeuMet: 2.089 ± 0.574
5.67LeuAsn: 5.67 ± 1.428
3.283LeuPro: 3.283 ± 1.156
4.178LeuGln: 4.178 ± 0.749
3.581LeuArg: 3.581 ± 0.758
7.46LeuSer: 7.46 ± 1.208
5.372LeuThr: 5.372 ± 1.217
4.476LeuVal: 4.476 ± 1.302
0.597LeuTrp: 0.597 ± 0.372
2.089LeuTyr: 2.089 ± 0.814
0.0LeuXaa: 0.0 ± 0.0
Met
0.597MetAla: 0.597 ± 0.46
0.0MetCys: 0.0 ± 0.0
1.791MetAsp: 1.791 ± 0.536
3.283MetGlu: 3.283 ± 1.052
0.895MetPhe: 0.895 ± 0.524
0.298MetGly: 0.298 ± 0.371
0.298MetHis: 0.298 ± 0.312
1.194MetIle: 1.194 ± 0.415
1.791MetLys: 1.791 ± 0.544
1.791MetLeu: 1.791 ± 0.581
0.0MetMet: 0.0 ± 0.0
1.791MetAsn: 1.791 ± 0.837
0.298MetPro: 0.298 ± 0.285
0.298MetGln: 0.298 ± 0.295
1.194MetArg: 1.194 ± 0.599
0.597MetSer: 0.597 ± 0.444
2.387MetThr: 2.387 ± 1.021
1.194MetVal: 1.194 ± 0.52
0.0MetTrp: 0.0 ± 0.0
0.895MetTyr: 0.895 ± 0.453
0.0MetXaa: 0.0 ± 0.0
Asn
2.686AsnAla: 2.686 ± 0.699
0.298AsnCys: 0.298 ± 0.278
2.387AsnAsp: 2.387 ± 0.694
3.879AsnGlu: 3.879 ± 1.218
3.879AsnPhe: 3.879 ± 1.328
3.879AsnGly: 3.879 ± 1.195
0.597AsnHis: 0.597 ± 0.433
3.879AsnIle: 3.879 ± 1.445
5.968AsnLys: 5.968 ± 1.012
5.67AsnLeu: 5.67 ± 1.007
1.194AsnMet: 1.194 ± 0.487
4.476AsnAsn: 4.476 ± 0.802
1.791AsnPro: 1.791 ± 0.482
2.984AsnGln: 2.984 ± 1.049
2.089AsnArg: 2.089 ± 0.766
3.879AsnSer: 3.879 ± 0.961
3.879AsnThr: 3.879 ± 1.188
2.089AsnVal: 2.089 ± 0.969
0.597AsnTrp: 0.597 ± 0.4
3.581AsnTyr: 3.581 ± 0.923
0.0AsnXaa: 0.0 ± 0.0
Pro
0.298ProAla: 0.298 ± 0.278
0.0ProCys: 0.0 ± 0.0
1.791ProAsp: 1.791 ± 0.701
2.089ProGlu: 2.089 ± 0.71
1.492ProPhe: 1.492 ± 0.583
0.597ProGly: 0.597 ± 0.336
0.0ProHis: 0.0 ± 0.0
1.194ProIle: 1.194 ± 0.552
3.283ProLys: 3.283 ± 1.127
1.194ProLeu: 1.194 ± 0.564
0.0ProMet: 0.0 ± 0.0
0.895ProAsn: 0.895 ± 0.445
0.597ProPro: 0.597 ± 0.425
0.597ProGln: 0.597 ± 0.687
0.895ProArg: 0.895 ± 0.466
0.895ProSer: 0.895 ± 0.397
1.791ProThr: 1.791 ± 0.775
0.597ProVal: 0.597 ± 0.364
0.0ProTrp: 0.0 ± 0.0
1.492ProTyr: 1.492 ± 0.486
0.0ProXaa: 0.0 ± 0.0
Gln
5.073GlnAla: 5.073 ± 1.89
0.597GlnCys: 0.597 ± 0.399
2.686GlnAsp: 2.686 ± 0.717
4.775GlnGlu: 4.775 ± 0.877
1.791GlnPhe: 1.791 ± 0.674
0.597GlnGly: 0.597 ± 0.476
1.492GlnHis: 1.492 ± 0.653
2.089GlnIle: 2.089 ± 0.902
4.476GlnLys: 4.476 ± 1.09
3.879GlnLeu: 3.879 ± 0.922
1.194GlnMet: 1.194 ± 0.632
1.194GlnAsn: 1.194 ± 0.47
0.0GlnPro: 0.0 ± 0.0
1.791GlnGln: 1.791 ± 0.654
3.581GlnArg: 3.581 ± 0.827
1.194GlnSer: 1.194 ± 0.429
2.387GlnThr: 2.387 ± 0.898
3.283GlnVal: 3.283 ± 1.16
0.298GlnTrp: 0.298 ± 0.309
2.387GlnTyr: 2.387 ± 0.622
0.0GlnXaa: 0.0 ± 0.0
Arg
1.492ArgAla: 1.492 ± 0.713
0.0ArgCys: 0.0 ± 0.0
2.387ArgAsp: 2.387 ± 0.671
2.984ArgGlu: 2.984 ± 0.788
2.686ArgPhe: 2.686 ± 0.794
1.194ArgGly: 1.194 ± 0.678
0.298ArgHis: 0.298 ± 0.278
2.984ArgIle: 2.984 ± 0.994
6.565ArgLys: 6.565 ± 1.125
3.879ArgLeu: 3.879 ± 0.898
0.895ArgMet: 0.895 ± 0.444
2.686ArgAsn: 2.686 ± 0.754
0.597ArgPro: 0.597 ± 0.319
3.581ArgGln: 3.581 ± 0.905
2.387ArgArg: 2.387 ± 0.865
2.686ArgSer: 2.686 ± 0.865
2.387ArgThr: 2.387 ± 0.888
1.194ArgVal: 1.194 ± 0.505
0.0ArgTrp: 0.0 ± 0.0
3.283ArgTyr: 3.283 ± 1.257
0.0ArgXaa: 0.0 ± 0.0
Ser
1.791SerAla: 1.791 ± 0.661
0.298SerCys: 0.298 ± 0.273
4.476SerAsp: 4.476 ± 0.637
5.67SerGlu: 5.67 ± 1.101
3.283SerPhe: 3.283 ± 0.919
3.581SerGly: 3.581 ± 0.983
1.492SerHis: 1.492 ± 0.664
3.581SerIle: 3.581 ± 1.315
9.251SerLys: 9.251 ± 2.149
7.162SerLeu: 7.162 ± 1.382
2.089SerMet: 2.089 ± 0.789
2.387SerAsn: 2.387 ± 0.848
1.492SerPro: 1.492 ± 0.645
3.283SerGln: 3.283 ± 1.03
1.492SerArg: 1.492 ± 0.576
4.775SerSer: 4.775 ± 1.245
3.283SerThr: 3.283 ± 0.867
3.879SerVal: 3.879 ± 1.083
0.895SerTrp: 0.895 ± 0.502
2.089SerTyr: 2.089 ± 0.694
0.0SerXaa: 0.0 ± 0.0
Thr
2.686ThrAla: 2.686 ± 0.828
0.0ThrCys: 0.0 ± 0.0
2.387ThrAsp: 2.387 ± 0.794
5.968ThrGlu: 5.968 ± 1.505
2.089ThrPhe: 2.089 ± 0.775
5.073ThrGly: 5.073 ± 0.728
0.895ThrHis: 0.895 ± 0.394
6.864ThrIle: 6.864 ± 1.045
4.476ThrLys: 4.476 ± 1.14
6.864ThrLeu: 6.864 ± 1.162
0.895ThrMet: 0.895 ± 0.591
4.178ThrAsn: 4.178 ± 0.95
0.895ThrPro: 0.895 ± 0.483
2.387ThrGln: 2.387 ± 0.852
3.879ThrArg: 3.879 ± 1.493
1.492ThrSer: 1.492 ± 1.003
3.581ThrThr: 3.581 ± 0.919
4.178ThrVal: 4.178 ± 1.033
0.895ThrTrp: 0.895 ± 0.461
2.387ThrTyr: 2.387 ± 1.417
0.0ThrXaa: 0.0 ± 0.0
Val
2.984ValAla: 2.984 ± 0.907
0.298ValCys: 0.298 ± 0.262
3.879ValAsp: 3.879 ± 1.436
4.178ValGlu: 4.178 ± 1.263
0.895ValPhe: 0.895 ± 0.534
0.597ValGly: 0.597 ± 0.403
0.895ValHis: 0.895 ± 0.606
3.283ValIle: 3.283 ± 0.75
3.581ValLys: 3.581 ± 1.234
4.178ValLeu: 4.178 ± 0.958
0.597ValMet: 0.597 ± 0.401
3.581ValAsn: 3.581 ± 0.777
1.492ValPro: 1.492 ± 0.601
1.194ValGln: 1.194 ± 0.628
2.089ValArg: 2.089 ± 0.663
5.67ValSer: 5.67 ± 1.281
3.879ValThr: 3.879 ± 1.05
2.984ValVal: 2.984 ± 0.658
0.0ValTrp: 0.0 ± 0.0
2.686ValTyr: 2.686 ± 0.667
0.0ValXaa: 0.0 ± 0.0
Trp
0.895TrpAla: 0.895 ± 0.444
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.791TrpGlu: 1.791 ± 0.711
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.895TrpIle: 0.895 ± 0.436
0.298TrpLys: 0.298 ± 0.273
0.895TrpLeu: 0.895 ± 0.401
0.597TrpMet: 0.597 ± 0.409
0.298TrpAsn: 0.298 ± 0.309
0.0TrpPro: 0.0 ± 0.0
0.298TrpGln: 0.298 ± 0.278
0.0TrpArg: 0.0 ± 0.0
0.298TrpSer: 0.298 ± 0.278
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.597TyrAla: 0.597 ± 0.399
0.0TyrCys: 0.0 ± 0.0
2.089TyrAsp: 2.089 ± 0.668
3.283TyrGlu: 3.283 ± 0.839
2.984TyrPhe: 2.984 ± 1.016
0.895TyrGly: 0.895 ± 0.377
0.597TyrHis: 0.597 ± 0.357
3.283TyrIle: 3.283 ± 0.897
4.476TyrLys: 4.476 ± 1.138
7.759TyrLeu: 7.759 ± 1.054
0.895TyrMet: 0.895 ± 0.552
2.387TyrAsn: 2.387 ± 0.697
0.895TyrPro: 0.895 ± 0.461
2.387TyrGln: 2.387 ± 0.831
2.984TyrArg: 2.984 ± 1.419
4.178TyrSer: 4.178 ± 0.969
1.492TyrThr: 1.492 ± 0.586
1.194TyrVal: 1.194 ± 0.525
0.298TyrTrp: 0.298 ± 0.262
2.089TyrTyr: 2.089 ± 0.974
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 18 proteins (3352 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski