Amino acid dipeptide frequency for Streptococcus phage Javan74

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
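As an illustration of how such a table can be computed, here is a minimal Python sketch. It is not the database's own code: the function names (`dipeptide_per_mille`, `classify`), the normalization by the total number of dipeptides, and the comparison against the uniform expectation of 2.5‰ (0.25%) are assumptions made for this example.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the column order used by the table.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}

# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
EXPECTED_UNIFORM = 1000.0 / 400


def to_three(residue):
    """Map a one-letter code to three letters; anything else becomes Xaa."""
    return ONE_TO_THREE.get(residue.upper(), "Xaa")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 combinations.

    Assumption: frequencies are normalized by the total number of
    dipeptides; the database may use a different denominator.
    """
    counts = Counter()
    for seq in proteins:
        # Each sequence is scanned separately, so no pair spans two proteins.
        for first, second in zip(seq, seq[1:]):
            counts[to_three(first) + to_three(second)] += 1
    total = sum(counts.values())
    names = list(ONE_TO_THREE.values()) + ["Xaa"]
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(names, repeat=2)
    }


def classify(value):
    """Label a per-mille value relative to the uniform expectation."""
    if value > EXPECTED_UNIFORM:
        return "over"   # would be marked red
    if value < EXPECTED_UNIFORM:
        return "under"  # would be marked blue
    return "expected"
```

For example, `classify(4.44)` (the AlaAla value) gives `"over"`, while `classify(0.419)` (AlaCys) gives `"under"`.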

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.44 ± 0.668
AlaCys: 0.419 ± 0.181
AlaAsp: 5.446 ± 0.659
AlaGlu: 6.954 ± 0.772
AlaPhe: 2.681 ± 0.714
AlaGly: 4.021 ± 0.714
AlaHis: 1.592 ± 0.498
AlaIle: 5.781 ± 0.727
AlaLys: 5.529 ± 0.76
AlaLeu: 6.619 ± 0.802
AlaMet: 1.34 ± 0.283
AlaAsn: 4.189 ± 0.555
AlaPro: 2.095 ± 0.391
AlaGln: 3.1 ± 0.555
AlaArg: 2.597 ± 0.466
AlaSer: 2.849 ± 0.438
AlaThr: 4.524 ± 0.522
AlaVal: 4.692 ± 0.909
AlaTrp: 0.754 ± 0.231
AlaTyr: 2.262 ± 0.531
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.419 ± 0.188
CysCys: 0.0 ± 0.0
CysAsp: 0.586 ± 0.362
CysGlu: 0.503 ± 0.182
CysPhe: 0.251 ± 0.136
CysGly: 0.335 ± 0.168
CysHis: 0.168 ± 0.109
CysIle: 0.251 ± 0.128
CysLys: 0.754 ± 0.253
CysLeu: 0.335 ± 0.183
CysMet: 0.168 ± 0.114
CysAsn: 0.251 ± 0.143
CysPro: 0.084 ± 0.084
CysGln: 0.168 ± 0.133
CysArg: 0.335 ± 0.186
CysSer: 0.335 ± 0.151
CysThr: 0.168 ± 0.109
CysVal: 0.084 ± 0.084
CysTrp: 0.0 ± 0.0
CysTyr: 0.084 ± 0.094
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.519 ± 0.766
AspCys: 0.838 ± 0.259
AspAsp: 4.021 ± 0.708
AspGlu: 4.859 ± 0.668
AspPhe: 2.513 ± 0.495
AspGly: 4.44 ± 0.687
AspHis: 0.754 ± 0.245
AspIle: 3.016 ± 0.402
AspLys: 5.613 ± 0.562
AspLeu: 5.194 ± 0.529
AspMet: 1.843 ± 0.397
AspAsn: 4.775 ± 0.6
AspPro: 2.178 ± 0.425
AspGln: 1.424 ± 0.36
AspArg: 2.43 ± 0.474
AspSer: 3.854 ± 0.546
AspThr: 3.603 ± 0.512
AspVal: 4.021 ± 0.452
AspTrp: 1.843 ± 0.363
AspTyr: 3.351 ± 0.614
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.362 ± 0.707
GluCys: 0.419 ± 0.207
GluAsp: 3.351 ± 0.565
GluGlu: 5.697 ± 0.838
GluPhe: 3.351 ± 0.578
GluGly: 4.357 ± 0.611
GluHis: 1.508 ± 0.321
GluIle: 6.116 ± 0.66
GluLys: 6.702 ± 0.841
GluLeu: 10.137 ± 1.143
GluMet: 2.597 ± 0.533
GluAsn: 4.524 ± 0.722
GluPro: 1.592 ± 0.351
GluGln: 4.692 ± 0.777
GluArg: 4.775 ± 0.644
GluSer: 3.686 ± 0.438
GluThr: 3.603 ± 0.51
GluVal: 3.686 ± 0.547
GluTrp: 0.754 ± 0.189
GluTyr: 2.765 ± 0.534
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.016 ± 0.486
PheCys: 0.168 ± 0.118
PheAsp: 3.016 ± 0.492
PheGlu: 3.519 ± 0.456
PhePhe: 1.005 ± 0.285
PheGly: 2.178 ± 0.375
PheHis: 0.084 ± 0.08
PheIle: 2.095 ± 0.425
PheLys: 3.603 ± 0.723
PheLeu: 2.765 ± 0.415
PheMet: 1.424 ± 0.308
PheAsn: 2.849 ± 0.579
PhePro: 1.592 ± 0.486
PheGln: 0.67 ± 0.198
PheArg: 2.178 ± 0.39
PheSer: 3.1 ± 0.604
PheThr: 2.262 ± 0.507
PheVal: 2.513 ± 0.451
PheTrp: 0.251 ± 0.149
PheTyr: 1.173 ± 0.28
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.854 ± 0.602
GlyCys: 0.168 ± 0.11
GlyAsp: 3.77 ± 0.69
GlyGlu: 4.44 ± 0.65
GlyPhe: 2.346 ± 0.475
GlyGly: 3.854 ± 0.628
GlyHis: 0.754 ± 0.228
GlyIle: 5.111 ± 0.796
GlyLys: 4.859 ± 0.704
GlyLeu: 5.781 ± 0.855
GlyMet: 2.011 ± 0.413
GlyAsn: 2.681 ± 0.52
GlyPro: 0.754 ± 0.254
GlyGln: 3.686 ± 0.603
GlyArg: 3.267 ± 0.524
GlySer: 3.77 ± 0.681
GlyThr: 3.351 ± 0.564
GlyVal: 4.859 ± 0.635
GlyTrp: 1.508 ± 0.364
GlyTyr: 2.513 ± 0.402
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.257 ± 0.304
HisCys: 0.084 ± 0.085
HisAsp: 0.922 ± 0.219
HisGlu: 1.005 ± 0.238
HisPhe: 1.005 ± 0.276
HisGly: 1.089 ± 0.251
HisHis: 0.419 ± 0.192
HisIle: 0.586 ± 0.202
HisLys: 1.424 ± 0.372
HisLeu: 1.34 ± 0.382
HisMet: 0.251 ± 0.124
HisAsn: 0.586 ± 0.224
HisPro: 0.586 ± 0.19
HisGln: 0.419 ± 0.202
HisArg: 0.503 ± 0.201
HisSer: 0.586 ± 0.174
HisThr: 1.089 ± 0.3
HisVal: 1.257 ± 0.37
HisTrp: 0.168 ± 0.125
HisTyr: 0.251 ± 0.133
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.111 ± 0.713
IleCys: 0.586 ± 0.186
IleAsp: 5.278 ± 0.643
IleGlu: 5.529 ± 0.685
IlePhe: 2.346 ± 0.47
IleGly: 3.938 ± 0.962
IleHis: 0.754 ± 0.232
IleIle: 4.189 ± 0.43
IleLys: 6.032 ± 0.595
IleLeu: 4.273 ± 0.706
IleMet: 1.005 ± 0.329
IleAsn: 3.267 ± 0.599
IlePro: 1.424 ± 0.415
IleGln: 2.597 ± 0.485
IleArg: 2.849 ± 0.548
IleSer: 3.686 ± 0.695
IleThr: 4.357 ± 0.936
IleVal: 3.938 ± 0.626
IleTrp: 0.419 ± 0.187
IleTyr: 3.351 ± 0.582
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.284 ± 0.556
LysCys: 0.084 ± 0.1
LysAsp: 5.362 ± 0.506
LysGlu: 6.367 ± 0.79
LysPhe: 3.016 ± 0.438
LysGly: 4.357 ± 0.496
LysHis: 1.005 ± 0.27
LysIle: 5.027 ± 0.701
LysLys: 7.205 ± 1.026
LysLeu: 6.954 ± 0.747
LysMet: 1.927 ± 0.357
LysAsn: 4.021 ± 0.608
LysPro: 2.011 ± 0.418
LysGln: 3.435 ± 0.577
LysArg: 3.854 ± 0.583
LysSer: 5.362 ± 0.87
LysThr: 6.2 ± 0.794
LysVal: 5.027 ± 0.681
LysTrp: 1.34 ± 0.322
LysTyr: 2.346 ± 0.392
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.54 ± 0.727
LeuCys: 0.586 ± 0.224
LeuAsp: 6.535 ± 0.785
LeuGlu: 7.875 ± 0.768
LeuPhe: 3.184 ± 0.629
LeuGly: 5.613 ± 0.982
LeuHis: 1.508 ± 0.312
LeuIle: 3.77 ± 0.641
LeuLys: 7.875 ± 0.663
LeuLeu: 6.284 ± 0.75
LeuMet: 1.257 ± 0.284
LeuAsn: 4.608 ± 0.549
LeuPro: 3.435 ± 0.629
LeuGln: 3.603 ± 0.465
LeuArg: 4.189 ± 0.628
LeuSer: 5.194 ± 0.669
LeuThr: 4.859 ± 0.649
LeuVal: 4.692 ± 0.624
LeuTrp: 0.586 ± 0.185
LeuTyr: 2.095 ± 0.438
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.508 ± 0.361
MetCys: 0.0 ± 0.0
MetAsp: 1.257 ± 0.362
MetGlu: 1.592 ± 0.353
MetPhe: 0.586 ± 0.21
MetGly: 1.424 ± 0.307
MetHis: 0.084 ± 0.076
MetIle: 1.257 ± 0.347
MetLys: 1.34 ± 0.285
MetLeu: 2.346 ± 0.436
MetMet: 0.084 ± 0.07
MetAsn: 1.089 ± 0.297
MetPro: 0.922 ± 0.266
MetGln: 1.089 ± 0.266
MetArg: 1.676 ± 0.344
MetSer: 1.676 ± 0.346
MetThr: 1.676 ± 0.339
MetVal: 0.838 ± 0.254
MetTrp: 0.335 ± 0.195
MetTyr: 1.089 ± 0.334
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.189 ± 0.904
AsnCys: 0.084 ± 0.07
AsnAsp: 3.351 ± 0.471
AsnGlu: 3.1 ± 0.633
AsnPhe: 1.843 ± 0.348
AsnGly: 4.44 ± 0.55
AsnHis: 0.67 ± 0.22
AsnIle: 3.519 ± 0.617
AsnLys: 3.854 ± 0.513
AsnLeu: 5.194 ± 0.832
AsnMet: 1.34 ± 0.317
AsnAsn: 2.765 ± 0.442
AsnPro: 2.011 ± 0.468
AsnGln: 2.262 ± 0.537
AsnArg: 2.932 ± 0.6
AsnSer: 3.016 ± 0.393
AsnThr: 3.016 ± 0.493
AsnVal: 3.519 ± 0.491
AsnTrp: 1.005 ± 0.262
AsnTyr: 2.597 ± 0.506
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.262 ± 0.481
ProCys: 0.251 ± 0.16
ProAsp: 2.262 ± 0.385
ProGlu: 2.765 ± 0.469
ProPhe: 1.257 ± 0.324
ProGly: 1.34 ± 0.358
ProHis: 0.168 ± 0.115
ProIle: 1.676 ± 0.446
ProLys: 2.597 ± 0.466
ProLeu: 2.43 ± 0.432
ProMet: 0.419 ± 0.197
ProAsn: 1.424 ± 0.391
ProPro: 0.754 ± 0.206
ProGln: 2.011 ± 0.657
ProArg: 1.424 ± 0.431
ProSer: 1.843 ± 0.432
ProThr: 1.508 ± 0.363
ProVal: 2.011 ± 0.548
ProTrp: 0.251 ± 0.155
ProTyr: 2.262 ± 0.543
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.938 ± 0.51
GlnCys: 0.168 ± 0.121
GlnAsp: 1.005 ± 0.293
GlnGlu: 3.351 ± 0.594
GlnPhe: 1.759 ± 0.419
GlnGly: 2.597 ± 0.469
GlnHis: 0.922 ± 0.212
GlnIle: 3.519 ± 0.396
GlnLys: 4.357 ± 0.461
GlnLeu: 3.351 ± 0.48
GlnMet: 1.173 ± 0.333
GlnAsn: 1.759 ± 0.498
GlnPro: 1.759 ± 0.512
GlnGln: 3.603 ± 0.691
GlnArg: 1.676 ± 0.324
GlnSer: 3.351 ± 0.513
GlnThr: 2.513 ± 0.44
GlnVal: 2.513 ± 0.422
GlnTrp: 0.503 ± 0.171
GlnTyr: 1.34 ± 0.332
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.765 ± 0.335
ArgCys: 0.168 ± 0.153
ArgAsp: 2.095 ± 0.428
ArgGlu: 3.854 ± 0.536
ArgPhe: 2.262 ± 0.482
ArgGly: 2.765 ± 0.493
ArgHis: 1.089 ± 0.322
ArgIle: 3.854 ± 0.652
ArgLys: 4.105 ± 0.658
ArgLeu: 3.77 ± 0.618
ArgMet: 1.173 ± 0.334
ArgAsn: 2.178 ± 0.457
ArgPro: 1.508 ± 0.37
ArgGln: 1.927 ± 0.402
ArgArg: 1.843 ± 0.497
ArgSer: 2.346 ± 0.399
ArgThr: 3.1 ± 0.527
ArgVal: 3.016 ± 0.49
ArgTrp: 0.419 ± 0.168
ArgTyr: 2.011 ± 0.463
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.027 ± 0.779
SerCys: 0.084 ± 0.079
SerAsp: 3.519 ± 0.59
SerGlu: 4.692 ± 0.803
SerPhe: 2.765 ± 0.587
SerGly: 4.608 ± 0.721
SerHis: 0.754 ± 0.248
SerIle: 4.273 ± 0.566
SerLys: 4.357 ± 0.462
SerLeu: 4.105 ± 0.561
SerMet: 1.257 ± 0.402
SerAsn: 3.938 ± 0.521
SerPro: 1.759 ± 0.346
SerGln: 2.43 ± 0.477
SerArg: 2.262 ± 0.473
SerSer: 2.513 ± 0.648
SerThr: 3.351 ± 0.524
SerVal: 3.1 ± 0.502
SerTrp: 1.005 ± 0.266
SerTyr: 3.016 ± 0.506
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.273 ± 0.783
ThrCys: 0.251 ± 0.154
ThrAsp: 4.105 ± 0.562
ThrGlu: 5.027 ± 0.737
ThrPhe: 2.597 ± 0.441
ThrGly: 4.943 ± 0.782
ThrHis: 0.922 ± 0.267
ThrIle: 3.77 ± 0.536
ThrLys: 2.597 ± 0.501
ThrLeu: 4.692 ± 0.446
ThrMet: 0.838 ± 0.289
ThrAsn: 2.932 ± 0.55
ThrPro: 2.597 ± 0.704
ThrGln: 2.765 ± 0.489
ThrArg: 2.178 ± 0.404
ThrSer: 3.184 ± 0.505
ThrThr: 3.854 ± 0.594
ThrVal: 4.021 ± 0.567
ThrTrp: 0.251 ± 0.129
ThrTyr: 2.932 ± 0.563
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.273 ± 0.663
ValCys: 0.586 ± 0.226
ValAsp: 3.854 ± 0.438
ValGlu: 4.273 ± 0.547
ValPhe: 2.178 ± 0.486
ValGly: 4.105 ± 0.793
ValHis: 0.586 ± 0.284
ValIle: 3.519 ± 0.509
ValLys: 4.357 ± 0.534
ValLeu: 6.2 ± 0.779
ValMet: 1.005 ± 0.332
ValAsn: 3.603 ± 0.515
ValPro: 1.927 ± 0.38
ValGln: 2.43 ± 0.39
ValArg: 2.178 ± 0.426
ValSer: 5.027 ± 0.576
ValThr: 3.351 ± 0.591
ValVal: 4.357 ± 0.802
ValTrp: 0.754 ± 0.275
ValTyr: 2.597 ± 0.396
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.586 ± 0.225
TrpCys: 0.168 ± 0.118
TrpAsp: 1.34 ± 0.439
TrpGlu: 1.34 ± 0.356
TrpPhe: 1.173 ± 0.572
TrpGly: 0.586 ± 0.251
TrpHis: 0.251 ± 0.139
TrpIle: 0.838 ± 0.277
TrpLys: 1.089 ± 0.311
TrpLeu: 0.922 ± 0.3
TrpMet: 0.084 ± 0.083
TrpAsn: 0.838 ± 0.307
TrpPro: 0.251 ± 0.132
TrpGln: 0.586 ± 0.182
TrpArg: 0.503 ± 0.189
TrpSer: 0.67 ± 0.226
TrpThr: 0.419 ± 0.199
TrpVal: 0.67 ± 0.197
TrpTrp: 0.084 ± 0.084
TrpTyr: 0.335 ± 0.165
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.262 ± 0.438
TyrCys: 0.168 ± 0.12
TyrAsp: 3.016 ± 0.604
TyrGlu: 2.932 ± 0.557
TyrPhe: 1.34 ± 0.302
TyrGly: 2.346 ± 0.66
TyrHis: 0.838 ± 0.236
TyrIle: 2.765 ± 0.431
TyrLys: 2.932 ± 0.536
TyrLeu: 2.597 ± 0.442
TyrMet: 0.586 ± 0.221
TyrAsn: 2.43 ± 0.519
TyrPro: 1.676 ± 0.314
TyrGln: 2.262 ± 0.57
TyrArg: 2.681 ± 0.6
TyrSer: 2.765 ± 0.429
TyrThr: 1.927 ± 0.401
TyrVal: 2.346 ± 0.485
TyrTrp: 0.503 ± 0.197
TyrTyr: 1.34 ± 0.32
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 52 proteins (11937 amino acids)
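To get a feel for the scale of these numbers, the per mille values can be turned back into approximate raw counts. The sketch below is only an illustration: the helper names are made up for this example, and using the total of 11937 amino acids as the denominator is an assumption. The page does not state the normalization, although the smallest non-zero table value (0.084) agrees with 1/11937.

```python
TOTAL_RESIDUES = 11937  # total amino acids, from the statistics line


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def approx_count(value, denominator=TOTAL_RESIDUES):
    """Estimate the raw count behind a per mille value.

    Assumption: the value was obtained as count / denominator * 1000.
    """
    return round(per_mille_to_fraction(value) * denominator)


print(approx_count(0.084))   # a single occurrence, e.g. CysPro
print(approx_count(4.44))    # AlaAla
print(approx_count(10.137))  # GluLeu, the largest value in the table
```

Under this assumption, many of the small values correspond to only one or two observed dipeptides, which is why their estimated errors are about as large as the values themselves.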

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
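Bootstrapping at the protein level can be sketched as follows. This is a hedged illustration, not the database's implementation: the function names, the fixed random seed, the use of one-letter codes, the normalization by the number of dipeptides, and the choice of the sample standard deviation as the error measure are all assumptions.

```python
import random
import statistics


def dipeptide_per_mille(proteins, dipeptide):
    """Per mille frequency of one dipeptide given in one-letter codes, e.g. "AK"."""
    hits = 0
    total = 0
    for seq in proteins:
        # Scan adjacent residue pairs within each protein separately.
        for first, second in zip(seq, seq[1:]):
            total += 1
            if first + second == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling whole proteins."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    # Spread of the replicate estimates serves as the error.
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than individual residues keeps the dipeptides of each protein together, so the error reflects variation between proteins. A dipeptide that never occurs gets an error of exactly 0, as in the Xaa rows of the table.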

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski