Amino acid dipepetide frequency for Streptococcus phage Javan372

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.429AlaAla: 4.429 ± 0.769
0.206AlaCys: 0.206 ± 0.148
4.12AlaAsp: 4.12 ± 0.61
5.047AlaGlu: 5.047 ± 0.879
2.266AlaPhe: 2.266 ± 0.409
3.399AlaGly: 3.399 ± 0.621
0.412AlaHis: 0.412 ± 0.202
5.253AlaIle: 5.253 ± 0.762
4.738AlaLys: 4.738 ± 0.601
7.313AlaLeu: 7.313 ± 0.844
2.163AlaMet: 2.163 ± 0.425
4.738AlaAsn: 4.738 ± 0.769
1.339AlaPro: 1.339 ± 0.416
2.884AlaGln: 2.884 ± 0.769
2.06AlaArg: 2.06 ± 0.473
4.12AlaSer: 4.12 ± 0.514
3.399AlaThr: 3.399 ± 0.642
4.738AlaVal: 4.738 ± 0.777
0.824AlaTrp: 0.824 ± 0.287
2.884AlaTyr: 2.884 ± 0.628
0.0AlaXaa: 0.0 ± 0.0
Cys
0.309CysAla: 0.309 ± 0.253
0.0CysCys: 0.0 ± 0.0
0.309CysAsp: 0.309 ± 0.183
0.412CysGlu: 0.412 ± 0.227
0.515CysPhe: 0.515 ± 0.285
0.309CysGly: 0.309 ± 0.186
0.103CysHis: 0.103 ± 0.099
0.309CysIle: 0.309 ± 0.23
0.618CysLys: 0.618 ± 0.214
0.309CysLeu: 0.309 ± 0.193
0.0CysMet: 0.0 ± 0.0
0.103CysAsn: 0.103 ± 0.1
0.103CysPro: 0.103 ± 0.1
0.309CysGln: 0.309 ± 0.19
0.206CysArg: 0.206 ± 0.154
0.309CysSer: 0.309 ± 0.2
0.206CysThr: 0.206 ± 0.166
0.309CysVal: 0.309 ± 0.171
0.0CysTrp: 0.0 ± 0.0
0.515CysTyr: 0.515 ± 0.255
0.0CysXaa: 0.0 ± 0.0
Asp
3.399AspAla: 3.399 ± 0.527
0.515AspCys: 0.515 ± 0.32
2.987AspAsp: 2.987 ± 0.617
5.15AspGlu: 5.15 ± 0.771
3.811AspPhe: 3.811 ± 0.564
5.768AspGly: 5.768 ± 0.874
1.03AspHis: 1.03 ± 0.349
4.944AspIle: 4.944 ± 0.549
4.532AspLys: 4.532 ± 0.536
5.047AspLeu: 5.047 ± 0.629
1.648AspMet: 1.648 ± 0.421
3.193AspAsn: 3.193 ± 0.512
1.545AspPro: 1.545 ± 0.543
1.854AspGln: 1.854 ± 0.475
1.751AspArg: 1.751 ± 0.491
2.781AspSer: 2.781 ± 0.449
2.987AspThr: 2.987 ± 0.619
3.708AspVal: 3.708 ± 0.669
1.03AspTrp: 1.03 ± 0.302
2.369AspTyr: 2.369 ± 0.433
0.0AspXaa: 0.0 ± 0.0
Glu
5.871GluAla: 5.871 ± 0.778
0.412GluCys: 0.412 ± 0.23
4.223GluAsp: 4.223 ± 0.864
10.3GluGlu: 10.3 ± 1.85
3.914GluPhe: 3.914 ± 0.793
3.605GluGly: 3.605 ± 0.623
0.824GluHis: 0.824 ± 0.297
6.386GluIle: 6.386 ± 0.832
8.446GluLys: 8.446 ± 1.072
9.476GluLeu: 9.476 ± 1.007
3.193GluMet: 3.193 ± 0.617
5.15GluAsn: 5.15 ± 0.57
1.339GluPro: 1.339 ± 0.47
3.708GluGln: 3.708 ± 0.404
2.678GluArg: 2.678 ± 0.543
3.914GluSer: 3.914 ± 0.705
4.12GluThr: 4.12 ± 0.971
6.592GluVal: 6.592 ± 0.953
0.927GluTrp: 0.927 ± 0.283
2.472GluTyr: 2.472 ± 0.502
0.0GluXaa: 0.0 ± 0.0
Phe
2.266PheAla: 2.266 ± 0.434
0.618PheCys: 0.618 ± 0.261
2.987PheAsp: 2.987 ± 0.497
4.738PheGlu: 4.738 ± 0.892
2.472PhePhe: 2.472 ± 0.772
3.914PheGly: 3.914 ± 0.529
0.412PheHis: 0.412 ± 0.235
4.017PheIle: 4.017 ± 0.613
4.429PheLys: 4.429 ± 0.553
1.957PheLeu: 1.957 ± 0.39
1.133PheMet: 1.133 ± 0.257
3.708PheAsn: 3.708 ± 0.542
1.339PhePro: 1.339 ± 0.411
1.339PheGln: 1.339 ± 0.342
1.648PheArg: 1.648 ± 0.383
2.06PheSer: 2.06 ± 0.36
2.575PheThr: 2.575 ± 0.543
2.678PheVal: 2.678 ± 0.393
0.309PheTrp: 0.309 ± 0.178
1.854PheTyr: 1.854 ± 0.446
0.0PheXaa: 0.0 ± 0.0
Gly
3.502GlyAla: 3.502 ± 0.86
0.103GlyCys: 0.103 ± 0.108
4.326GlyAsp: 4.326 ± 0.681
3.193GlyGlu: 3.193 ± 0.475
3.708GlyPhe: 3.708 ± 0.575
3.502GlyGly: 3.502 ± 0.494
1.545GlyHis: 1.545 ± 0.439
5.974GlyIle: 5.974 ± 1.547
6.592GlyLys: 6.592 ± 0.785
4.532GlyLeu: 4.532 ± 0.658
1.751GlyMet: 1.751 ± 0.435
3.811GlyAsn: 3.811 ± 0.533
1.03GlyPro: 1.03 ± 0.324
2.678GlyGln: 2.678 ± 0.66
2.266GlyArg: 2.266 ± 0.402
3.399GlySer: 3.399 ± 0.492
2.987GlyThr: 2.987 ± 0.501
3.502GlyVal: 3.502 ± 0.666
1.03GlyTrp: 1.03 ± 0.346
2.575GlyTyr: 2.575 ± 0.424
0.0GlyXaa: 0.0 ± 0.0
His
0.618HisAla: 0.618 ± 0.231
0.206HisCys: 0.206 ± 0.141
0.618HisAsp: 0.618 ± 0.244
1.03HisGlu: 1.03 ± 0.372
1.442HisPhe: 1.442 ± 0.392
0.927HisGly: 0.927 ± 0.301
0.309HisHis: 0.309 ± 0.178
1.03HisIle: 1.03 ± 0.324
0.412HisLys: 0.412 ± 0.216
1.648HisLeu: 1.648 ± 0.507
0.515HisMet: 0.515 ± 0.236
0.824HisAsn: 0.824 ± 0.297
0.515HisPro: 0.515 ± 0.264
0.927HisGln: 0.927 ± 0.312
0.412HisArg: 0.412 ± 0.201
0.824HisSer: 0.824 ± 0.268
0.824HisThr: 0.824 ± 0.289
0.618HisVal: 0.618 ± 0.308
0.103HisTrp: 0.103 ± 0.111
0.412HisTyr: 0.412 ± 0.191
0.0HisXaa: 0.0 ± 0.0
Ile
5.459IleAla: 5.459 ± 0.915
0.618IleCys: 0.618 ± 0.279
5.356IleAsp: 5.356 ± 0.524
6.901IleGlu: 6.901 ± 1.068
3.502IlePhe: 3.502 ± 0.499
3.605IleGly: 3.605 ± 0.748
1.236IleHis: 1.236 ± 0.323
3.502IleIle: 3.502 ± 0.504
7.828IleLys: 7.828 ± 1.027
5.459IleLeu: 5.459 ± 0.688
1.339IleMet: 1.339 ± 0.36
3.811IleAsn: 3.811 ± 0.628
1.957IlePro: 1.957 ± 0.346
2.472IleGln: 2.472 ± 0.492
2.266IleArg: 2.266 ± 0.387
5.768IleSer: 5.768 ± 0.901
5.15IleThr: 5.15 ± 0.884
4.841IleVal: 4.841 ± 0.632
1.236IleTrp: 1.236 ± 0.645
1.854IleTyr: 1.854 ± 0.537
0.0IleXaa: 0.0 ± 0.0
Lys
8.549LysAla: 8.549 ± 1.251
0.103LysCys: 0.103 ± 0.096
4.944LysAsp: 4.944 ± 0.899
7.004LysGlu: 7.004 ± 0.827
3.193LysPhe: 3.193 ± 0.467
5.15LysGly: 5.15 ± 0.569
1.339LysHis: 1.339 ± 0.365
5.768LysIle: 5.768 ± 0.979
7.931LysLys: 7.931 ± 0.904
8.034LysLeu: 8.034 ± 1.021
2.884LysMet: 2.884 ± 0.466
5.562LysAsn: 5.562 ± 0.686
2.06LysPro: 2.06 ± 0.348
4.944LysGln: 4.944 ± 0.783
2.987LysArg: 2.987 ± 0.532
5.047LysSer: 5.047 ± 0.71
5.253LysThr: 5.253 ± 0.714
4.532LysVal: 4.532 ± 0.525
1.648LysTrp: 1.648 ± 0.429
2.781LysTyr: 2.781 ± 0.528
0.0LysXaa: 0.0 ± 0.0
Leu
6.077LeuAla: 6.077 ± 0.825
0.721LeuCys: 0.721 ± 0.409
5.253LeuAsp: 5.253 ± 0.704
7.725LeuGlu: 7.725 ± 1.164
3.296LeuPhe: 3.296 ± 0.689
4.841LeuGly: 4.841 ± 0.665
1.03LeuHis: 1.03 ± 0.334
4.841LeuIle: 4.841 ± 0.772
10.197LeuLys: 10.197 ± 0.997
5.15LeuLeu: 5.15 ± 0.614
2.163LeuMet: 2.163 ± 0.369
4.944LeuAsn: 4.944 ± 0.862
1.957LeuPro: 1.957 ± 0.499
3.708LeuGln: 3.708 ± 0.802
3.502LeuArg: 3.502 ± 0.606
4.326LeuSer: 4.326 ± 0.722
4.944LeuThr: 4.944 ± 0.544
4.841LeuVal: 4.841 ± 0.759
0.721LeuTrp: 0.721 ± 0.234
1.957LeuTyr: 1.957 ± 0.396
0.0LeuXaa: 0.0 ± 0.0
Met
1.957MetAla: 1.957 ± 0.506
0.103MetCys: 0.103 ± 0.118
1.545MetAsp: 1.545 ± 0.338
2.884MetGlu: 2.884 ± 0.53
1.133MetPhe: 1.133 ± 0.318
1.236MetGly: 1.236 ± 0.256
0.0MetHis: 0.0 ± 0.0
2.163MetIle: 2.163 ± 0.376
2.266MetLys: 2.266 ± 0.456
2.266MetLeu: 2.266 ± 0.51
0.412MetMet: 0.412 ± 0.18
1.957MetAsn: 1.957 ± 0.341
0.618MetPro: 0.618 ± 0.272
0.618MetGln: 0.618 ± 0.326
1.339MetArg: 1.339 ± 0.354
2.266MetSer: 2.266 ± 0.412
2.06MetThr: 2.06 ± 0.614
0.824MetVal: 0.824 ± 0.254
0.206MetTrp: 0.206 ± 0.15
0.824MetTyr: 0.824 ± 0.287
0.0MetXaa: 0.0 ± 0.0
Asn
3.605AsnAla: 3.605 ± 0.713
0.103AsnCys: 0.103 ± 0.089
3.605AsnAsp: 3.605 ± 0.572
4.738AsnGlu: 4.738 ± 0.634
2.575AsnPhe: 2.575 ± 0.49
5.768AsnGly: 5.768 ± 1.184
0.721AsnHis: 0.721 ± 0.254
4.429AsnIle: 4.429 ± 0.637
5.562AsnLys: 5.562 ± 0.547
4.12AsnLeu: 4.12 ± 0.598
1.133AsnMet: 1.133 ± 0.31
2.884AsnAsn: 2.884 ± 0.369
2.781AsnPro: 2.781 ± 0.658
3.296AsnGln: 3.296 ± 0.541
2.266AsnArg: 2.266 ± 0.418
3.914AsnSer: 3.914 ± 0.564
3.193AsnThr: 3.193 ± 0.446
3.605AsnVal: 3.605 ± 0.711
0.618AsnTrp: 0.618 ± 0.235
1.545AsnTyr: 1.545 ± 0.375
0.0AsnXaa: 0.0 ± 0.0
Pro
1.236ProAla: 1.236 ± 0.298
0.206ProCys: 0.206 ± 0.126
1.854ProAsp: 1.854 ± 0.701
1.442ProGlu: 1.442 ± 0.357
1.236ProPhe: 1.236 ± 0.319
1.236ProGly: 1.236 ± 0.423
0.618ProHis: 0.618 ± 0.245
2.266ProIle: 2.266 ± 0.44
1.545ProLys: 1.545 ± 0.507
2.163ProLeu: 2.163 ± 0.404
0.412ProMet: 0.412 ± 0.197
1.339ProAsn: 1.339 ± 0.379
0.309ProPro: 0.309 ± 0.168
1.133ProGln: 1.133 ± 0.407
1.236ProArg: 1.236 ± 0.371
1.751ProSer: 1.751 ± 0.389
1.957ProThr: 1.957 ± 0.391
1.648ProVal: 1.648 ± 0.462
0.309ProTrp: 0.309 ± 0.149
0.927ProTyr: 0.927 ± 0.286
0.0ProXaa: 0.0 ± 0.0
Gln
4.12GlnAla: 4.12 ± 0.692
0.103GlnCys: 0.103 ± 0.104
2.06GlnAsp: 2.06 ± 0.53
3.399GlnGlu: 3.399 ± 0.411
2.163GlnPhe: 2.163 ± 0.516
2.163GlnGly: 2.163 ± 0.303
0.824GlnHis: 0.824 ± 0.293
2.781GlnIle: 2.781 ± 0.546
4.017GlnLys: 4.017 ± 0.498
3.605GlnLeu: 3.605 ± 0.6
1.442GlnMet: 1.442 ± 0.486
2.369GlnAsn: 2.369 ± 0.561
1.236GlnPro: 1.236 ± 0.302
1.545GlnGln: 1.545 ± 0.469
1.751GlnArg: 1.751 ± 0.404
2.884GlnSer: 2.884 ± 0.505
2.987GlnThr: 2.987 ± 0.451
2.266GlnVal: 2.266 ± 0.502
0.412GlnTrp: 0.412 ± 0.216
1.133GlnTyr: 1.133 ± 0.348
0.0GlnXaa: 0.0 ± 0.0
Arg
2.369ArgAla: 2.369 ± 0.42
0.0ArgCys: 0.0 ± 0.0
2.472ArgAsp: 2.472 ± 0.644
3.399ArgGlu: 3.399 ± 0.784
1.339ArgPhe: 1.339 ± 0.42
1.751ArgGly: 1.751 ± 0.372
0.927ArgHis: 0.927 ± 0.281
2.472ArgIle: 2.472 ± 0.507
4.635ArgLys: 4.635 ± 0.793
2.987ArgLeu: 2.987 ± 0.45
1.133ArgMet: 1.133 ± 0.358
3.09ArgAsn: 3.09 ± 0.566
1.03ArgPro: 1.03 ± 0.378
1.442ArgGln: 1.442 ± 0.331
1.442ArgArg: 1.442 ± 0.31
1.648ArgSer: 1.648 ± 0.437
1.339ArgThr: 1.339 ± 0.407
1.442ArgVal: 1.442 ± 0.356
0.618ArgTrp: 0.618 ± 0.289
1.133ArgTyr: 1.133 ± 0.376
0.0ArgXaa: 0.0 ± 0.0
Ser
3.811SerAla: 3.811 ± 0.588
0.206SerCys: 0.206 ± 0.148
2.987SerAsp: 2.987 ± 0.549
4.738SerGlu: 4.738 ± 0.629
2.884SerPhe: 2.884 ± 0.529
4.12SerGly: 4.12 ± 0.764
0.412SerHis: 0.412 ± 0.182
5.974SerIle: 5.974 ± 0.79
3.502SerLys: 3.502 ± 0.556
5.047SerLeu: 5.047 ± 0.622
1.133SerMet: 1.133 ± 0.293
3.708SerAsn: 3.708 ± 0.486
1.648SerPro: 1.648 ± 0.484
2.987SerGln: 2.987 ± 0.557
1.854SerArg: 1.854 ± 0.515
4.532SerSer: 4.532 ± 0.985
3.811SerThr: 3.811 ± 0.778
3.09SerVal: 3.09 ± 0.434
0.618SerTrp: 0.618 ± 0.223
1.545SerTyr: 1.545 ± 0.391
0.0SerXaa: 0.0 ± 0.0
Thr
2.884ThrAla: 2.884 ± 0.811
0.309ThrCys: 0.309 ± 0.28
4.017ThrAsp: 4.017 ± 0.725
5.356ThrGlu: 5.356 ± 0.968
2.06ThrPhe: 2.06 ± 0.513
4.12ThrGly: 4.12 ± 0.685
0.618ThrHis: 0.618 ± 0.218
4.12ThrIle: 4.12 ± 0.669
3.811ThrLys: 3.811 ± 0.552
4.738ThrLeu: 4.738 ± 0.785
1.648ThrMet: 1.648 ± 0.482
2.678ThrAsn: 2.678 ± 0.414
1.133ThrPro: 1.133 ± 0.297
2.369ThrGln: 2.369 ± 0.405
2.884ThrArg: 2.884 ± 0.543
2.575ThrSer: 2.575 ± 0.377
3.605ThrThr: 3.605 ± 0.791
5.459ThrVal: 5.459 ± 0.803
0.515ThrTrp: 0.515 ± 0.214
1.751ThrTyr: 1.751 ± 0.432
0.0ThrXaa: 0.0 ± 0.0
Val
4.12ValAla: 4.12 ± 0.664
0.309ValCys: 0.309 ± 0.169
3.708ValAsp: 3.708 ± 0.565
6.386ValGlu: 6.386 ± 0.807
2.472ValPhe: 2.472 ± 0.432
4.017ValGly: 4.017 ± 0.833
1.236ValHis: 1.236 ± 0.458
3.914ValIle: 3.914 ± 0.514
6.18ValLys: 6.18 ± 0.908
4.532ValLeu: 4.532 ± 0.673
1.545ValMet: 1.545 ± 0.396
2.781ValAsn: 2.781 ± 0.484
1.442ValPro: 1.442 ± 0.43
2.781ValGln: 2.781 ± 0.617
1.854ValArg: 1.854 ± 0.442
4.429ValSer: 4.429 ± 0.636
3.399ValThr: 3.399 ± 0.746
4.12ValVal: 4.12 ± 0.641
0.515ValTrp: 0.515 ± 0.287
1.751ValTyr: 1.751 ± 0.449
0.0ValXaa: 0.0 ± 0.0
Trp
0.309TrpAla: 0.309 ± 0.153
0.0TrpCys: 0.0 ± 0.0
0.309TrpAsp: 0.309 ± 0.162
1.03TrpGlu: 1.03 ± 0.313
0.927TrpPhe: 0.927 ± 0.322
0.721TrpGly: 0.721 ± 0.215
0.206TrpHis: 0.206 ± 0.153
1.236TrpIle: 1.236 ± 0.366
0.309TrpLys: 0.309 ± 0.175
1.133TrpLeu: 1.133 ± 0.319
0.103TrpMet: 0.103 ± 0.126
1.957TrpAsn: 1.957 ± 0.906
0.206TrpPro: 0.206 ± 0.135
0.618TrpGln: 0.618 ± 0.255
0.618TrpArg: 0.618 ± 0.24
0.824TrpSer: 0.824 ± 0.235
0.412TrpThr: 0.412 ± 0.167
0.824TrpVal: 0.824 ± 0.31
0.103TrpTrp: 0.103 ± 0.094
0.206TrpTyr: 0.206 ± 0.133
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.442TyrAla: 1.442 ± 0.315
0.412TyrCys: 0.412 ± 0.238
2.369TyrAsp: 2.369 ± 0.551
2.678TyrGlu: 2.678 ± 0.562
1.545TyrPhe: 1.545 ± 0.392
1.854TyrGly: 1.854 ± 0.418
0.412TyrHis: 0.412 ± 0.212
2.781TyrIle: 2.781 ± 0.535
2.163TyrLys: 2.163 ± 0.553
2.678TyrLeu: 2.678 ± 0.585
0.824TyrMet: 0.824 ± 0.248
1.854TyrAsn: 1.854 ± 0.412
1.236TyrPro: 1.236 ± 0.346
1.648TyrGln: 1.648 ± 0.445
1.648TyrArg: 1.648 ± 0.314
1.236TyrSer: 1.236 ± 0.363
1.442TyrThr: 1.442 ± 0.401
1.854TyrVal: 1.854 ± 0.516
0.412TyrTrp: 0.412 ± 0.201
0.927TyrTyr: 0.927 ± 0.331
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (9710 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski