Amino acid dipeptide frequency for Streptococcus satellite phage Javan267

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
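As an illustration only, the counting could be sketched as follows. This is a minimal sketch, not the database's own code: the function names are hypothetical, pairs are counted within each protein (never across protein boundaries), and the normalisation by the total number of dipeptides is an assumption, since the page does not state which denominator is used.

```python
from collections import Counter

# Standard one-letter to three-letter codes, in the order used by the table.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def three_letter(residue):
    """Map a one-letter code to three letters; anything else becomes Xaa."""
    return ONE_TO_THREE.get(residue.upper(), "Xaa")


def dipeptide_counts(proteins):
    """Count ordered pairs of adjacent residues within each protein."""
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            counts[three_letter(first) + three_letter(second)] += 1
    return counts


def dipeptide_per_mille(proteins):
    """Dipeptide frequencies in per mille (assumed denominator: all pairs)."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences (hypothetical, not taken from the Javan267 proteome).
toy_proteome = ["MKLLE", "AKL"]
print(dipeptide_counts(toy_proteome)["LysLeu"])  # 2
```

Unknown or non-standard residues fall into Xaa here, which corresponds to the 21st row and column of the table.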

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
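The arithmetic behind the expected value can be checked with a short sketch. The function names are hypothetical, and comparing each value directly against the uniform expectation is an assumption about how the colouring is decided; the page does not describe the exact rule.

```python
def expected_per_mille(n_residue_types=20):
    """Expected frequency (per mille) of one dipeptide if all were equally likely."""
    return 1000.0 / (n_residue_types ** 2)


def classify(observed_per_mille, n_residue_types=20):
    """Label a value as over- or underrepresented relative to the uniform expectation."""
    expected = expected_per_mille(n_residue_types)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


print(expected_per_mille(20))  # 2.5, i.e. 0.25%
print(classify(11.619))        # GluLeu value from the table: over
print(classify(0.375))         # AlaCys value from the table: under
```

Passing n_residue_types=21 gives the expectation for the 441-dipeptide case that includes Xaa.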

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰ ± error), grouped by first residue. Second residues are listed in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 0.0 ± 0.0
AlaCys: 0.375 ± 0.305
AlaAsp: 5.247 ± 1.168
AlaGlu: 6.747 ± 1.808
AlaPhe: 4.123 ± 1.57
AlaGly: 3.373 ± 0.996
AlaHis: 0.75 ± 0.451
AlaIle: 3.748 ± 1.082
AlaLys: 4.498 ± 0.664
AlaLeu: 5.622 ± 0.882
AlaMet: 2.999 ± 1.43
AlaAsn: 2.999 ± 0.755
AlaPro: 1.499 ± 0.682
AlaGln: 1.874 ± 0.517
AlaArg: 2.999 ± 0.961
AlaSer: 3.373 ± 1.191
AlaThr: 3.373 ± 1.15
AlaVal: 1.874 ± 0.849
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.499 ± 0.635
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.75 ± 0.744
CysCys: 0.0 ± 0.0
CysAsp: 1.124 ± 0.902
CysGlu: 0.375 ± 0.398
CysPhe: 0.0 ± 0.0
CysGly: 0.75 ± 0.753
CysHis: 0.0 ± 0.0
CysIle: 0.375 ± 0.343
CysLys: 0.375 ± 0.305
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 1.124 ± 0.634
CysPro: 0.375 ± 0.377
CysGln: 1.124 ± 0.828
CysArg: 0.75 ± 0.49
CysSer: 0.375 ± 0.431
CysThr: 0.375 ± 0.318
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0

Asp
AspAla: 1.124 ± 0.943
AspCys: 0.75 ± 0.496
AspAsp: 2.249 ± 0.638
AspGlu: 4.498 ± 1.438
AspPhe: 2.999 ± 0.858
AspGly: 2.249 ± 0.9
AspHis: 0.75 ± 0.642
AspIle: 2.624 ± 0.993
AspLys: 5.247 ± 1.392
AspLeu: 7.496 ± 1.421
AspMet: 2.624 ± 0.676
AspAsn: 2.249 ± 0.63
AspPro: 0.375 ± 0.305
AspGln: 0.75 ± 0.515
AspArg: 1.874 ± 0.57
AspSer: 4.123 ± 0.978
AspThr: 1.874 ± 0.527
AspVal: 2.999 ± 1.257
AspTrp: 0.375 ± 0.305
AspTyr: 5.622 ± 1.642
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.622 ± 1.034
GluCys: 0.375 ± 0.372
GluAsp: 3.373 ± 1.022
GluGlu: 5.997 ± 1.463
GluPhe: 1.499 ± 0.727
GluGly: 2.624 ± 1.106
GluHis: 1.499 ± 0.532
GluIle: 5.997 ± 1.789
GluLys: 7.496 ± 1.666
GluLeu: 11.619 ± 2.081
GluMet: 1.499 ± 1.089
GluAsn: 4.873 ± 1.772
GluPro: 2.999 ± 1.057
GluGln: 3.373 ± 1.378
GluArg: 5.622 ± 1.562
GluSer: 4.873 ± 1.793
GluThr: 4.873 ± 1.22
GluVal: 2.624 ± 0.824
GluTrp: 1.124 ± 0.634
GluTyr: 4.498 ± 1.065
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.124 ± 0.634
PheCys: 0.375 ± 0.422
PheAsp: 4.498 ± 1.396
PheGlu: 4.873 ± 1.545
PhePhe: 2.999 ± 1.801
PheGly: 2.999 ± 0.891
PheHis: 0.75 ± 0.464
PheIle: 4.498 ± 1.285
PheLys: 2.999 ± 1.011
PheLeu: 2.624 ± 0.605
PheMet: 0.0 ± 0.0
PheAsn: 3.748 ± 1.104
PhePro: 0.375 ± 0.377
PheGln: 2.249 ± 0.907
PheArg: 1.874 ± 0.899
PheSer: 2.249 ± 0.782
PheThr: 2.249 ± 0.905
PheVal: 1.124 ± 0.765
PheTrp: 0.75 ± 0.49
PheTyr: 2.624 ± 1.003
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.373 ± 2.178
GlyCys: 1.124 ± 0.735
GlyAsp: 2.999 ± 1.177
GlyGlu: 2.999 ± 0.863
GlyPhe: 1.874 ± 1.149
GlyGly: 2.249 ± 0.87
GlyHis: 1.124 ± 0.597
GlyIle: 4.123 ± 1.022
GlyLys: 3.373 ± 1.138
GlyLeu: 4.498 ± 1.227
GlyMet: 0.75 ± 0.521
GlyAsn: 2.624 ± 1.294
GlyPro: 0.0 ± 0.0
GlyGln: 1.124 ± 0.69
GlyArg: 2.624 ± 0.843
GlySer: 1.499 ± 0.828
GlyThr: 3.748 ± 1.389
GlyVal: 3.373 ± 1.379
GlyTrp: 1.124 ± 0.725
GlyTyr: 3.748 ± 1.615
GlyXaa: 0.0 ± 0.0

His
HisAla: 3.373 ± 1.267
HisCys: 0.0 ± 0.0
HisAsp: 0.375 ± 0.372
HisGlu: 0.75 ± 0.61
HisPhe: 1.124 ± 0.596
HisGly: 1.499 ± 0.716
HisHis: 0.375 ± 0.372
HisIle: 0.375 ± 0.43
HisLys: 1.874 ± 1.049
HisLeu: 1.874 ± 0.677
HisMet: 0.375 ± 0.318
HisAsn: 0.375 ± 0.305
HisPro: 0.75 ± 0.593
HisGln: 0.375 ± 0.447
HisArg: 0.375 ± 0.333
HisSer: 1.499 ± 0.624
HisThr: 1.499 ± 0.696
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 1.124 ± 0.551
HisXaa: 0.0 ± 0.0

Ile
IleAla: 3.748 ± 1.135
IleCys: 0.75 ± 0.539
IleAsp: 3.373 ± 0.884
IleGlu: 7.871 ± 2.769
IlePhe: 2.999 ± 1.19
IleGly: 2.999 ± 0.794
IleHis: 0.375 ± 0.377
IleIle: 5.622 ± 1.48
IleLys: 7.871 ± 1.781
IleLeu: 2.999 ± 0.983
IleMet: 0.375 ± 0.305
IleAsn: 4.123 ± 1.467
IlePro: 4.123 ± 0.782
IleGln: 2.249 ± 0.487
IleArg: 1.124 ± 0.606
IleSer: 9.37 ± 1.514
IleThr: 4.873 ± 1.481
IleVal: 3.373 ± 1.144
IleTrp: 0.375 ± 0.43
IleTyr: 2.624 ± 0.795
IleXaa: 0.0 ± 0.0

Lys
LysAla: 7.121 ± 1.797
LysCys: 0.375 ± 0.372
LysAsp: 2.999 ± 1.449
LysGlu: 10.87 ± 1.963
LysPhe: 1.874 ± 0.827
LysGly: 5.247 ± 1.635
LysHis: 3.373 ± 1.02
LysIle: 6.372 ± 1.555
LysLys: 3.373 ± 1.34
LysLeu: 7.871 ± 1.59
LysMet: 1.124 ± 0.73
LysAsn: 5.997 ± 1.222
LysPro: 6.372 ± 1.439
LysGln: 2.999 ± 0.866
LysArg: 6.747 ± 1.658
LysSer: 4.873 ± 1.65
LysThr: 5.622 ± 0.979
LysVal: 4.873 ± 0.954
LysTrp: 1.124 ± 0.646
LysTyr: 1.499 ± 0.743
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 7.496 ± 1.834
LeuCys: 0.75 ± 0.49
LeuAsp: 7.871 ± 1.183
LeuGlu: 9.37 ± 1.982
LeuPhe: 4.498 ± 1.121
LeuGly: 3.373 ± 1.637
LeuHis: 1.124 ± 0.619
LeuIle: 7.121 ± 1.517
LeuLys: 8.246 ± 1.336
LeuLeu: 11.994 ± 2.434
LeuMet: 2.249 ± 0.939
LeuAsn: 5.247 ± 1.11
LeuPro: 2.624 ± 0.651
LeuGln: 3.748 ± 1.377
LeuArg: 3.748 ± 1.098
LeuSer: 6.747 ± 1.786
LeuThr: 6.747 ± 1.191
LeuVal: 3.748 ± 1.038
LeuTrp: 0.75 ± 0.464
LeuTyr: 4.873 ± 1.242
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.249 ± 0.893
MetCys: 0.0 ± 0.0
MetAsp: 2.249 ± 1.037
MetGlu: 2.624 ± 0.763
MetPhe: 0.75 ± 0.553
MetGly: 0.375 ± 0.305
MetHis: 0.0 ± 0.0
MetIle: 0.75 ± 0.449
MetLys: 2.249 ± 0.865
MetLeu: 1.499 ± 0.831
MetMet: 0.0 ± 0.0
MetAsn: 2.624 ± 0.906
MetPro: 0.375 ± 0.305
MetGln: 0.375 ± 0.372
MetArg: 1.499 ± 0.888
MetSer: 1.124 ± 0.523
MetThr: 2.624 ± 1.383
MetVal: 1.874 ± 0.906
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 1.874 ± 0.647
AsnCys: 0.375 ± 0.398
AsnAsp: 4.873 ± 1.342
AsnGlu: 3.748 ± 1.134
AsnPhe: 0.375 ± 0.398
AsnGly: 4.123 ± 1.105
AsnHis: 1.874 ± 0.744
AsnIle: 2.624 ± 1.051
AsnLys: 6.747 ± 1.055
AsnLeu: 5.247 ± 1.166
AsnMet: 1.124 ± 0.635
AsnAsn: 2.999 ± 0.935
AsnPro: 2.999 ± 1.135
AsnGln: 1.499 ± 0.779
AsnArg: 3.748 ± 1.447
AsnSer: 2.249 ± 0.722
AsnThr: 2.249 ± 0.868
AsnVal: 2.624 ± 0.92
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.499 ± 0.754
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.249 ± 0.653
ProCys: 0.0 ± 0.0
ProAsp: 1.124 ± 0.518
ProGlu: 2.999 ± 0.957
ProPhe: 3.748 ± 1.369
ProGly: 1.124 ± 0.835
ProHis: 0.0 ± 0.0
ProIle: 2.249 ± 0.869
ProLys: 5.247 ± 1.702
ProLeu: 1.499 ± 0.674
ProMet: 1.124 ± 0.594
ProAsn: 2.249 ± 1.489
ProPro: 1.124 ± 0.64
ProGln: 1.499 ± 0.625
ProArg: 2.624 ± 1.187
ProSer: 0.75 ± 0.414
ProThr: 1.499 ± 0.656
ProVal: 3.373 ± 0.781
ProTrp: 0.375 ± 0.305
ProTyr: 1.124 ± 0.513
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.748 ± 1.951
GlnCys: 0.375 ± 0.431
GlnAsp: 0.75 ± 0.517
GlnGlu: 3.373 ± 0.865
GlnPhe: 0.75 ± 0.492
GlnGly: 1.124 ± 1.028
GlnHis: 0.0 ± 0.0
GlnIle: 1.874 ± 0.88
GlnLys: 4.498 ± 0.983
GlnLeu: 4.498 ± 1.007
GlnMet: 0.75 ± 0.574
GlnAsn: 1.499 ± 0.51
GlnPro: 1.499 ± 0.848
GlnGln: 2.249 ± 1.065
GlnArg: 2.249 ± 0.512
GlnSer: 1.124 ± 0.738
GlnThr: 1.499 ± 0.567
GlnVal: 2.624 ± 0.853
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.249 ± 0.946
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.373 ± 0.917
ArgCys: 0.75 ± 0.49
ArgAsp: 1.874 ± 0.62
ArgGlu: 4.873 ± 1.228
ArgPhe: 2.999 ± 1.077
ArgGly: 2.249 ± 1.04
ArgHis: 1.124 ± 0.465
ArgIle: 4.123 ± 1.354
ArgLys: 5.247 ± 1.952
ArgLeu: 5.997 ± 1.769
ArgMet: 1.874 ± 0.611
ArgAsn: 1.874 ± 0.908
ArgPro: 1.124 ± 0.537
ArgGln: 2.624 ± 1.119
ArgArg: 1.874 ± 0.728
ArgSer: 1.874 ± 0.608
ArgThr: 4.123 ± 1.002
ArgVal: 2.249 ± 0.701
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.624 ± 1.029
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 1.874 ± 0.642
SerCys: 0.375 ± 0.377
SerAsp: 3.373 ± 1.028
SerGlu: 3.373 ± 0.838
SerPhe: 2.624 ± 1.057
SerGly: 3.373 ± 0.82
SerHis: 1.874 ± 0.731
SerIle: 5.997 ± 1.158
SerLys: 4.123 ± 1.443
SerLeu: 8.621 ± 1.702
SerMet: 1.124 ± 0.634
SerAsn: 2.249 ± 0.98
SerPro: 1.124 ± 0.497
SerGln: 1.124 ± 0.849
SerArg: 2.999 ± 1.378
SerSer: 2.624 ± 1.01
SerThr: 2.999 ± 0.642
SerVal: 4.123 ± 1.404
SerTrp: 0.375 ± 0.305
SerTyr: 3.748 ± 1.474
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 2.624 ± 0.906
ThrCys: 0.0 ± 0.0
ThrAsp: 1.874 ± 0.8
ThrGlu: 1.874 ± 0.859
ThrPhe: 5.247 ± 1.984
ThrGly: 3.373 ± 0.803
ThrHis: 1.124 ± 0.521
ThrIle: 5.247 ± 1.482
ThrLys: 5.247 ± 1.514
ThrLeu: 7.871 ± 1.322
ThrMet: 1.874 ± 0.571
ThrAsn: 0.75 ± 0.414
ThrPro: 2.999 ± 0.906
ThrGln: 1.874 ± 0.669
ThrArg: 4.123 ± 0.956
ThrSer: 2.999 ± 0.763
ThrThr: 2.624 ± 1.101
ThrVal: 4.498 ± 1.173
ThrTrp: 0.75 ± 0.461
ThrTyr: 2.249 ± 1.424
ThrXaa: 0.0 ± 0.0

Val
ValAla: 3.373 ± 0.973
ValCys: 0.375 ± 0.305
ValAsp: 1.499 ± 0.531
ValGlu: 3.373 ± 1.368
ValPhe: 1.874 ± 0.653
ValGly: 1.874 ± 0.679
ValHis: 0.75 ± 0.637
ValIle: 2.249 ± 0.731
ValLys: 6.372 ± 1.237
ValLeu: 4.498 ± 1.491
ValMet: 0.75 ± 0.578
ValAsn: 1.874 ± 0.701
ValPro: 3.373 ± 1.14
ValGln: 1.499 ± 0.923
ValArg: 2.999 ± 0.793
ValSer: 2.624 ± 1.248
ValThr: 5.247 ± 1.207
ValVal: 4.123 ± 1.192
ValTrp: 0.375 ± 0.333
ValTyr: 2.249 ± 0.751
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.375 ± 0.43
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.75 ± 0.469
TrpPhe: 0.0 ± 0.0
TrpGly: 0.375 ± 0.333
TrpHis: 0.375 ± 0.305
TrpIle: 1.124 ± 0.508
TrpLys: 0.0 ± 0.0
TrpLeu: 1.124 ± 0.551
TrpMet: 0.0 ± 0.0
TrpAsn: 0.375 ± 0.43
TrpPro: 0.375 ± 0.305
TrpGln: 0.375 ± 0.305
TrpArg: 0.375 ± 0.305
TrpSer: 1.124 ± 0.525
TrpThr: 0.0 ± 0.0
TrpVal: 0.75 ± 0.424
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.375 ± 0.305
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.499 ± 0.666
TyrCys: 0.75 ± 0.536
TyrAsp: 1.124 ± 0.655
TyrGlu: 1.499 ± 0.879
TyrPhe: 2.624 ± 0.975
TyrGly: 2.999 ± 0.823
TyrHis: 0.75 ± 0.458
TyrIle: 4.123 ± 1.024
TyrLys: 5.622 ± 1.581
TyrLeu: 4.873 ± 1.231
TyrMet: 2.249 ± 0.765
TyrAsn: 2.999 ± 1.262
TyrPro: 1.499 ± 1.082
TyrGln: 3.748 ± 0.738
TyrArg: 2.624 ± 0.963
TyrSer: 2.624 ± 0.845
TyrThr: 1.124 ± 0.537
TyrVal: 1.124 ± 0.511
TyrTrp: 0.375 ± 0.377
TyrTyr: 1.874 ± 0.874
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 14 proteins (2669 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
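A protein-level bootstrap of this kind could look roughly like the sketch below. It is only an illustration under stated assumptions: the function names are hypothetical, whole proteins are resampled with replacement, and the reported error is taken to be the standard deviation across replicates, which the page does not confirm.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of a one-letter dipeptide such as 'EL'."""
    total = 0
    hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Spread of the dipeptide frequency across protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    # Assumption: the error is the standard deviation of the replicates.
    return statistics.stdev(estimates)


# Toy sequences (hypothetical, not taken from the Javan267 proteome).
toy_proteome = ["MKELLE", "AELK", "MKKT"]
print(bootstrap_error(toy_proteome, "EL"))
```

Resampling at the protein level rather than the residue level keeps each protein's internal composition intact, so the spread reflects variation between proteins.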

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski