Amino acid dipeptide frequency for Streptococcus satellite phage Javan287

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
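
As an illustration of what a dipeptide frequency is, here is a minimal Python sketch. It is not the code used to build this page: the function names and the toy sequences are made up for the example, it uses one-letter residue codes rather than the three-letter names in the table, and it assumes that pairs are counted within each protein and normalized by the total number of pairs.

```python
from collections import Counter


def dipeptide_counts(sequence):
    """Count overlapping pairs of adjacent residues in one protein sequence."""
    return Counter(sequence[i:i + 2] for i in range(len(sequence) - 1))


def dipeptide_frequencies(proteins):
    """Pool pair counts over all proteins and normalize to fractions.

    Assumption: pairs never span two proteins, and the denominator is the
    total number of pairs observed.
    """
    total = Counter()
    for seq in proteins:
        total.update(dipeptide_counts(seq))
    n_pairs = sum(total.values())
    if n_pairs == 0:
        return {}
    return {pair: count / n_pairs for pair, count in total.items()}


# Hypothetical toy sequences, not taken from the Javan287 proteome.
toy_proteins = ["MKLLE", "MKK"]
freqs = dipeptide_frequencies(toy_proteins)
```

In this toy input the pair MK occurs twice among six pairs, so its frequency is one third.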

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
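
The expected value quoted above follows from simple arithmetic, sketched below in Python. The `colour` helper is hypothetical: it assumes the red/blue marking is a plain comparison against the uniform expectation, which the page does not spell out.

```python
# Number of ordered pairs for 20 standard amino acids, and with Xaa as a 21st.
n_pairs_standard = 20 ** 2   # 400
n_pairs_with_xaa = 21 ** 2   # 441

# Uniform expectation if every dipeptide were equally likely.
expected_fraction = 1 / n_pairs_standard          # 0.0025, i.e. 0.25%
expected_per_mille = 1000 * expected_fraction     # 2.5 per mille


def colour(freq_per_mille, expected=expected_per_mille):
    """Hypothetical marking rule: red above expectation, blue below."""
    if freq_per_mille > expected:
        return "red"
    if freq_per_mille < expected:
        return "blue"
    return "none"


# Values taken from the table on this page (in per mille).
examples = {"LeuLys": 16.949, "AspAsp": 2.493, "CysCys": 0.0}
marks = {name: colour(value) for name, value in examples.items()}
```

Under this assumed rule, LeuLys would be marked red, while AspAsp and CysCys would be marked blue.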

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
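
The unit conversion can be sketched as follows. The helper names are made up for this example; the input value is the LeuLys entry from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_per_mille / 10


leu_lys_per_mille = 16.949                                     # LeuLys entry from the table
leu_lys_fraction = per_mille_to_fraction(leu_lys_per_mille)    # about 0.016949
leu_lys_percent = per_mille_to_percent(leu_lys_per_mille)      # about 1.6949
```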

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.997 ± 0.88
AlaCys: 0.499 ± 0.347
AlaAsp: 2.991 ± 0.954
AlaGlu: 6.481 ± 2.234
AlaPhe: 1.496 ± 1.047
AlaGly: 2.991 ± 1.101
AlaHis: 0.0 ± 0.0
AlaIle: 3.988 ± 1.816
AlaLys: 8.973 ± 3.592
AlaLeu: 1.994 ± 0.787
AlaMet: 2.991 ± 1.036
AlaAsn: 2.991 ± 1.229
AlaPro: 0.499 ± 0.444
AlaGln: 1.496 ± 0.593
AlaArg: 1.496 ± 0.827
AlaSer: 3.49 ± 0.923
AlaThr: 4.985 ± 1.861
AlaVal: 1.994 ± 0.841
AlaTrp: 1.496 ± 0.712
AlaTyr: 3.49 ± 1.26
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.499 ± 0.499
CysCys: 0.0 ± 0.0
CysAsp: 0.499 ± 0.511
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.499 ± 0.371
CysHis: 0.0 ± 0.0
CysIle: 1.496 ± 0.674
CysLys: 0.499 ± 0.451
CysLeu: 0.499 ± 0.511
CysMet: 0.0 ± 0.0
CysAsn: 0.997 ± 0.549
CysPro: 1.994 ± 0.995
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.499 ± 0.497
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.997 ± 0.88
AspCys: 0.499 ± 0.371
AspAsp: 2.493 ± 0.818
AspGlu: 3.988 ± 1.604
AspPhe: 3.49 ± 0.889
AspGly: 0.499 ± 0.371
AspHis: 0.0 ± 0.0
AspIle: 6.481 ± 1.362
AspLys: 7.478 ± 0.968
AspLeu: 5.484 ± 1.264
AspMet: 0.997 ± 0.549
AspAsn: 3.988 ± 1.52
AspPro: 0.997 ± 0.661
AspGln: 1.994 ± 1.046
AspArg: 3.988 ± 1.316
AspSer: 1.496 ± 0.609
AspThr: 3.49 ± 0.898
AspVal: 1.994 ± 0.661
AspTrp: 1.496 ± 0.778
AspTyr: 4.985 ± 1.697
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.988 ± 1.824
GluCys: 1.496 ± 0.87
GluAsp: 4.985 ± 1.754
GluGlu: 7.478 ± 2.481
GluPhe: 2.493 ± 1.154
GluGly: 5.982 ± 1.551
GluHis: 0.499 ± 0.371
GluIle: 6.979 ± 2.192
GluLys: 8.973 ± 2.42
GluLeu: 15.952 ± 2.864
GluMet: 1.994 ± 1.097
GluAsn: 4.985 ± 0.982
GluPro: 1.994 ± 0.637
GluGln: 2.991 ± 0.849
GluArg: 2.991 ± 1.613
GluSer: 3.988 ± 2.292
GluThr: 1.496 ± 0.609
GluVal: 4.985 ± 1.543
GluTrp: 0.997 ± 0.556
GluTyr: 2.991 ± 1.223
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.493 ± 1.438
PheCys: 1.496 ± 0.676
PheAsp: 1.994 ± 1.062
PheGlu: 2.493 ± 0.715
PhePhe: 1.496 ± 0.675
PheGly: 2.991 ± 0.82
PheHis: 0.997 ± 0.486
PheIle: 2.493 ± 0.883
PheLys: 2.493 ± 0.71
PheLeu: 4.487 ± 1.119
PheMet: 0.499 ± 0.353
PheAsn: 1.994 ± 0.855
PhePro: 0.499 ± 0.444
PheGln: 0.997 ± 0.543
PheArg: 0.499 ± 0.451
PheSer: 1.994 ± 0.769
PheThr: 1.994 ± 1.079
PheVal: 1.496 ± 0.688
PheTrp: 0.499 ± 0.371
PheTyr: 1.496 ± 1.122
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.994 ± 1.063
GlyCys: 0.0 ± 0.0
GlyAsp: 0.997 ± 0.486
GlyGlu: 3.988 ± 1.451
GlyPhe: 1.496 ± 1.068
GlyGly: 1.994 ± 0.717
GlyHis: 0.499 ± 0.347
GlyIle: 3.988 ± 1.418
GlyLys: 4.487 ± 0.985
GlyLeu: 4.985 ± 1.223
GlyMet: 1.496 ± 0.74
GlyAsn: 4.487 ± 1.839
GlyPro: 0.0 ± 0.0
GlyGln: 0.997 ± 0.581
GlyArg: 1.496 ± 0.609
GlySer: 1.496 ± 0.746
GlyThr: 1.496 ± 0.747
GlyVal: 2.991 ± 1.114
GlyTrp: 0.499 ± 0.499
GlyTyr: 3.988 ± 1.192
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.997 ± 0.695
HisCys: 0.0 ± 0.0
HisAsp: 0.499 ± 0.444
HisGlu: 0.997 ± 0.661
HisPhe: 0.997 ± 0.486
HisGly: 0.997 ± 0.486
HisHis: 0.0 ± 0.0
HisIle: 0.499 ± 0.347
HisLys: 0.499 ± 0.371
HisLeu: 1.496 ± 0.941
HisMet: 0.0 ± 0.0
HisAsn: 1.994 ± 0.82
HisPro: 0.0 ± 0.0
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 0.499 ± 0.347
HisThr: 0.499 ± 0.347
HisVal: 0.499 ± 0.511
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.487 ± 2.028
IleCys: 0.0 ± 0.0
IleAsp: 6.481 ± 1.787
IleGlu: 6.481 ± 2.074
IlePhe: 2.493 ± 1.18
IleGly: 1.994 ± 0.639
IleHis: 0.0 ± 0.0
IleIle: 2.991 ± 1.12
IleLys: 7.976 ± 1.232
IleLeu: 7.478 ± 1.321
IleMet: 2.493 ± 0.787
IleAsn: 6.481 ± 1.58
IlePro: 2.493 ± 1.091
IleGln: 2.493 ± 1.043
IleArg: 1.496 ± 0.739
IleSer: 2.991 ± 1.121
IleThr: 3.49 ± 0.907
IleVal: 4.487 ± 1.576
IleTrp: 0.499 ± 0.52
IleTyr: 2.991 ± 1.254
IleXaa: 0.0 ± 0.0
Lys
LysAla: 8.475 ± 2.24
LysCys: 1.496 ± 0.673
LysAsp: 3.988 ± 1.264
LysGlu: 11.964 ± 2.164
LysPhe: 2.991 ± 0.929
LysGly: 5.982 ± 1.547
LysHis: 0.997 ± 0.553
LysIle: 5.484 ± 2.021
LysLys: 10.469 ± 1.786
LysLeu: 12.961 ± 2.572
LysMet: 1.994 ± 1.064
LysAsn: 7.976 ± 1.655
LysPro: 2.493 ± 0.936
LysGln: 6.979 ± 2.254
LysArg: 4.985 ± 2.178
LysSer: 7.976 ± 2.009
LysThr: 5.982 ± 1.304
LysVal: 5.982 ± 0.885
LysTrp: 1.496 ± 0.677
LysTyr: 5.484 ± 1.411
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.481 ± 1.411
LeuCys: 0.0 ± 0.0
LeuAsp: 7.976 ± 1.75
LeuGlu: 12.463 ± 3.04
LeuPhe: 3.49 ± 1.101
LeuGly: 4.985 ± 1.621
LeuHis: 0.499 ± 0.347
LeuIle: 5.484 ± 1.192
LeuLys: 16.949 ± 2.705
LeuLeu: 9.472 ± 1.995
LeuMet: 3.49 ± 1.062
LeuAsn: 4.985 ± 1.369
LeuPro: 2.493 ± 0.847
LeuGln: 3.49 ± 1.178
LeuArg: 4.487 ± 1.497
LeuSer: 6.979 ± 2.101
LeuThr: 5.484 ± 1.695
LeuVal: 4.487 ± 1.21
LeuTrp: 0.499 ± 0.347
LeuTyr: 3.988 ± 0.739
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.496 ± 0.858
MetCys: 0.499 ± 0.451
MetAsp: 1.994 ± 0.982
MetGlu: 2.991 ± 1.338
MetPhe: 0.499 ± 0.522
MetGly: 0.499 ± 0.54
MetHis: 0.0 ± 0.0
MetIle: 0.997 ± 0.637
MetLys: 0.997 ± 0.6
MetLeu: 1.994 ± 0.934
MetMet: 0.0 ± 0.0
MetAsn: 1.496 ± 0.632
MetPro: 0.0 ± 0.0
MetGln: 1.496 ± 0.86
MetArg: 1.496 ± 0.81
MetSer: 1.496 ± 1.074
MetThr: 3.988 ± 1.086
MetVal: 0.499 ± 0.499
MetTrp: 0.0 ± 0.0
MetTyr: 0.499 ± 0.52
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.487 ± 1.185
AsnCys: 0.499 ± 0.511
AsnAsp: 2.991 ± 1.3
AsnGlu: 3.49 ± 1.537
AsnPhe: 1.496 ± 0.783
AsnGly: 3.988 ± 0.844
AsnHis: 0.499 ± 0.511
AsnIle: 3.49 ± 1.344
AsnLys: 6.481 ± 1.184
AsnLeu: 4.487 ± 0.991
AsnMet: 1.496 ± 0.864
AsnAsn: 3.49 ± 0.807
AsnPro: 2.991 ± 1.142
AsnGln: 1.994 ± 0.906
AsnArg: 3.49 ± 0.783
AsnSer: 2.991 ± 1.281
AsnThr: 7.478 ± 2.58
AsnVal: 4.487 ± 1.142
AsnTrp: 1.496 ± 0.858
AsnTyr: 2.991 ± 0.688
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.997 ± 0.467
ProCys: 0.0 ± 0.0
ProAsp: 1.496 ± 0.83
ProGlu: 1.496 ± 0.953
ProPhe: 2.493 ± 1.109
ProGly: 0.499 ± 0.451
ProHis: 0.0 ± 0.0
ProIle: 1.496 ± 0.706
ProLys: 3.49 ± 1.539
ProLeu: 1.994 ± 0.843
ProMet: 0.499 ± 0.371
ProAsn: 0.997 ± 0.467
ProPro: 0.499 ± 0.347
ProGln: 0.499 ± 0.371
ProArg: 1.496 ± 0.528
ProSer: 0.0 ± 0.0
ProThr: 1.994 ± 0.877
ProVal: 1.994 ± 1.017
ProTrp: 0.0 ± 0.0
ProTyr: 0.499 ± 0.444
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.988 ± 1.725
GlnCys: 0.0 ± 0.0
GlnAsp: 0.997 ± 0.629
GlnGlu: 5.982 ± 1.545
GlnPhe: 0.499 ± 0.347
GlnGly: 0.0 ± 0.0
GlnHis: 1.496 ± 0.629
GlnIle: 2.991 ± 1.128
GlnLys: 3.988 ± 1.132
GlnLeu: 6.979 ± 1.961
GlnMet: 0.0 ± 0.0
GlnAsn: 1.994 ± 0.906
GlnPro: 1.496 ± 0.768
GlnGln: 6.979 ± 1.416
GlnArg: 0.997 ± 0.486
GlnSer: 3.988 ± 1.355
GlnThr: 0.997 ± 0.846
GlnVal: 1.994 ± 1.107
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.997 ± 0.553
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.991 ± 0.938
ArgCys: 0.0 ± 0.0
ArgAsp: 2.991 ± 1.077
ArgGlu: 2.493 ± 0.467
ArgPhe: 0.499 ± 0.451
ArgGly: 1.994 ± 0.697
ArgHis: 0.499 ± 0.347
ArgIle: 2.493 ± 1.02
ArgLys: 5.982 ± 1.104
ArgLeu: 4.487 ± 1.114
ArgMet: 0.0 ± 0.0
ArgAsn: 1.496 ± 0.567
ArgPro: 0.0 ± 0.0
ArgGln: 3.988 ± 0.874
ArgArg: 0.997 ± 0.695
ArgSer: 2.493 ± 0.718
ArgThr: 0.499 ± 0.568
ArgVal: 1.994 ± 0.833
ArgTrp: 0.0 ± 0.0
ArgTyr: 0.997 ± 0.532
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 0.997 ± 0.467
SerCys: 0.499 ± 0.371
SerAsp: 4.985 ± 1.344
SerGlu: 3.988 ± 1.134
SerPhe: 1.496 ± 0.632
SerGly: 0.997 ± 0.701
SerHis: 0.499 ± 0.499
SerIle: 3.988 ± 1.162
SerLys: 5.484 ± 1.435
SerLeu: 5.982 ± 0.972
SerMet: 2.991 ± 0.932
SerAsn: 1.496 ± 0.869
SerPro: 0.499 ± 0.371
SerGln: 1.994 ± 1.257
SerArg: 0.997 ± 0.532
SerSer: 1.994 ± 0.626
SerThr: 4.487 ± 0.896
SerVal: 3.49 ± 1.278
SerTrp: 0.0 ± 0.0
SerTyr: 3.988 ± 1.088
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.991 ± 1.291
ThrCys: 0.0 ± 0.0
ThrAsp: 2.991 ± 0.959
ThrGlu: 3.988 ± 1.417
ThrPhe: 3.49 ± 1.604
ThrGly: 1.994 ± 1.063
ThrHis: 0.499 ± 0.347
ThrIle: 6.481 ± 1.246
ThrLys: 4.985 ± 1.939
ThrLeu: 6.979 ± 1.369
ThrMet: 0.499 ± 0.371
ThrAsn: 5.484 ± 1.601
ThrPro: 1.496 ± 0.679
ThrGln: 3.49 ± 1.117
ThrArg: 0.997 ± 0.618
ThrSer: 0.499 ± 0.52
ThrThr: 4.487 ± 1.135
ThrVal: 2.493 ± 1.043
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.493 ± 0.777
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.496 ± 0.644
ValCys: 0.499 ± 0.499
ValAsp: 2.493 ± 1.347
ValGlu: 1.994 ± 0.966
ValPhe: 2.493 ± 1.119
ValGly: 1.994 ± 0.817
ValHis: 2.493 ± 1.037
ValIle: 4.487 ± 0.97
ValLys: 7.478 ± 1.863
ValLeu: 4.487 ± 1.876
ValMet: 0.499 ± 0.488
ValAsn: 3.988 ± 1.626
ValPro: 1.994 ± 0.572
ValGln: 2.493 ± 0.972
ValArg: 2.991 ± 1.524
ValSer: 3.49 ± 0.947
ValThr: 0.997 ± 0.553
ValVal: 1.994 ± 0.659
ValTrp: 0.0 ± 0.0
ValTyr: 2.493 ± 0.613
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.997 ± 0.467
TrpCys: 0.0 ± 0.0
TrpAsp: 0.499 ± 0.347
TrpGlu: 1.994 ± 0.572
TrpPhe: 0.0 ± 0.0
TrpGly: 0.499 ± 0.511
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.499 ± 0.444
TrpLeu: 1.496 ± 0.678
TrpMet: 0.0 ± 0.0
TrpAsn: 0.997 ± 0.605
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.499 ± 0.371
TrpSer: 0.499 ± 0.347
TrpThr: 0.0 ± 0.0
TrpVal: 0.997 ± 0.698
TrpTrp: 0.499 ± 0.347
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.49 ± 1.224
TyrCys: 0.0 ± 0.0
TyrAsp: 2.493 ± 1.209
TyrGlu: 3.49 ± 1.461
TyrPhe: 1.994 ± 0.905
TyrGly: 1.994 ± 1.106
TyrHis: 0.997 ± 0.661
TyrIle: 3.988 ± 1.269
TyrLys: 8.475 ± 1.721
TyrLeu: 4.487 ± 0.836
TyrMet: 0.499 ± 0.44
TyrAsn: 2.493 ± 1.046
TyrPro: 0.0 ± 0.0
TyrGln: 1.994 ± 0.626
TyrArg: 1.496 ± 0.747
TyrSer: 2.493 ± 0.847
TyrThr: 2.493 ± 0.938
TyrVal: 1.496 ± 0.63
TyrTrp: 0.0 ± 0.0
TyrTyr: 2.991 ± 1.408
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 13 proteins (2007 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
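
A protein-level bootstrap of this kind can be sketched in Python as follows. This is only an illustration under stated assumptions, not the procedure actually used for this page: whole proteins are resampled with replacement 100 times, the frequency is recomputed for each resample, and the sample standard deviation of those values is reported as the error. All function names, the seed, and the toy sequences (one-letter codes) are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency of one dipeptide, in per mille, pooled over the proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not the Javan287 sequences.
toy = ["MKLLEK", "MKKAL", "MALKE", "MLKLK"]
point = pair_frequency(toy, "LK")
error = bootstrap_error(toy, "LK")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins happen to be in the set.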

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski