Amino acid dipeptide frequency for Gata virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
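The counting described above can be sketched as follows. This is a minimal illustration and not the database's own code: it assumes that dipeptides are overlapping adjacent residue pairs, that pairs never span two proteins, and that frequencies are normalised by the total number of pairs counted. The function names and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

# 20 standard residues (one-letter codes); anything else is pooled as "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"

# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5


def dipeptide_counts(proteins):
    """Count overlapping adjacent pairs; assumed not to span protein boundaries."""
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    return counts


def dipeptide_permille(proteins):
    """Return all 441 dipeptide frequencies in per mille (assumed normalisation)."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(ALPHABET, repeat=2)}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


# Toy input (hypothetical sequences, not the Gata virus proteome).
toy = ["MALLSG", "ALLK"]
freqs = dipeptide_permille(toy)
over = sorted(dp for dp, f in freqs.items() if f > EXPECTED_PERMILLE)
print(over)
```

Dipeptides in `over` correspond to the cells that would be marked red; those below `EXPECTED_PERMILLE` would be marked blue.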

All values are presented in per mille (‰); multiply them by 10^-3 to obtain fractions.
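As a worked example of the unit conversion, take the LeuSer entry of the table (10.894‰) and the uniform expectation for 400 dipeptides:

```latex
\begin{align*}
10.894\,\text{\textperthousand} &= 10.894 \times 10^{-3} = 0.010894 = 1.0894\,\% \\
\frac{1}{400} &= 0.0025 = 0.25\,\% = 2.5\,\text{\textperthousand}
\end{align*}
```

A table value above 2.5 is therefore more frequent than the uniform expectation, and a value below 2.5 is less frequent.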

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.117 ± 0.413
AlaCys: 0.838 ± 0.423
AlaAsp: 1.676 ± 1.295
AlaGlu: 1.397 ± 0.612
AlaPhe: 0.838 ± 0.858
AlaGly: 2.793 ± 0.766
AlaHis: 0.559 ± 0.313
AlaIle: 3.073 ± 0.837
AlaLys: 1.397 ± 0.891
AlaLeu: 6.145 ± 1.956
AlaMet: 0.559 ± 0.417
AlaAsn: 2.235 ± 0.853
AlaPro: 2.235 ± 0.944
AlaGln: 1.676 ± 0.5
AlaArg: 2.514 ± 1.191
AlaSer: 3.352 ± 0.711
AlaThr: 1.955 ± 0.293
AlaVal: 1.117 ± 0.321
AlaTrp: 1.117 ± 0.427
AlaTyr: 2.793 ± 0.464
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.117 ± 0.413
CysCys: 1.117 ± 1.486
CysAsp: 0.559 ± 0.605
CysGlu: 0.0 ± 0.0
CysPhe: 0.279 ± 0.157
CysGly: 0.838 ± 0.256
CysHis: 0.279 ± 0.371
CysIle: 1.117 ± 0.427
CysLys: 0.559 ± 0.278
CysLeu: 2.793 ± 0.872
CysMet: 0.0 ± 0.0
CysAsn: 0.559 ± 0.278
CysPro: 0.279 ± 0.371
CysGln: 0.559 ± 0.417
CysArg: 1.117 ± 0.556
CysSer: 3.073 ± 1.263
CysThr: 1.117 ± 0.556
CysVal: 1.676 ± 0.679
CysTrp: 0.279 ± 0.157
CysTyr: 0.279 ± 0.371
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.073 ± 1.25
AspCys: 0.838 ± 0.327
AspAsp: 2.235 ± 0.917
AspGlu: 2.793 ± 2.188
AspPhe: 2.235 ± 0.557
AspGly: 3.631 ± 0.877
AspHis: 1.676 ± 0.846
AspIle: 4.19 ± 1.011
AspLys: 4.19 ± 0.583
AspLeu: 6.145 ± 1.096
AspMet: 1.955 ± 0.861
AspAsn: 3.352 ± 0.45
AspPro: 4.19 ± 1.049
AspGln: 1.955 ± 0.56
AspArg: 2.793 ± 0.842
AspSer: 3.911 ± 0.781
AspThr: 4.469 ± 0.58
AspVal: 3.352 ± 0.889
AspTrp: 0.279 ± 0.371
AspTyr: 1.676 ± 0.475
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.955 ± 0.899
GluCys: 0.559 ± 0.743
GluAsp: 2.235 ± 0.966
GluGlu: 3.911 ± 0.581
GluPhe: 3.911 ± 1.22
GluGly: 3.073 ± 1.151
GluHis: 0.279 ± 0.157
GluIle: 3.631 ± 1.035
GluLys: 2.793 ± 0.842
GluLeu: 6.145 ± 0.97
GluMet: 1.955 ± 1.177
GluAsn: 0.838 ± 0.621
GluPro: 2.514 ± 0.48
GluGln: 1.117 ± 0.608
GluArg: 1.676 ± 0.5
GluSer: 3.352 ± 1.142
GluThr: 3.631 ± 0.994
GluVal: 2.235 ± 0.634
GluTrp: 0.559 ± 0.589
GluTyr: 3.352 ± 0.913
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.793 ± 0.909
PheCys: 0.559 ± 0.542
PheAsp: 1.955 ± 0.53
PheGlu: 2.235 ± 0.992
PhePhe: 2.514 ± 1.653
PheGly: 0.559 ± 0.278
PheHis: 2.235 ± 0.634
PheIle: 2.235 ± 1.715
PheLys: 4.19 ± 1.049
PheLeu: 6.425 ± 1.126
PheMet: 1.676 ± 0.578
PheAsn: 1.676 ± 0.571
PhePro: 2.793 ± 0.569
PheGln: 1.955 ± 0.478
PheArg: 3.073 ± 0.987
PheSer: 2.793 ± 0.809
PheThr: 2.514 ± 0.951
PheVal: 2.514 ± 0.73
PheTrp: 0.279 ± 0.484
PheTyr: 0.279 ± 0.157
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.631 ± 1.328
GlyCys: 1.117 ± 0.459
GlyAsp: 2.793 ± 0.477
GlyGlu: 4.469 ± 1.061
GlyPhe: 1.955 ± 0.86
GlyGly: 1.955 ± 0.715
GlyHis: 1.117 ± 0.321
GlyIle: 4.469 ± 0.416
GlyLys: 3.073 ± 1.023
GlyLeu: 9.218 ± 0.96
GlyMet: 0.559 ± 0.313
GlyAsn: 2.514 ± 0.763
GlyPro: 1.117 ± 0.321
GlyGln: 2.793 ± 1.258
GlyArg: 3.352 ± 0.938
GlySer: 5.307 ± 1.033
GlyThr: 2.514 ± 0.838
GlyVal: 3.073 ± 0.786
GlyTrp: 0.838 ± 0.47
GlyTyr: 1.955 ± 1.165
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.559 ± 0.313
HisCys: 0.0 ± 0.0
HisAsp: 0.838 ± 0.487
HisGlu: 1.117 ± 0.321
HisPhe: 1.676 ± 0.571
HisGly: 1.676 ± 0.462
HisHis: 0.559 ± 0.313
HisIle: 2.235 ± 0.573
HisLys: 2.235 ± 0.557
HisLeu: 1.117 ± 0.488
HisMet: 0.0 ± 0.412
HisAsn: 0.559 ± 0.313
HisPro: 1.955 ± 0.782
HisGln: 0.838 ± 0.47
HisArg: 1.397 ± 0.532
HisSer: 1.955 ± 1.017
HisThr: 0.838 ± 0.433
HisVal: 2.514 ± 0.73
HisTrp: 0.279 ± 0.157
HisTyr: 1.955 ± 0.782
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.235 ± 0.853
IleCys: 0.559 ± 0.278
IleAsp: 2.793 ± 1.064
IleGlu: 2.514 ± 0.703
IlePhe: 2.235 ± 0.366
IleGly: 3.911 ± 0.997
IleHis: 1.676 ± 0.866
IleIle: 4.749 ± 0.897
IleLys: 6.145 ± 0.646
IleLeu: 5.028 ± 1.775
IleMet: 1.676 ± 0.5
IleAsn: 3.073 ± 0.773
IlePro: 5.028 ± 1.724
IleGln: 1.397 ± 0.851
IleArg: 5.028 ± 1.285
IleSer: 6.704 ± 1.386
IleThr: 5.587 ± 1.197
IleVal: 4.749 ± 1.266
IleTrp: 0.838 ± 0.487
IleTyr: 3.073 ± 0.673
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.676 ± 0.356
LysCys: 1.117 ± 1.005
LysAsp: 3.352 ± 0.245
LysGlu: 2.514 ± 0.48
LysPhe: 2.514 ± 0.583
LysGly: 3.352 ± 1.474
LysHis: 1.397 ± 0.532
LysIle: 5.587 ± 0.753
LysLys: 4.469 ± 1.479
LysLeu: 5.307 ± 1.455
LysMet: 1.955 ± 0.81
LysAsn: 3.352 ± 0.97
LysPro: 3.352 ± 0.677
LysGln: 1.955 ± 0.642
LysArg: 2.514 ± 0.306
LysSer: 5.587 ± 1.642
LysThr: 4.469 ± 1.027
LysVal: 3.911 ± 1.183
LysTrp: 2.514 ± 0.48
LysTyr: 0.838 ± 0.256
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.352 ± 1.244
LeuCys: 1.397 ± 0.511
LeuAsp: 7.821 ± 0.982
LeuGlu: 6.145 ± 0.631
LeuPhe: 3.352 ± 0.78
LeuGly: 6.145 ± 1.222
LeuHis: 3.073 ± 0.52
LeuIle: 10.056 ± 3.424
LeuLys: 5.866 ± 0.984
LeuLeu: 7.821 ± 2.154
LeuMet: 2.514 ± 0.629
LeuAsn: 8.659 ± 1.198
LeuPro: 2.514 ± 0.629
LeuGln: 0.838 ± 0.417
LeuArg: 6.145 ± 2.198
LeuSer: 10.894 ± 1.282
LeuThr: 5.307 ± 1.432
LeuVal: 4.469 ± 1.699
LeuTrp: 0.279 ± 0.157
LeuTyr: 3.073 ± 0.335
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.397 ± 0.762
MetCys: 0.559 ± 0.432
MetAsp: 1.676 ± 0.7
MetGlu: 2.235 ± 0.697
MetPhe: 1.117 ± 0.724
MetGly: 1.676 ± 1.018
MetHis: 0.279 ± 0.484
MetIle: 1.955 ± 0.43
MetLys: 1.397 ± 0.58
MetLeu: 2.235 ± 0.965
MetMet: 0.279 ± 0.157
MetAsn: 0.838 ± 0.514
MetPro: 0.838 ± 0.621
MetGln: 0.559 ± 0.304
MetArg: 1.397 ± 0.784
MetSer: 3.073 ± 0.722
MetThr: 1.676 ± 0.97
MetVal: 0.838 ± 0.47
MetTrp: 0.559 ± 0.313
MetTyr: 1.397 ± 0.285
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.235 ± 0.78
AsnCys: 0.838 ± 0.256
AsnAsp: 4.469 ± 0.548
AsnGlu: 1.117 ± 0.483
AsnPhe: 2.514 ± 0.75
AsnGly: 1.117 ± 1.221
AsnHis: 1.955 ± 0.81
AsnIle: 3.911 ± 1.435
AsnLys: 2.793 ± 1.162
AsnLeu: 5.866 ± 0.901
AsnMet: 0.279 ± 0.157
AsnAsn: 3.631 ± 1.062
AsnPro: 3.631 ± 1.193
AsnGln: 3.073 ± 1.183
AsnArg: 1.676 ± 0.509
AsnSer: 5.866 ± 1.206
AsnThr: 1.955 ± 0.461
AsnVal: 3.073 ± 0.616
AsnTrp: 1.397 ± 0.436
AsnTyr: 1.955 ± 0.293
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.397 ± 0.532
ProCys: 0.279 ± 0.157
ProAsp: 4.469 ± 1.279
ProGlu: 1.397 ± 0.623
ProPhe: 1.117 ± 0.863
ProGly: 3.352 ± 2.485
ProHis: 1.397 ± 0.511
ProIle: 2.793 ± 1.402
ProLys: 3.073 ± 0.524
ProLeu: 4.469 ± 0.911
ProMet: 0.838 ± 0.49
ProAsn: 3.073 ± 1.389
ProPro: 3.352 ± 1.574
ProGln: 1.117 ± 0.427
ProArg: 1.676 ± 0.571
ProSer: 5.307 ± 1.166
ProThr: 3.352 ± 1.49
ProVal: 3.352 ± 0.724
ProTrp: 0.279 ± 0.157
ProTyr: 1.676 ± 0.517
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.676 ± 1.436
GlnCys: 0.838 ± 0.47
GlnAsp: 3.073 ± 0.527
GlnGlu: 1.955 ± 1.277
GlnPhe: 1.955 ± 1.03
GlnGly: 1.676 ± 0.94
GlnHis: 0.838 ± 0.423
GlnIle: 1.676 ± 0.571
GlnLys: 0.838 ± 0.256
GlnLeu: 1.676 ± 0.667
GlnMet: 0.559 ± 0.304
GlnAsn: 2.235 ± 0.573
GlnPro: 0.559 ± 0.432
GlnGln: 0.838 ± 0.871
GlnArg: 1.676 ± 0.94
GlnSer: 3.911 ± 0.828
GlnThr: 2.235 ± 1.003
GlnVal: 2.514 ± 0.889
GlnTrp: 0.559 ± 0.417
GlnTyr: 1.117 ± 0.627
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.793 ± 0.958
ArgCys: 1.117 ± 0.627
ArgAsp: 4.19 ± 0.78
ArgGlu: 2.514 ± 0.389
ArgPhe: 3.352 ± 1.177
ArgGly: 2.235 ± 0.957
ArgHis: 2.514 ± 0.75
ArgIle: 1.955 ± 0.48
ArgLys: 1.955 ± 0.855
ArgLeu: 3.352 ± 1.474
ArgMet: 3.073 ± 1.113
ArgAsn: 3.911 ± 0.843
ArgPro: 2.514 ± 1.211
ArgGln: 1.676 ± 0.894
ArgArg: 1.955 ± 0.55
ArgSer: 6.145 ± 1.104
ArgThr: 2.514 ± 1.431
ArgVal: 2.235 ± 0.825
ArgTrp: 1.117 ± 0.76
ArgTyr: 0.559 ± 0.304
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.911 ± 1.091
SerCys: 2.235 ± 1.112
SerAsp: 4.749 ± 1.44
SerGlu: 5.028 ± 2.213
SerPhe: 5.028 ± 1.123
SerGly: 7.821 ± 1.226
SerHis: 2.235 ± 0.459
SerIle: 4.749 ± 0.843
SerLys: 6.145 ± 1.368
SerLeu: 9.497 ± 1.254
SerMet: 3.073 ± 0.334
SerAsn: 3.352 ± 0.922
SerPro: 4.469 ± 1.402
SerGln: 3.911 ± 1.312
SerArg: 5.028 ± 1.007
SerSer: 7.821 ± 0.922
SerThr: 5.307 ± 0.592
SerVal: 5.307 ± 0.635
SerTrp: 2.793 ± 0.73
SerTyr: 4.469 ± 0.673
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.397 ± 0.285
ThrCys: 0.559 ± 0.278
ThrAsp: 3.352 ± 0.4
ThrGlu: 2.514 ± 0.433
ThrPhe: 2.235 ± 0.863
ThrGly: 4.469 ± 1.577
ThrHis: 0.838 ± 0.637
ThrIle: 3.352 ± 1.33
ThrLys: 2.793 ± 0.458
ThrLeu: 5.587 ± 1.541
ThrMet: 1.397 ± 0.784
ThrAsn: 2.793 ± 0.575
ThrPro: 2.514 ± 0.925
ThrGln: 2.235 ± 0.445
ThrArg: 3.352 ± 1.157
ThrSer: 4.19 ± 0.598
ThrThr: 2.514 ± 1.106
ThrVal: 5.307 ± 1.652
ThrTrp: 1.676 ± 0.513
ThrTyr: 3.073 ± 1.263
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.397 ± 1.099
ValCys: 1.676 ± 0.38
ValAsp: 4.469 ± 1.07
ValGlu: 2.235 ± 0.76
ValPhe: 3.631 ± 1.059
ValGly: 3.911 ± 1.755
ValHis: 0.838 ± 0.644
ValIle: 3.352 ± 0.95
ValLys: 3.631 ± 0.78
ValLeu: 4.749 ± 0.784
ValMet: 2.235 ± 0.528
ValAsn: 3.631 ± 1.046
ValPro: 2.514 ± 0.604
ValGln: 2.793 ± 0.693
ValArg: 3.073 ± 1.183
ValSer: 5.587 ± 0.715
ValThr: 2.793 ± 0.46
ValVal: 2.514 ± 1.191
ValTrp: 0.838 ± 0.514
ValTyr: 1.955 ± 1.621
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.117 ± 1.173
TrpCys: 0.279 ± 0.484
TrpAsp: 0.838 ± 0.417
TrpGlu: 2.235 ± 0.706
TrpPhe: 1.676 ± 0.513
TrpGly: 1.676 ± 0.94
TrpHis: 0.279 ± 0.157
TrpIle: 1.117 ± 0.729
TrpLys: 0.559 ± 0.313
TrpLeu: 0.838 ± 0.778
TrpMet: 0.559 ± 0.278
TrpAsn: 0.838 ± 0.433
TrpPro: 0.559 ± 0.313
TrpGln: 0.0 ± 0.0
TrpArg: 0.838 ± 0.423
TrpSer: 2.235 ± 0.863
TrpThr: 0.559 ± 0.417
TrpVal: 0.559 ± 0.589
TrpTrp: 0.279 ± 0.157
TrpTyr: 0.838 ± 0.256
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.559 ± 0.278
TyrCys: 1.117 ± 1.323
TyrAsp: 1.397 ± 0.436
TyrGlu: 1.397 ± 0.628
TyrPhe: 1.117 ± 0.74
TyrGly: 1.955 ± 0.458
TyrHis: 0.559 ± 0.417
TyrIle: 2.793 ± 1.017
TyrLys: 3.631 ± 0.691
TyrLeu: 5.307 ± 0.96
TyrMet: 0.559 ± 0.304
TyrAsn: 2.235 ± 0.863
TyrPro: 0.838 ± 0.327
TyrGln: 1.117 ± 0.321
TyrArg: 1.397 ± 0.285
TyrSer: 5.866 ± 1.59
TyrThr: 0.838 ± 0.423
TyrVal: 2.514 ± 1.061
TyrTrp: 1.117 ± 0.496
TyrTyr: 0.279 ± 0.157
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3581 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
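One way such a protein-level bootstrap could look is sketched below. The source states only that bootstrapping was done 100 times at the protein level; resampling whole proteins with replacement and reporting the standard deviation across replicates are assumptions, and the function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def permille(proteins):
    """Dipeptide frequencies in per mille, from overlapping adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(proteins, n_replicates=100, seed=0):
    """Resample whole proteins with replacement; return the spread per dipeptide.

    Assumption: the reported error is the standard deviation over replicates.
    """
    rng = random.Random(seed)
    replicates = [
        permille(rng.choices(proteins, k=len(proteins)))
        for _ in range(n_replicates)
    ]
    seen = set().union(*replicates)
    # A dipeptide absent from a replicate contributes a frequency of 0.
    return {
        dp: statistics.pstdev([rep.get(dp, 0.0) for rep in replicates])
        for dp in seen
    }


# Toy input (hypothetical sequences, not the Gata virus proteome).
toy = ["MALLSG", "ALLK", "GGSAL"]
errors = bootstrap_error(toy)
means = permille(toy)
for dp in sorted(means):
    print(f"{dp}: {means[dp]:.3f} ± {errors.get(dp, 0.0):.3f}")
```

With only a handful of proteins, as here, the replicates differ a lot from one another, which is consistent with errors in the table that are large relative to the values.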

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski