Amino acid dipeptide frequency for Streptomyces sp. 150FB

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
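
As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. It is only a minimal sketch: the function name and the toy sequences are hypothetical, one-letter codes are used instead of the three-letter codes in the table, and the exact counting procedure used by the database is an assumption.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


toy_proteome = ["MAALA", "MAG"]  # hypothetical toy sequences, not real data
freqs = dipeptide_frequencies(toy_proteome)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

Counting within each protein separately avoids creating artificial pairs across protein boundaries.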

There are 400 possible dipeptides (441 if we consider Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
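
The comparison against the uniform expectation can be sketched as follows. The observed values are copied from the table for this proteome; the `classify` helper is hypothetical, and it assumes the threshold is simply the uniform expectation of 1/400 (2.5‰). The colouring rule actually used on the page may differ.

```python
# Uniform expectation: 400 equally likely dipeptides, expressed in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5

# A few observed values (per mille) copied from the table for this proteome.
observed = {
    "AlaAla": 19.855,
    "LeuLeu": 10.885,
    "TrpTrp": 0.32,
    "CysCys": 0.073,
}


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if value > expected:
        return "overrepresented"
    if value < expected:
        return "underrepresented"
    return "as expected"


for pair, value in observed.items():
    ratio = value / EXPECTED_PER_MILLE
    print(f"{pair}: {value} per mille, {ratio:.2f}x expected, {classify(value)}")
```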

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰, value ± error), grouped by first amino acid. Second amino acid order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 19.855 ± 0.163
AlaCys: 0.973 ± 0.023
AlaAsp: 8.223 ± 0.069
AlaGlu: 8.442 ± 0.089
AlaPhe: 3.558 ± 0.041
AlaGly: 13.032 ± 0.092
AlaHis: 2.818 ± 0.039
AlaIle: 3.679 ± 0.048
AlaLys: 3.028 ± 0.052
AlaLeu: 14.054 ± 0.097
AlaMet: 2.6 ± 0.038
AlaAsn: 2.057 ± 0.037
AlaPro: 6.707 ± 0.068
AlaGln: 3.553 ± 0.045
AlaArg: 9.599 ± 0.09
AlaSer: 6.112 ± 0.058
AlaThr: 6.956 ± 0.066
AlaVal: 11.818 ± 0.087
AlaTrp: 1.85 ± 0.034
AlaTyr: 2.771 ± 0.042
AlaXaa: 0.001 ± 0.001

Cys
CysAla: 0.997 ± 0.021
CysCys: 0.073 ± 0.005
CysAsp: 0.486 ± 0.018
CysGlu: 0.373 ± 0.012
CysPhe: 0.205 ± 0.009
CysGly: 0.887 ± 0.022
CysHis: 0.189 ± 0.009
CysIle: 0.135 ± 0.008
CysLys: 0.099 ± 0.007
CysLeu: 0.67 ± 0.018
CysMet: 0.109 ± 0.007
CysAsn: 0.123 ± 0.009
CysPro: 0.443 ± 0.016
CysGln: 0.162 ± 0.009
CysArg: 0.528 ± 0.016
CysSer: 0.438 ± 0.015
CysThr: 0.462 ± 0.015
CysVal: 0.677 ± 0.017
CysTrp: 0.116 ± 0.007
CysTyr: 0.144 ± 0.009
CysXaa: 0.0 ± 0.0

Asp
AspAla: 7.553 ± 0.062
AspCys: 0.414 ± 0.014
AspAsp: 3.686 ± 0.051
AspGlu: 3.935 ± 0.049
AspPhe: 1.785 ± 0.029
AspGly: 6.289 ± 0.06
AspHis: 1.519 ± 0.025
AspIle: 2.144 ± 0.034
AspLys: 1.293 ± 0.028
AspLeu: 6.435 ± 0.055
AspMet: 0.823 ± 0.019
AspAsn: 1.082 ± 0.024
AspPro: 4.532 ± 0.049
AspGln: 1.704 ± 0.03
AspArg: 4.627 ± 0.052
AspSer: 2.907 ± 0.04
AspThr: 3.375 ± 0.05
AspVal: 4.727 ± 0.048
AspTrp: 1.015 ± 0.022
AspTyr: 1.148 ± 0.024
AspXaa: 0.0 ± 0.0

Glu
GluAla: 7.316 ± 0.067
GluCys: 0.339 ± 0.011
GluAsp: 2.769 ± 0.038
GluGlu: 3.284 ± 0.046
GluPhe: 1.541 ± 0.028
GluGly: 4.314 ± 0.044
GluHis: 1.502 ± 0.023
GluIle: 2.354 ± 0.038
GluLys: 1.419 ± 0.027
GluLeu: 6.693 ± 0.061
GluMet: 0.995 ± 0.023
GluAsn: 1.091 ± 0.026
GluPro: 3.231 ± 0.051
GluGln: 2.274 ± 0.034
GluArg: 5.267 ± 0.051
GluSer: 2.788 ± 0.039
GluThr: 2.913 ± 0.034
GluVal: 4.374 ± 0.051
GluTrp: 0.764 ± 0.018
GluTyr: 1.102 ± 0.024
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.749 ± 0.048
PheCys: 0.263 ± 0.011
PheAsp: 2.11 ± 0.034
PheGlu: 1.485 ± 0.026
PhePhe: 0.941 ± 0.028
PheGly: 3.113 ± 0.037
PheHis: 0.593 ± 0.019
PheIle: 0.851 ± 0.022
PheLys: 0.557 ± 0.019
PheLeu: 2.581 ± 0.037
PheMet: 0.432 ± 0.016
PheAsn: 0.63 ± 0.018
PhePro: 1.482 ± 0.027
PheGln: 0.755 ± 0.019
PheArg: 1.81 ± 0.028
PheSer: 1.691 ± 0.033
PheThr: 2.086 ± 0.03
PheVal: 2.286 ± 0.034
PheTrp: 0.44 ± 0.013
PheTyr: 0.617 ± 0.017
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 11.125 ± 0.084
GlyCys: 0.768 ± 0.021
GlyAsp: 5.285 ± 0.057
GlyGlu: 5.089 ± 0.048
GlyPhe: 2.971 ± 0.04
GlyGly: 9.02 ± 0.107
GlyHis: 2.346 ± 0.032
GlyIle: 3.664 ± 0.047
GlyLys: 2.609 ± 0.046
GlyLeu: 9.391 ± 0.074
GlyMet: 2.023 ± 0.033
GlyAsn: 1.929 ± 0.039
GlyPro: 5.102 ± 0.06
GlyGln: 2.832 ± 0.044
GlyArg: 7.352 ± 0.062
GlySer: 5.74 ± 0.062
GlyThr: 6.423 ± 0.066
GlyVal: 7.467 ± 0.065
GlyTrp: 1.662 ± 0.03
GlyTyr: 2.309 ± 0.036
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.592 ± 0.038
HisCys: 0.206 ± 0.009
HisAsp: 1.332 ± 0.028
HisGlu: 1.182 ± 0.024
HisPhe: 0.705 ± 0.02
HisGly: 2.321 ± 0.032
HisHis: 0.685 ± 0.022
HisIle: 0.75 ± 0.02
HisLys: 0.376 ± 0.011
HisLeu: 2.524 ± 0.04
HisMet: 0.35 ± 0.013
HisAsn: 0.427 ± 0.016
HisPro: 1.761 ± 0.034
HisGln: 0.704 ± 0.02
HisArg: 1.934 ± 0.033
HisSer: 1.108 ± 0.022
HisThr: 1.366 ± 0.026
HisVal: 1.683 ± 0.032
HisTrp: 0.376 ± 0.013
HisTyr: 0.493 ± 0.016
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.966 ± 0.048
IleCys: 0.315 ± 0.014
IleAsp: 2.45 ± 0.035
IleGlu: 2.141 ± 0.032
IlePhe: 0.844 ± 0.021
IleGly: 3.781 ± 0.044
IleHis: 0.699 ± 0.018
IleIle: 1.014 ± 0.024
IleLys: 0.786 ± 0.021
IleLeu: 2.636 ± 0.038
IleMet: 0.49 ± 0.016
IleAsn: 0.82 ± 0.023
IlePro: 1.963 ± 0.031
IleGln: 0.808 ± 0.019
IleArg: 2.294 ± 0.036
IleSer: 1.97 ± 0.032
IleThr: 2.392 ± 0.037
IleVal: 2.843 ± 0.039
IleTrp: 0.417 ± 0.015
IleTyr: 0.615 ± 0.02
IleXaa: 0.0 ± 0.0

Lys
LysAla: 3.041 ± 0.05
LysCys: 0.12 ± 0.007
LysAsp: 1.397 ± 0.031
LysGlu: 1.164 ± 0.024
LysPhe: 0.482 ± 0.016
LysGly: 1.896 ± 0.04
LysHis: 0.453 ± 0.014
LysIle: 0.994 ± 0.027
LysLys: 0.934 ± 0.03
LysLeu: 2.148 ± 0.035
LysMet: 0.405 ± 0.015
LysAsn: 0.604 ± 0.021
LysPro: 1.394 ± 0.028
LysGln: 0.768 ± 0.021
LysArg: 1.454 ± 0.028
LysSer: 1.284 ± 0.027
LysThr: 1.433 ± 0.033
LysVal: 1.925 ± 0.036
LysTrp: 0.293 ± 0.013
LysTyr: 0.465 ± 0.016
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 14.549 ± 0.097
LeuCys: 0.766 ± 0.019
LeuAsp: 6.87 ± 0.062
LeuGlu: 4.694 ± 0.05
LeuPhe: 2.712 ± 0.037
LeuGly: 9.187 ± 0.062
LeuHis: 2.274 ± 0.037
LeuIle: 3.623 ± 0.039
LeuLys: 2.069 ± 0.035
LeuLeu: 10.885 ± 0.104
LeuMet: 1.669 ± 0.03
LeuAsn: 1.835 ± 0.034
LeuPro: 6.347 ± 0.065
LeuGln: 2.128 ± 0.034
LeuArg: 8.59 ± 0.081
LeuSer: 5.803 ± 0.06
LeuThr: 6.838 ± 0.05
LeuVal: 8.511 ± 0.076
LeuTrp: 1.254 ± 0.025
LeuTyr: 1.906 ± 0.028
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.293 ± 0.031
MetCys: 0.129 ± 0.007
MetAsp: 0.951 ± 0.019
MetGlu: 0.827 ± 0.019
MetPhe: 0.463 ± 0.015
MetGly: 1.341 ± 0.03
MetHis: 0.355 ± 0.01
MetIle: 0.694 ± 0.019
MetLys: 0.436 ± 0.017
MetLeu: 1.737 ± 0.033
MetMet: 0.31 ± 0.013
MetAsn: 0.48 ± 0.016
MetPro: 1.166 ± 0.023
MetGln: 0.432 ± 0.015
MetArg: 1.484 ± 0.026
MetSer: 1.426 ± 0.028
MetThr: 1.527 ± 0.028
MetVal: 1.359 ± 0.025
MetTrp: 0.222 ± 0.01
MetTyr: 0.367 ± 0.012
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.317 ± 0.036
AsnCys: 0.173 ± 0.009
AsnAsp: 1.083 ± 0.027
AsnGlu: 0.887 ± 0.02
AsnPhe: 0.57 ± 0.017
AsnGly: 2.094 ± 0.041
AsnHis: 0.427 ± 0.016
AsnIle: 0.733 ± 0.019
AsnLys: 0.469 ± 0.015
AsnLeu: 1.771 ± 0.031
AsnMet: 0.333 ± 0.013
AsnAsn: 0.495 ± 0.019
AsnPro: 1.469 ± 0.031
AsnGln: 0.591 ± 0.018
AsnArg: 1.277 ± 0.024
AsnSer: 1.116 ± 0.029
AsnThr: 1.26 ± 0.032
AsnVal: 1.471 ± 0.027
AsnTrp: 0.314 ± 0.014
AsnTyr: 0.482 ± 0.016
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 8.137 ± 0.074
ProCys: 0.307 ± 0.012
ProAsp: 4.491 ± 0.052
ProGlu: 4.234 ± 0.046
ProPhe: 1.608 ± 0.028
ProGly: 6.719 ± 0.059
ProHis: 1.336 ± 0.026
ProIle: 1.377 ± 0.029
ProLys: 1.266 ± 0.028
ProLeu: 5.298 ± 0.052
ProMet: 1.015 ± 0.023
ProAsn: 1.002 ± 0.023
ProPro: 3.208 ± 0.058
ProGln: 1.601 ± 0.032
ProArg: 3.632 ± 0.05
ProSer: 3.176 ± 0.046
ProThr: 3.151 ± 0.046
ProVal: 5.371 ± 0.052
ProTrp: 0.9 ± 0.021
ProTyr: 1.505 ± 0.029
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.594 ± 0.045
GlnCys: 0.149 ± 0.008
GlnAsp: 1.488 ± 0.026
GlnGlu: 1.44 ± 0.026
GlnPhe: 0.792 ± 0.016
GlnGly: 2.355 ± 0.039
GlnHis: 0.707 ± 0.021
GlnIle: 1.182 ± 0.021
GlnLys: 0.657 ± 0.022
GlnLeu: 3.113 ± 0.033
GlnMet: 0.523 ± 0.016
GlnAsn: 0.578 ± 0.021
GlnPro: 1.666 ± 0.032
GlnGln: 1.372 ± 0.042
GlnArg: 2.242 ± 0.034
GlnSer: 1.352 ± 0.025
GlnThr: 1.32 ± 0.027
GlnVal: 2.407 ± 0.036
GlnTrp: 0.437 ± 0.014
GlnTyr: 0.637 ± 0.018
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 9.244 ± 0.087
ArgCys: 0.522 ± 0.015
ArgAsp: 4.228 ± 0.051
ArgGlu: 4.584 ± 0.055
ArgPhe: 2.285 ± 0.032
ArgGly: 5.666 ± 0.051
ArgHis: 1.97 ± 0.031
ArgIle: 3.254 ± 0.041
ArgLys: 1.636 ± 0.028
ArgLeu: 8.368 ± 0.083
ArgMet: 1.721 ± 0.026
ArgAsn: 1.382 ± 0.026
ArgPro: 4.459 ± 0.058
ArgGln: 2.222 ± 0.032
ArgArg: 6.985 ± 0.073
ArgSer: 4.07 ± 0.044
ArgThr: 5.424 ± 0.051
ArgVal: 5.587 ± 0.057
ArgTrp: 1.329 ± 0.025
ArgTyr: 1.727 ± 0.028
ArgXaa: 0.001 ± 0.001

Ser
SerAla: 7.078 ± 0.06
SerCys: 0.388 ± 0.012
SerAsp: 3.051 ± 0.047
SerGlu: 2.634 ± 0.03
SerPhe: 1.679 ± 0.031
SerGly: 6.335 ± 0.072
SerHis: 1.136 ± 0.022
SerIle: 1.659 ± 0.032
SerLys: 1.139 ± 0.024
SerLeu: 5.158 ± 0.052
SerMet: 1.128 ± 0.024
SerAsn: 1.039 ± 0.025
SerPro: 3.24 ± 0.038
SerGln: 1.363 ± 0.028
SerArg: 3.763 ± 0.045
SerSer: 3.17 ± 0.053
SerThr: 3.318 ± 0.041
SerVal: 4.618 ± 0.048
SerTrp: 0.923 ± 0.02
SerTyr: 1.373 ± 0.027
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 8.697 ± 0.073
ThrCys: 0.409 ± 0.014
ThrAsp: 3.786 ± 0.05
ThrGlu: 3.311 ± 0.042
ThrPhe: 1.715 ± 0.033
ThrGly: 6.766 ± 0.056
ThrHis: 1.192 ± 0.024
ThrIle: 1.882 ± 0.032
ThrLys: 1.247 ± 0.029
ThrLeu: 5.874 ± 0.054
ThrMet: 1.017 ± 0.023
ThrAsn: 1.106 ± 0.026
ThrPro: 4.151 ± 0.061
ThrGln: 1.422 ± 0.027
ThrArg: 3.718 ± 0.046
ThrSer: 3.373 ± 0.042
ThrThr: 3.931 ± 0.054
ThrVal: 6.17 ± 0.062
ThrTrp: 0.907 ± 0.02
ThrTyr: 1.404 ± 0.028
ThrXaa: 0.0 ± 0.0

Val
ValAla: 10.204 ± 0.08
ValCys: 0.697 ± 0.018
ValAsp: 4.925 ± 0.048
ValGlu: 4.634 ± 0.052
ValPhe: 2.402 ± 0.035
ValGly: 6.541 ± 0.054
ValHis: 1.888 ± 0.03
ValIle: 3.117 ± 0.045
ValLys: 1.843 ± 0.037
ValLeu: 9.253 ± 0.08
ValMet: 1.48 ± 0.025
ValAsn: 1.771 ± 0.034
ValPro: 5.089 ± 0.049
ValGln: 2.062 ± 0.032
ValArg: 7.038 ± 0.059
ValSer: 4.668 ± 0.051
ValThr: 5.632 ± 0.057
ValVal: 7.787 ± 0.075
ValTrp: 1.093 ± 0.024
ValTyr: 1.577 ± 0.029
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.572 ± 0.026
TrpCys: 0.134 ± 0.008
TrpAsp: 0.842 ± 0.019
TrpGlu: 0.724 ± 0.017
TrpPhe: 0.512 ± 0.016
TrpGly: 1.043 ± 0.022
TrpHis: 0.376 ± 0.013
TrpIle: 0.587 ± 0.014
TrpLys: 0.388 ± 0.016
TrpLeu: 1.72 ± 0.028
TrpMet: 0.312 ± 0.013
TrpAsn: 0.429 ± 0.015
TrpPro: 0.805 ± 0.022
TrpGln: 0.615 ± 0.017
TrpArg: 1.287 ± 0.026
TrpSer: 0.973 ± 0.019
TrpThr: 1.039 ± 0.023
TrpVal: 0.937 ± 0.024
TrpTrp: 0.32 ± 0.013
TrpTyr: 0.373 ± 0.014
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.809 ± 0.037
TyrCys: 0.171 ± 0.008
TyrAsp: 1.538 ± 0.025
TyrGlu: 1.263 ± 0.022
TyrPhe: 0.703 ± 0.017
TyrGly: 2.409 ± 0.033
TyrHis: 0.407 ± 0.012
TyrIle: 0.551 ± 0.017
TyrLys: 0.403 ± 0.015
TyrLeu: 2.201 ± 0.031
TyrMet: 0.264 ± 0.011
TyrAsn: 0.454 ± 0.016
TyrPro: 1.117 ± 0.023
TyrGln: 0.643 ± 0.017
TyrArg: 1.784 ± 0.032
TyrSer: 1.026 ± 0.022
TyrThr: 1.232 ± 0.025
TyrVal: 1.682 ± 0.027
TyrTrp: 0.356 ± 0.013
TyrTyr: 0.461 ± 0.015
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.001 ± 0.001
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.001 ± 0.001
XaaSer: 0.001 ± 0.001
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.032 ± 0.011

Statistics based on 7086 proteins (2211488 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
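
A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates serves as the error. This is only a sketch under assumptions: the function names and toy sequences are hypothetical, and reporting the mean and standard deviation of the resampled estimates is an assumed choice, since the page does not state which statistic is used.

```python
import random
from collections import Counter
from statistics import mean, stdev


def dipeptide_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_frequency(proteins, pair, n_resamples=100, seed=0):
    """Bootstrap the per-mille frequency of `pair`, resampling whole proteins."""
    rng = random.Random(seed)
    # Pre-compute (hits, total) per protein so each resample is cheap.
    stats = []
    for seq in proteins:
        counts = dipeptide_counts(seq)
        stats.append((counts[pair], sum(counts.values())))
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(stats, k=len(stats))
        hits = sum(h for h, t in sample)
        total = sum(t for h, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumed summary: mean and standard deviation over the resamples.
    return mean(estimates), stdev(estimates)


toy_proteome = ["MAALA", "MAG", "MKAAL", "MLLA"]  # hypothetical toy sequences
value, error = bootstrap_frequency(toy_proteome, "AA")
print(f"AA: {value:.3f} ± {error:.3f} per mille")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the error reflects variation between proteins.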

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski