Amino acid dipeptide frequency for Gemmatirosa kalamazoonesis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
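The uniform baseline described above can be checked with a few lines of arithmetic. The sketch below assumes that "expected" means the flat 1/400 value from the text; the `classify` helper and its labels are hypothetical names used only for illustration, and the three example values are copied from the table.

```python
# Uniform-expectation baseline from the text: 20 x 20 = 400 standard dipeptides.
N_STANDARD = 20
expected_permille = 1000 / (N_STANDARD ** 2)  # 2.5 per mille, i.e. 0.25 %


def classify(observed_permille, expected=expected_permille):
    """Hypothetical helper: compare an observed value with the uniform baseline."""
    if observed_permille > expected:
        return "over"    # would be marked red under this assumption
    if observed_permille < expected:
        return "under"   # would be marked blue under this assumption
    return "equal"


# Observed frequencies in per mille, copied from the table.
examples = {"AlaAla": 23.009, "CysCys": 0.089, "LeuLeu": 10.457}
labels = {name: classify(value) for name, value in examples.items()}

for name, label in labels.items():
    print(f"{name}: {examples[name]} per mille -> {label}")
```

Under this assumption AlaAla and LeuLeu are well above the 2.5‰ baseline, while CysCys is far below it.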

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
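As a rough illustration of how such per mille values could be obtained, the sketch below counts overlapping adjacent residue pairs within each protein and normalises by the total number of pairs. This is an assumption about the counting scheme, not a description of the Proteome-pI pipeline; the function name `dipeptide_permille` and the toy sequences are hypothetical, and one-letter residue codes are used instead of the three-letter codes in the table.

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return dipeptide frequencies in per mille over a list of protein sequences.

    Pairs are counted with an overlapping window inside each sequence and
    never across sequence boundaries (an assumption made for this sketch).
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


# Toy input, not real proteome data.
toy = ["MAAL", "AAG"]
freqs = dipeptide_permille(toy)

# Converting a per mille value back to a plain fraction: multiply by 10^-3.
fraction_AA = freqs["AA"] * 1e-3
print(freqs, fraction_AA)
```

The same conversion applies to the table: a value of 23.009‰ corresponds to a fraction of 0.023009.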

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell is frequency ± error in ‰.

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 23.009 ± 0.201 | 1.206 ± 0.023 | 7.654 ± 0.069 | 7.415 ± 0.08 | 4.146 ± 0.048 | 11.972 ± 0.082 | 2.722 ± 0.043 | 4.76 ± 0.047 | 2.386 ± 0.042 | 16.921 ± 0.135 | 2.804 ± 0.035 | 2.499 ± 0.036 | 8.617 ± 0.09 | 4.065 ± 0.047 | 14.075 ± 0.11 | 7.44 ± 0.069 | 8.267 ± 0.07 | 11.384 ± 0.089 | 2.112 ± 0.035 | 2.8 ± 0.039 | 0.0 ± 0.0 |
| Cys | 1.03 ± 0.023 | 0.089 ± 0.007 | 0.451 ± 0.015 | 0.322 ± 0.012 | 0.195 ± 0.01 | 0.793 ± 0.022 | 0.173 ± 0.009 | 0.189 ± 0.009 | 0.094 ± 0.008 | 0.495 ± 0.016 | 0.103 ± 0.007 | 0.14 ± 0.008 | 0.348 ± 0.014 | 0.12 ± 0.007 | 0.54 ± 0.016 | 0.347 ± 0.014 | 0.423 ± 0.015 | 0.57 ± 0.016 | 0.107 ± 0.008 | 0.148 ± 0.008 | 0.0 ± 0.0 |
| Asp | 10.027 ± 0.087 | 0.295 ± 0.011 | 3.71 ± 0.051 | 3.267 ± 0.043 | 1.707 ± 0.03 | 5.856 ± 0.071 | 1.006 ± 0.021 | 1.486 ± 0.03 | 0.75 ± 0.024 | 5.04 ± 0.043 | 0.768 ± 0.018 | 0.827 ± 0.023 | 3.992 ± 0.047 | 1.2 ± 0.024 | 4.85 ± 0.043 | 2.816 ± 0.05 | 2.804 ± 0.042 | 6.095 ± 0.048 | 0.843 ± 0.021 | 1.26 ± 0.026 | 0.0 ± 0.0 |
| Glu | 5.981 ± 0.06 | 0.271 ± 0.012 | 1.884 ± 0.031 | 2.264 ± 0.037 | 1.362 ± 0.025 | 3.337 ± 0.038 | 1.255 ± 0.024 | 2.205 ± 0.034 | 0.99 ± 0.025 | 5.546 ± 0.055 | 0.984 ± 0.021 | 0.933 ± 0.024 | 2.88 ± 0.04 | 1.71 ± 0.031 | 6.58 ± 0.077 | 2.486 ± 0.032 | 2.568 ± 0.033 | 3.505 ± 0.04 | 0.743 ± 0.017 | 1.02 ± 0.023 | 0.0 ± 0.0 |
| Phe | 4.269 ± 0.046 | 0.205 ± 0.011 | 2.346 ± 0.03 | 1.535 ± 0.025 | 1.026 ± 0.029 | 3.275 ± 0.042 | 0.64 ± 0.016 | 0.77 ± 0.019 | 0.48 ± 0.015 | 2.582 ± 0.031 | 0.468 ± 0.015 | 0.69 ± 0.021 | 1.48 ± 0.028 | 0.767 ± 0.021 | 2.367 ± 0.031 | 1.478 ± 0.028 | 2.074 ± 0.035 | 2.97 ± 0.039 | 0.453 ± 0.015 | 0.785 ± 0.021 | 0.0 ± 0.0 |
| Gly | 12.834 ± 0.097 | 0.688 ± 0.02 | 5.211 ± 0.057 | 4.255 ± 0.047 | 2.868 ± 0.045 | 8.481 ± 0.081 | 1.685 ± 0.03 | 3.284 ± 0.043 | 1.933 ± 0.036 | 7.417 ± 0.063 | 1.8 ± 0.028 | 1.77 ± 0.034 | 3.944 ± 0.043 | 2.295 ± 0.034 | 7.888 ± 0.063 | 4.375 ± 0.054 | 6.052 ± 0.07 | 7.97 ± 0.067 | 1.502 ± 0.026 | 2.136 ± 0.033 | 0.0 ± 0.0 |
| His | 3.021 ± 0.046 | 0.146 ± 0.009 | 1.385 ± 0.028 | 1.122 ± 0.021 | 0.694 ± 0.016 | 1.968 ± 0.031 | 0.51 ± 0.018 | 0.514 ± 0.015 | 0.26 ± 0.012 | 1.823 ± 0.029 | 0.312 ± 0.013 | 0.355 ± 0.014 | 1.435 ± 0.025 | 0.408 ± 0.014 | 1.668 ± 0.031 | 0.704 ± 0.019 | 0.995 ± 0.024 | 1.981 ± 0.033 | 0.297 ± 0.011 | 0.485 ± 0.015 | 0.0 ± 0.0 |
| Ile | 5.332 ± 0.06 | 0.188 ± 0.01 | 2.452 ± 0.031 | 1.928 ± 0.03 | 0.866 ± 0.021 | 3.286 ± 0.042 | 0.585 ± 0.017 | 0.834 ± 0.026 | 0.573 ± 0.016 | 2.382 ± 0.036 | 0.419 ± 0.014 | 0.718 ± 0.019 | 1.869 ± 0.029 | 0.683 ± 0.02 | 2.295 ± 0.034 | 1.37 ± 0.026 | 1.953 ± 0.034 | 3.555 ± 0.046 | 0.366 ± 0.011 | 0.712 ± 0.018 | 0.0 ± 0.0 |
| Lys | 2.066 ± 0.035 | 0.093 ± 0.007 | 0.984 ± 0.027 | 0.781 ± 0.023 | 0.518 ± 0.016 | 1.449 ± 0.03 | 0.374 ± 0.014 | 0.705 ± 0.021 | 0.576 ± 0.02 | 1.801 ± 0.032 | 0.35 ± 0.012 | 0.439 ± 0.016 | 1.208 ± 0.025 | 0.634 ± 0.02 | 1.494 ± 0.03 | 0.941 ± 0.022 | 1.018 ± 0.022 | 1.306 ± 0.028 | 0.259 ± 0.012 | 0.503 ± 0.016 | 0.0 ± 0.0 |
| Leu | 16.322 ± 0.131 | 0.629 ± 0.016 | 6.384 ± 0.058 | 4.463 ± 0.055 | 2.942 ± 0.036 | 9.781 ± 0.076 | 2.008 ± 0.034 | 2.251 ± 0.038 | 1.75 ± 0.033 | 10.457 ± 0.081 | 1.309 ± 0.024 | 1.702 ± 0.035 | 5.626 ± 0.049 | 2.338 ± 0.037 | 9.452 ± 0.073 | 4.849 ± 0.056 | 5.707 ± 0.051 | 9.272 ± 0.07 | 1.279 ± 0.022 | 2.069 ± 0.031 | 0.0 ± 0.0 |
| Met | 2.015 ± 0.027 | 0.091 ± 0.006 | 0.744 ± 0.018 | 0.728 ± 0.019 | 0.442 ± 0.015 | 1.194 ± 0.025 | 0.4 ± 0.014 | 0.644 ± 0.017 | 0.428 ± 0.015 | 1.926 ± 0.03 | 0.335 ± 0.012 | 0.426 ± 0.013 | 1.415 ± 0.026 | 0.547 ± 0.016 | 1.938 ± 0.032 | 1.264 ± 0.025 | 1.428 ± 0.026 | 1.115 ± 0.028 | 0.207 ± 0.009 | 0.327 ± 0.011 | 0.0 ± 0.0 |
| Asn | 2.96 ± 0.04 | 0.144 ± 0.008 | 1.158 ± 0.024 | 0.862 ± 0.021 | 0.656 ± 0.021 | 2.18 ± 0.039 | 0.366 ± 0.013 | 0.645 ± 0.019 | 0.329 ± 0.014 | 1.81 ± 0.031 | 0.297 ± 0.011 | 0.522 ± 0.021 | 1.402 ± 0.028 | 0.511 ± 0.017 | 1.353 ± 0.026 | 0.781 ± 0.022 | 1.159 ± 0.029 | 2.084 ± 0.035 | 0.289 ± 0.013 | 0.528 ± 0.017 | 0.0 ± 0.0 |
| Pro | 8.418 ± 0.066 | 0.298 ± 0.014 | 4.019 ± 0.045 | 3.185 ± 0.044 | 1.818 ± 0.028 | 5.182 ± 0.052 | 1.15 ± 0.023 | 1.907 ± 0.032 | 1.048 ± 0.024 | 5.516 ± 0.048 | 1.165 ± 0.02 | 1.769 ± 0.033 | 3.837 ± 0.071 | 1.402 ± 0.028 | 4.71 ± 0.054 | 3.509 ± 0.046 | 3.879 ± 0.044 | 4.609 ± 0.043 | 0.808 ± 0.022 | 1.236 ± 0.024 | 0.0 ± 0.0 |
| Gln | 3.179 ± 0.045 | 0.144 ± 0.008 | 1.027 ± 0.022 | 1.016 ± 0.023 | 0.876 ± 0.022 | 1.895 ± 0.038 | 0.574 ± 0.017 | 1.127 ± 0.024 | 0.543 ± 0.017 | 3.061 ± 0.041 | 0.498 ± 0.017 | 0.601 ± 0.021 | 1.729 ± 0.031 | 1.118 ± 0.035 | 2.768 ± 0.036 | 1.375 ± 0.026 | 1.418 ± 0.028 | 1.997 ± 0.035 | 0.408 ± 0.014 | 0.633 ± 0.02 | 0.0 ± 0.0 |
| Arg | 13.633 ± 0.114 | 0.576 ± 0.017 | 5.811 ± 0.063 | 4.984 ± 0.051 | 3.122 ± 0.039 | 7.168 ± 0.063 | 1.932 ± 0.033 | 3.102 ± 0.036 | 1.281 ± 0.025 | 9.383 ± 0.074 | 1.971 ± 0.029 | 1.478 ± 0.028 | 4.894 ± 0.051 | 2.33 ± 0.035 | 9.287 ± 0.092 | 4.038 ± 0.046 | 4.981 ± 0.045 | 8.26 ± 0.063 | 1.726 ± 0.029 | 2.156 ± 0.034 | 0.0 ± 0.0 |
| Ser | 7.201 ± 0.072 | 0.366 ± 0.013 | 2.73 ± 0.034 | 1.971 ± 0.028 | 1.688 ± 0.032 | 4.812 ± 0.053 | 0.965 ± 0.018 | 1.802 ± 0.028 | 0.818 ± 0.022 | 5.289 ± 0.051 | 0.925 ± 0.018 | 1.045 ± 0.028 | 3.161 ± 0.04 | 1.106 ± 0.024 | 3.97 ± 0.048 | 2.759 ± 0.044 | 3.177 ± 0.042 | 4.161 ± 0.045 | 0.788 ± 0.02 | 1.243 ± 0.022 | 0.0 ± 0.0 |
| Thr | 7.594 ± 0.07 | 0.384 ± 0.015 | 3.042 ± 0.041 | 2.26 ± 0.031 | 2.103 ± 0.036 | 5.344 ± 0.06 | 1.137 ± 0.026 | 2.34 ± 0.035 | 1.059 ± 0.025 | 6.758 ± 0.064 | 1.067 ± 0.022 | 1.338 ± 0.033 | 4.425 ± 0.05 | 1.376 ± 0.03 | 4.701 ± 0.046 | 3.207 ± 0.048 | 4.043 ± 0.06 | 5.38 ± 0.059 | 0.961 ± 0.025 | 1.393 ± 0.027 | 0.0 ± 0.0 |
| Val | 12.843 ± 0.102 | 0.581 ± 0.017 | 5.284 ± 0.054 | 4.384 ± 0.05 | 2.299 ± 0.036 | 7.214 ± 0.067 | 1.727 ± 0.028 | 2.7 ± 0.039 | 1.488 ± 0.028 | 8.681 ± 0.081 | 1.36 ± 0.028 | 1.845 ± 0.031 | 5.31 ± 0.054 | 2.252 ± 0.031 | 8.468 ± 0.062 | 4.399 ± 0.044 | 5.615 ± 0.063 | 8.753 ± 0.082 | 1.097 ± 0.023 | 1.782 ± 0.032 | 0.0 ± 0.0 |
| Trp | 1.514 ± 0.029 | 0.122 ± 0.007 | 0.748 ± 0.021 | 0.628 ± 0.017 | 0.474 ± 0.016 | 1.112 ± 0.025 | 0.358 ± 0.014 | 0.607 ± 0.017 | 0.333 ± 0.014 | 1.656 ± 0.028 | 0.32 ± 0.013 | 0.417 ± 0.013 | 0.744 ± 0.019 | 0.519 ± 0.015 | 1.699 ± 0.03 | 0.914 ± 0.023 | 1.036 ± 0.019 | 1.012 ± 0.024 | 0.338 ± 0.014 | 0.398 ± 0.013 | 0.0 ± 0.0 |
| Tyr | 3.013 ± 0.042 | 0.16 ± 0.008 | 1.576 ± 0.03 | 1.133 ± 0.022 | 0.86 ± 0.018 | 2.086 ± 0.037 | 0.516 ± 0.015 | 0.496 ± 0.016 | 0.379 ± 0.013 | 2.046 ± 0.035 | 0.354 ± 0.013 | 0.52 ± 0.015 | 1.148 ± 0.025 | 0.605 ± 0.018 | 2.008 ± 0.029 | 0.925 ± 0.021 | 1.317 ± 0.029 | 2.103 ± 0.033 | 0.369 ± 0.013 | 0.632 ± 0.017 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 6261 proteins (2262513 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
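A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. The source does not say which spread measure is used, so the sample standard deviation here is an assumption; `bootstrap_error`, its parameters, and the toy sequences are hypothetical and use one-letter residue codes.

```python
import random
import statistics
from collections import Counter


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Estimate the error of one dipeptide frequency (per mille) by resampling proteins."""
    rng = random.Random(seed)
    # Pre-compute overlapping dipeptide counts once per protein.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in sequences
    ]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000 * hits / total if total else 0.0)
    # Assumption: the reported error is the standard deviation of the estimates.
    return statistics.stdev(estimates)


# Toy input, not real proteome data.
toy = ["MAAL", "AAG", "MKLV", "GAAAL"]
err = bootstrap_error(toy, "AA")
print(f"AA error estimate: {err:.3f} per mille")
```

Resampling whole proteins rather than individual dipeptides keeps pairs from the same protein together, so the estimate reflects variation between proteins.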

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski