Amino acid dipepetide frequency for Torrubiella hemipterigena

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.746AlaAla: 10.746 ± 0.068
1.173AlaCys: 1.173 ± 0.018
4.746AlaAsp: 4.746 ± 0.033
5.082AlaGlu: 5.082 ± 0.047
3.238AlaPhe: 3.238 ± 0.027
5.971AlaGly: 5.971 ± 0.04
1.74AlaHis: 1.74 ± 0.017
4.714AlaIle: 4.714 ± 0.033
4.625AlaLys: 4.625 ± 0.036
7.998AlaLeu: 7.998 ± 0.044
2.286AlaMet: 2.286 ± 0.022
3.368AlaAsn: 3.368 ± 0.023
4.957AlaPro: 4.957 ± 0.049
3.496AlaGln: 3.496 ± 0.03
4.683AlaArg: 4.683 ± 0.033
7.64AlaSer: 7.64 ± 0.042
5.896AlaThr: 5.896 ± 0.034
5.768AlaVal: 5.768 ± 0.04
1.22AlaTrp: 1.22 ± 0.018
2.236AlaTyr: 2.236 ± 0.021
0.0AlaXaa: 0.0 ± 0.0
Cys
1.017CysAla: 1.017 ± 0.015
0.25CysCys: 0.25 ± 0.009
0.711CysAsp: 0.711 ± 0.013
0.584CysGlu: 0.584 ± 0.011
0.552CysPhe: 0.552 ± 0.01
0.955CysGly: 0.955 ± 0.018
0.335CysHis: 0.335 ± 0.008
0.772CysIle: 0.772 ± 0.012
0.609CysLys: 0.609 ± 0.01
1.255CysLeu: 1.255 ± 0.018
0.285CysMet: 0.285 ± 0.007
0.489CysAsn: 0.489 ± 0.011
0.673CysPro: 0.673 ± 0.014
0.482CysGln: 0.482 ± 0.01
0.747CysArg: 0.747 ± 0.013
0.935CysSer: 0.935 ± 0.014
0.745CysThr: 0.745 ± 0.013
0.794CysVal: 0.794 ± 0.01
0.194CysTrp: 0.194 ± 0.007
0.353CysTyr: 0.353 ± 0.008
0.0CysXaa: 0.0 ± 0.0
Asp
5.4AspAla: 5.4 ± 0.036
0.681AspCys: 0.681 ± 0.014
4.788AspAsp: 4.788 ± 0.052
4.598AspGlu: 4.598 ± 0.042
2.238AspPhe: 2.238 ± 0.022
4.397AspGly: 4.397 ± 0.033
1.179AspHis: 1.179 ± 0.012
3.269AspIle: 3.269 ± 0.024
2.801AspLys: 2.801 ± 0.028
4.907AspLeu: 4.907 ± 0.031
1.483AspMet: 1.483 ± 0.016
2.137AspAsn: 2.137 ± 0.019
2.922AspPro: 2.922 ± 0.024
1.83AspGln: 1.83 ± 0.019
2.888AspArg: 2.888 ± 0.03
4.145AspSer: 4.145 ± 0.032
3.056AspThr: 3.056 ± 0.023
3.773AspVal: 3.773 ± 0.027
0.899AspTrp: 0.899 ± 0.013
1.642AspTyr: 1.642 ± 0.018
0.0AspXaa: 0.0 ± 0.0
Glu
5.513GluAla: 5.513 ± 0.051
0.626GluCys: 0.626 ± 0.012
4.06GluAsp: 4.06 ± 0.04
4.868GluGlu: 4.868 ± 0.068
1.949GluPhe: 1.949 ± 0.019
3.213GluGly: 3.213 ± 0.025
1.362GluHis: 1.362 ± 0.016
2.89GluIle: 2.89 ± 0.022
3.379GluLys: 3.379 ± 0.027
4.998GluLeu: 4.998 ± 0.036
1.515GluMet: 1.515 ± 0.016
2.177GluAsn: 2.177 ± 0.021
2.691GluPro: 2.691 ± 0.036
2.443GluGln: 2.443 ± 0.023
3.404GluArg: 3.404 ± 0.029
4.04GluSer: 4.04 ± 0.026
3.438GluThr: 3.438 ± 0.026
3.228GluVal: 3.228 ± 0.028
0.857GluTrp: 0.857 ± 0.013
1.624GluTyr: 1.624 ± 0.019
0.0GluXaa: 0.0 ± 0.0
Phe
3.218PheAla: 3.218 ± 0.026
0.566PheCys: 0.566 ± 0.009
2.402PheAsp: 2.402 ± 0.022
2.064PheGlu: 2.064 ± 0.019
1.642PhePhe: 1.642 ± 0.023
2.917PheGly: 2.917 ± 0.031
0.907PheHis: 0.907 ± 0.015
1.911PheIle: 1.911 ± 0.025
1.612PheLys: 1.612 ± 0.019
3.365PheLeu: 3.365 ± 0.03
0.848PheMet: 0.848 ± 0.012
1.548PheAsn: 1.548 ± 0.017
1.792PhePro: 1.792 ± 0.021
1.411PheGln: 1.411 ± 0.017
1.816PheArg: 1.816 ± 0.018
2.937PheSer: 2.937 ± 0.023
2.255PheThr: 2.255 ± 0.022
2.428PheVal: 2.428 ± 0.023
0.676PheTrp: 0.676 ± 0.014
1.172PheTyr: 1.172 ± 0.014
0.0PheXaa: 0.0 ± 0.0
Gly
5.48GlyAla: 5.48 ± 0.04
0.914GlyCys: 0.914 ± 0.015
3.755GlyAsp: 3.755 ± 0.027
3.302GlyGlu: 3.302 ± 0.025
2.79GlyPhe: 2.79 ± 0.028
5.517GlyGly: 5.517 ± 0.055
1.657GlyHis: 1.657 ± 0.02
3.631GlyIle: 3.631 ± 0.03
3.588GlyLys: 3.588 ± 0.031
5.901GlyLeu: 5.901 ± 0.04
1.644GlyMet: 1.644 ± 0.019
2.747GlyAsn: 2.747 ± 0.033
3.112GlyPro: 3.112 ± 0.028
2.576GlyGln: 2.576 ± 0.024
3.76GlyArg: 3.76 ± 0.029
5.621GlySer: 5.621 ± 0.039
3.926GlyThr: 3.926 ± 0.032
4.181GlyVal: 4.181 ± 0.031
1.153GlyTrp: 1.153 ± 0.016
2.167GlyTyr: 2.167 ± 0.022
0.0GlyXaa: 0.0 ± 0.0
His
1.846HisAla: 1.846 ± 0.019
0.355HisCys: 0.355 ± 0.009
1.351HisAsp: 1.351 ± 0.015
1.237HisGlu: 1.237 ± 0.017
0.883HisPhe: 0.883 ± 0.013
1.712HisGly: 1.712 ± 0.019
0.789HisHis: 0.789 ± 0.015
1.254HisIle: 1.254 ± 0.017
0.95HisLys: 0.95 ± 0.015
2.127HisLeu: 2.127 ± 0.023
0.554HisMet: 0.554 ± 0.011
0.881HisAsn: 0.881 ± 0.014
1.428HisPro: 1.428 ± 0.019
0.965HisGln: 0.965 ± 0.016
1.421HisArg: 1.421 ± 0.017
1.735HisSer: 1.735 ± 0.021
1.266HisThr: 1.266 ± 0.016
1.402HisVal: 1.402 ± 0.017
0.339HisTrp: 0.339 ± 0.008
0.683HisTyr: 0.683 ± 0.013
0.0HisXaa: 0.0 ± 0.0
Ile
4.622IleAla: 4.622 ± 0.036
0.81IleCys: 0.81 ± 0.015
3.025IleAsp: 3.025 ± 0.021
2.834IleGlu: 2.834 ± 0.024
2.058IlePhe: 2.058 ± 0.023
3.318IleGly: 3.318 ± 0.033
1.215IleHis: 1.215 ± 0.015
2.721IleIle: 2.721 ± 0.028
2.358IleLys: 2.358 ± 0.022
4.653IleLeu: 4.653 ± 0.035
1.163IleMet: 1.163 ± 0.015
1.952IleAsn: 1.952 ± 0.018
3.0IlePro: 3.0 ± 0.023
2.028IleGln: 2.028 ± 0.02
2.767IleArg: 2.767 ± 0.025
3.874IleSer: 3.874 ± 0.032
3.023IleThr: 3.023 ± 0.024
3.363IleVal: 3.363 ± 0.032
0.737IleTrp: 0.737 ± 0.013
1.454IleTyr: 1.454 ± 0.018
0.0IleXaa: 0.0 ± 0.0
Lys
4.653LysAla: 4.653 ± 0.039
0.552LysCys: 0.552 ± 0.01
3.052LysAsp: 3.052 ± 0.026
3.214LysGlu: 3.214 ± 0.03
1.544LysPhe: 1.544 ± 0.017
2.943LysGly: 2.943 ± 0.026
1.17LysHis: 1.17 ± 0.016
2.357LysIle: 2.357 ± 0.023
3.343LysLys: 3.343 ± 0.047
4.326LysLeu: 4.326 ± 0.03
1.144LysMet: 1.144 ± 0.014
1.852LysAsn: 1.852 ± 0.022
2.826LysPro: 2.826 ± 0.025
2.044LysGln: 2.044 ± 0.019
3.266LysArg: 3.266 ± 0.029
3.593LysSer: 3.593 ± 0.03
3.182LysThr: 3.182 ± 0.025
2.838LysVal: 2.838 ± 0.024
0.681LysTrp: 0.681 ± 0.01
1.448LysTyr: 1.448 ± 0.016
0.0LysXaa: 0.0 ± 0.0
Leu
8.286LeuAla: 8.286 ± 0.048
1.224LeuCys: 1.224 ± 0.016
5.245LeuAsp: 5.245 ± 0.035
5.191LeuGlu: 5.191 ± 0.034
3.382LeuPhe: 3.382 ± 0.029
5.774LeuGly: 5.774 ± 0.044
2.145LeuHis: 2.145 ± 0.021
4.045LeuIle: 4.045 ± 0.032
4.084LeuLys: 4.084 ± 0.033
8.37LeuLeu: 8.37 ± 0.053
1.861LeuMet: 1.861 ± 0.021
3.191LeuAsn: 3.191 ± 0.031
5.284LeuPro: 5.284 ± 0.036
3.868LeuGln: 3.868 ± 0.029
5.351LeuArg: 5.351 ± 0.037
7.043LeuSer: 7.043 ± 0.044
4.842LeuThr: 4.842 ± 0.031
5.41LeuVal: 5.41 ± 0.04
1.19LeuTrp: 1.19 ± 0.018
2.412LeuTyr: 2.412 ± 0.024
0.0LeuXaa: 0.0 ± 0.0
Met
2.486MetAla: 2.486 ± 0.021
0.248MetCys: 0.248 ± 0.006
1.431MetAsp: 1.431 ± 0.016
1.335MetGlu: 1.335 ± 0.016
0.809MetPhe: 0.809 ± 0.012
1.496MetGly: 1.496 ± 0.017
0.543MetHis: 0.543 ± 0.009
1.038MetIle: 1.038 ± 0.012
1.128MetLys: 1.128 ± 0.017
2.042MetLeu: 2.042 ± 0.019
0.675MetMet: 0.675 ± 0.012
0.832MetAsn: 0.832 ± 0.013
1.38MetPro: 1.38 ± 0.018
0.997MetGln: 0.997 ± 0.016
1.32MetArg: 1.32 ± 0.017
2.002MetSer: 2.002 ± 0.02
1.417MetThr: 1.417 ± 0.017
1.413MetVal: 1.413 ± 0.016
0.282MetTrp: 0.282 ± 0.007
0.615MetTyr: 0.615 ± 0.011
0.0MetXaa: 0.0 ± 0.0
Asn
3.346AsnAla: 3.346 ± 0.026
0.471AsnCys: 0.471 ± 0.009
2.14AsnAsp: 2.14 ± 0.02
2.009AsnGlu: 2.009 ± 0.02
1.426AsnPhe: 1.426 ± 0.017
3.207AsnGly: 3.207 ± 0.031
0.854AsnHis: 0.854 ± 0.013
2.185AsnIle: 2.185 ± 0.023
1.81AsnLys: 1.81 ± 0.02
3.261AsnLeu: 3.261 ± 0.027
1.014AsnMet: 1.014 ± 0.015
1.715AsnAsn: 1.715 ± 0.025
2.338AsnPro: 2.338 ± 0.022
1.394AsnGln: 1.394 ± 0.017
1.939AsnArg: 1.939 ± 0.017
2.775AsnSer: 2.775 ± 0.026
2.344AsnThr: 2.344 ± 0.023
2.381AsnVal: 2.381 ± 0.022
0.595AsnTrp: 0.595 ± 0.01
1.104AsnTyr: 1.104 ± 0.015
0.0AsnXaa: 0.0 ± 0.0
Pro
5.384ProAla: 5.384 ± 0.046
0.532ProCys: 0.532 ± 0.011
3.061ProAsp: 3.061 ± 0.024
3.487ProGlu: 3.487 ± 0.031
1.977ProPhe: 1.977 ± 0.022
3.705ProGly: 3.705 ± 0.035
1.147ProHis: 1.147 ± 0.017
2.582ProIle: 2.582 ± 0.022
2.769ProLys: 2.769 ± 0.024
4.453ProLeu: 4.453 ± 0.034
1.138ProMet: 1.138 ± 0.017
2.228ProAsn: 2.228 ± 0.027
4.438ProPro: 4.438 ± 0.07
2.282ProGln: 2.282 ± 0.028
3.032ProArg: 3.032 ± 0.033
5.338ProSer: 5.338 ± 0.05
3.886ProThr: 3.886 ± 0.034
3.597ProVal: 3.597 ± 0.033
0.705ProTrp: 0.705 ± 0.012
1.424ProTyr: 1.424 ± 0.02
0.0ProXaa: 0.0 ± 0.0
Gln
3.556GlnAla: 3.556 ± 0.031
0.453GlnCys: 0.453 ± 0.009
2.207GlnAsp: 2.207 ± 0.022
2.207GlnGlu: 2.207 ± 0.023
1.356GlnPhe: 1.356 ± 0.017
2.397GlnGly: 2.397 ± 0.026
1.121GlnHis: 1.121 ± 0.02
1.928GlnIle: 1.928 ± 0.02
1.928GlnLys: 1.928 ± 0.019
3.588GlnLeu: 3.588 ± 0.033
0.968GlnMet: 0.968 ± 0.015
1.567GlnAsn: 1.567 ± 0.02
2.482GlnPro: 2.482 ± 0.031
2.582GlnGln: 2.582 ± 0.059
2.59GlnArg: 2.59 ± 0.024
3.051GlnSer: 3.051 ± 0.028
2.383GlnThr: 2.383 ± 0.02
2.209GlnVal: 2.209 ± 0.021
0.578GlnTrp: 0.578 ± 0.011
1.214GlnTyr: 1.214 ± 0.016
0.0GlnXaa: 0.0 ± 0.0
Arg
4.479ArgAla: 4.479 ± 0.031
0.716ArgCys: 0.716 ± 0.014
3.311ArgAsp: 3.311 ± 0.037
3.273ArgGlu: 3.273 ± 0.03
2.072ArgPhe: 2.072 ± 0.021
3.426ArgGly: 3.426 ± 0.03
1.517ArgHis: 1.517 ± 0.02
2.833ArgIle: 2.833 ± 0.021
3.139ArgLys: 3.139 ± 0.025
5.194ArgLeu: 5.194 ± 0.036
1.275ArgMet: 1.275 ± 0.017
2.155ArgAsn: 2.155 ± 0.021
3.153ArgPro: 3.153 ± 0.031
2.618ArgGln: 2.618 ± 0.025
4.607ArgArg: 4.607 ± 0.041
4.376ArgSer: 4.376 ± 0.04
3.054ArgThr: 3.054 ± 0.028
3.106ArgVal: 3.106 ± 0.026
0.872ArgTrp: 0.872 ± 0.012
1.586ArgTyr: 1.586 ± 0.019
0.0ArgXaa: 0.0 ± 0.0
Ser
6.744SerAla: 6.744 ± 0.038
0.927SerCys: 0.927 ± 0.014
4.271SerAsp: 4.271 ± 0.031
3.841SerGlu: 3.841 ± 0.03
3.025SerPhe: 3.025 ± 0.027
5.349SerGly: 5.349 ± 0.041
1.833SerHis: 1.833 ± 0.02
4.202SerIle: 4.202 ± 0.035
3.959SerLys: 3.959 ± 0.031
6.944SerLeu: 6.944 ± 0.047
1.827SerMet: 1.827 ± 0.02
3.042SerAsn: 3.042 ± 0.023
5.023SerPro: 5.023 ± 0.053
3.203SerGln: 3.203 ± 0.028
4.647SerArg: 4.647 ± 0.039
8.202SerSer: 8.202 ± 0.072
5.395SerThr: 5.395 ± 0.043
4.611SerVal: 4.611 ± 0.031
1.126SerTrp: 1.126 ± 0.015
2.107SerTyr: 2.107 ± 0.022
0.0SerXaa: 0.0 ± 0.0
Thr
5.595ThrAla: 5.595 ± 0.033
0.775ThrCys: 0.775 ± 0.012
3.043ThrAsp: 3.043 ± 0.021
3.17ThrGlu: 3.17 ± 0.029
2.274ThrPhe: 2.274 ± 0.021
4.147ThrGly: 4.147 ± 0.035
1.256ThrHis: 1.256 ± 0.016
3.341ThrIle: 3.341 ± 0.029
2.977ThrLys: 2.977 ± 0.026
5.345ThrLeu: 5.345 ± 0.041
1.342ThrMet: 1.342 ± 0.014
2.285ThrAsn: 2.285 ± 0.021
4.112ThrPro: 4.112 ± 0.036
2.101ThrGln: 2.101 ± 0.018
2.924ThrArg: 2.924 ± 0.026
5.254ThrSer: 5.254 ± 0.041
4.464ThrThr: 4.464 ± 0.038
3.921ThrVal: 3.921 ± 0.033
0.858ThrTrp: 0.858 ± 0.014
1.636ThrTyr: 1.636 ± 0.019
0.0ThrXaa: 0.0 ± 0.0
Val
5.783ValAla: 5.783 ± 0.04
0.847ValCys: 0.847 ± 0.013
3.844ValAsp: 3.844 ± 0.029
3.692ValGlu: 3.692 ± 0.032
2.493ValPhe: 2.493 ± 0.022
3.953ValGly: 3.953 ± 0.033
1.341ValHis: 1.341 ± 0.016
3.056ValIle: 3.056 ± 0.029
2.929ValLys: 2.929 ± 0.027
5.527ValLeu: 5.527 ± 0.04
1.394ValMet: 1.394 ± 0.018
2.263ValAsn: 2.263 ± 0.023
3.475ValPro: 3.475 ± 0.025
2.328ValGln: 2.328 ± 0.021
3.184ValArg: 3.184 ± 0.025
4.592ValSer: 4.592 ± 0.034
3.62ValThr: 3.62 ± 0.029
4.323ValVal: 4.323 ± 0.034
0.857ValTrp: 0.857 ± 0.013
1.772ValTyr: 1.772 ± 0.021
0.0ValXaa: 0.0 ± 0.0
Trp
1.181TrpAla: 1.181 ± 0.015
0.186TrpCys: 0.186 ± 0.006
0.916TrpAsp: 0.916 ± 0.013
0.781TrpGlu: 0.781 ± 0.012
0.553TrpPhe: 0.553 ± 0.01
0.9TrpGly: 0.9 ± 0.012
0.357TrpHis: 0.357 ± 0.009
0.763TrpIle: 0.763 ± 0.012
0.792TrpLys: 0.792 ± 0.014
1.375TrpLeu: 1.375 ± 0.018
0.386TrpMet: 0.386 ± 0.008
0.645TrpAsn: 0.645 ± 0.013
0.636TrpPro: 0.636 ± 0.011
0.612TrpGln: 0.612 ± 0.011
0.884TrpArg: 0.884 ± 0.013
1.026TrpSer: 1.026 ± 0.012
0.946TrpThr: 0.946 ± 0.017
0.88TrpVal: 0.88 ± 0.015
0.283TrpTrp: 0.283 ± 0.007
0.443TrpTyr: 0.443 ± 0.01
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.248TyrAla: 2.248 ± 0.02
0.431TyrCys: 0.431 ± 0.009
1.676TyrAsp: 1.676 ± 0.019
1.509TyrGlu: 1.509 ± 0.017
1.223TyrPhe: 1.223 ± 0.014
2.124TyrGly: 2.124 ± 0.022
0.712TyrHis: 0.712 ± 0.013
1.482TyrIle: 1.482 ± 0.018
1.249TyrLys: 1.249 ± 0.016
2.569TyrLeu: 2.569 ± 0.024
0.685TyrMet: 0.685 ± 0.011
1.231TyrAsn: 1.231 ± 0.015
1.444TyrPro: 1.444 ± 0.019
1.104TyrGln: 1.104 ± 0.015
1.532TyrArg: 1.532 ± 0.015
2.099TyrSer: 2.099 ± 0.02
1.683TyrThr: 1.683 ± 0.017
1.645TyrVal: 1.645 ± 0.018
0.446TyrTrp: 0.446 ± 0.011
0.983TyrTyr: 0.983 ± 0.015
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.001XaaMet: 0.001 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 11466 proteins (5553574 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski