Amino acid dipeptide frequency for Desulfopila sp. IMCC35006

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
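As a rough illustration of what such a calculation involves, the Python sketch below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name, the toy sequences, the use of one-letter codes, and the choice to map non-standard letters to X (Xaa) are assumptions made for this example; the exact procedure used to build the table on this page is not described here.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up toy sequences, not taken from the proteome.
toy = ["MALLA", "MLL"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

Counting within each protein separately means the last residue of one protein and the first residue of the next are never treated as a dipeptide.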

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
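The uniform expectation can be checked with a few lines of Python: 1/400 is 0.25%, which is 2.5 on the per mille scale used in the table. The sketch below compares a handful of values from the table against that baseline. The `classify` helper and its labels are hypothetical, and whether the page's colouring uses exactly this threshold is an assumption.

```python
N_STANDARD = 20

# Uniform expectation: 1 / (20 * 20) = 0.0025 = 0.25% = 2.5 per mille.
expected_per_mille = 1000.0 / (N_STANDARD ** 2)


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (hypothetical helper)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# A few values copied from the table (per mille).
examples = {"LeuLeu": 11.574, "AlaAla": 8.972, "CysTrp": 0.15, "TrpCys": 0.133}
labels = {name: classify(value) for name, value in examples.items()}
for name, label in labels.items():
    print(name, examples[name], label)
```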

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
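For instance, the AlaAla value of 8.972‰ corresponds to a fraction of about 0.008972, or roughly 0.9%. A minimal Python sketch of the conversion (the function names are only illustrative):

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 8.972  # AlaAla value from the table, in per mille
print(per_mille_to_fraction(ala_ala))
print(per_mille_to_percent(ala_ala))
```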

For more information, see the sequence space article on Wikipedia.

Residue order (rows = first residue, columns = second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Each entry below is given as dipeptide: frequency (‰) ± error.

Ala
AlaAla: 8.972 ± 0.105
AlaCys: 1.114 ± 0.033
AlaAsp: 4.758 ± 0.055
AlaGlu: 5.745 ± 0.066
AlaPhe: 3.261 ± 0.041
AlaGly: 7.383 ± 0.087
AlaHis: 1.59 ± 0.035
AlaIle: 5.822 ± 0.064
AlaLys: 4.283 ± 0.061
AlaLeu: 8.872 ± 0.086
AlaMet: 2.555 ± 0.045
AlaAsn: 2.793 ± 0.049
AlaPro: 2.949 ± 0.046
AlaGln: 2.888 ± 0.036
AlaArg: 4.564 ± 0.05
AlaSer: 4.522 ± 0.058
AlaThr: 4.604 ± 0.056
AlaVal: 6.425 ± 0.079
AlaTrp: 0.902 ± 0.029
AlaTyr: 2.236 ± 0.037
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.01 ± 0.026
CysCys: 0.288 ± 0.014
CysAsp: 0.583 ± 0.02
CysGlu: 0.602 ± 0.02
CysPhe: 0.512 ± 0.018
CysGly: 1.255 ± 0.035
CysHis: 0.437 ± 0.026
CysIle: 0.743 ± 0.023
CysLys: 0.511 ± 0.02
CysLeu: 1.288 ± 0.03
CysMet: 0.339 ± 0.016
CysAsn: 0.444 ± 0.016
CysPro: 0.784 ± 0.027
CysGln: 0.444 ± 0.018
CysArg: 0.848 ± 0.026
CysSer: 0.901 ± 0.025
CysThr: 0.672 ± 0.02
CysVal: 0.746 ± 0.023
CysTrp: 0.15 ± 0.01
CysTyr: 0.386 ± 0.016
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.17 ± 0.053
AspCys: 0.644 ± 0.023
AspAsp: 2.69 ± 0.046
AspGlu: 3.294 ± 0.048
AspPhe: 2.587 ± 0.037
AspGly: 3.92 ± 0.063
AspHis: 1.166 ± 0.026
AspIle: 4.126 ± 0.057
AspLys: 2.929 ± 0.047
AspLeu: 5.571 ± 0.069
AspMet: 1.361 ± 0.031
AspAsn: 2.051 ± 0.038
AspPro: 2.602 ± 0.039
AspGln: 1.912 ± 0.033
AspArg: 3.0 ± 0.048
AspSer: 2.979 ± 0.042
AspThr: 2.729 ± 0.05
AspVal: 3.282 ± 0.051
AspTrp: 0.683 ± 0.019
AspTyr: 1.789 ± 0.034
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.887 ± 0.061
GluCys: 0.567 ± 0.02
GluAsp: 2.86 ± 0.038
GluGlu: 4.167 ± 0.057
GluPhe: 2.093 ± 0.042
GluGly: 3.612 ± 0.043
GluHis: 1.285 ± 0.031
GluIle: 4.723 ± 0.063
GluLys: 4.927 ± 0.068
GluLeu: 6.248 ± 0.078
GluMet: 1.843 ± 0.034
GluAsn: 2.759 ± 0.042
GluPro: 2.002 ± 0.035
GluGln: 2.872 ± 0.048
GluArg: 3.251 ± 0.053
GluSer: 3.163 ± 0.048
GluThr: 3.202 ± 0.043
GluVal: 3.894 ± 0.056
GluTrp: 0.614 ± 0.023
GluTyr: 1.75 ± 0.034
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.606 ± 0.047
PheCys: 0.684 ± 0.025
PheAsp: 2.453 ± 0.04
PheGlu: 1.957 ± 0.039
PhePhe: 2.365 ± 0.05
PheGly: 3.273 ± 0.052
PheHis: 0.92 ± 0.029
PheIle: 2.739 ± 0.047
PheLys: 1.805 ± 0.031
PheLeu: 4.601 ± 0.064
PheMet: 1.057 ± 0.024
PheAsn: 1.59 ± 0.031
PhePro: 1.838 ± 0.037
PheGln: 1.41 ± 0.028
PheArg: 1.983 ± 0.037
PheSer: 3.252 ± 0.048
PheThr: 2.651 ± 0.042
PheVal: 2.813 ± 0.042
PheTrp: 0.56 ± 0.022
PheTyr: 1.36 ± 0.031
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 5.832 ± 0.071
GlyCys: 1.181 ± 0.029
GlyAsp: 3.669 ± 0.066
GlyGlu: 4.238 ± 0.055
GlyPhe: 3.293 ± 0.05
GlyGly: 5.715 ± 0.083
GlyHis: 1.593 ± 0.028
GlyIle: 5.564 ± 0.069
GlyLys: 4.633 ± 0.064
GlyLeu: 7.472 ± 0.076
GlyMet: 2.198 ± 0.04
GlyAsn: 2.629 ± 0.049
GlyPro: 2.336 ± 0.04
GlyGln: 2.561 ± 0.039
GlyArg: 4.145 ± 0.051
GlySer: 4.428 ± 0.065
GlyThr: 4.195 ± 0.079
GlyVal: 5.069 ± 0.062
GlyTrp: 1.001 ± 0.027
GlyTyr: 2.578 ± 0.038
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.575 ± 0.034
HisCys: 0.375 ± 0.017
HisAsp: 1.146 ± 0.025
HisGlu: 1.144 ± 0.028
HisPhe: 1.052 ± 0.026
HisGly: 1.688 ± 0.039
HisHis: 0.649 ± 0.025
HisIle: 1.4 ± 0.03
HisLys: 0.98 ± 0.025
HisLeu: 2.298 ± 0.044
HisMet: 0.512 ± 0.018
HisAsn: 0.817 ± 0.024
HisPro: 1.299 ± 0.026
HisGln: 0.838 ± 0.023
HisArg: 1.181 ± 0.029
HisSer: 1.27 ± 0.03
HisThr: 1.023 ± 0.024
HisVal: 1.228 ± 0.029
HisTrp: 0.28 ± 0.014
HisTyr: 0.702 ± 0.023
HisXaa: 0.0 ± 0.0

Ile
IleAla: 6.153 ± 0.068
IleCys: 0.93 ± 0.025
IleAsp: 4.257 ± 0.056
IleGlu: 4.159 ± 0.061
IlePhe: 2.97 ± 0.048
IleGly: 5.171 ± 0.064
IleHis: 1.468 ± 0.03
IleIle: 4.665 ± 0.067
IleLys: 3.402 ± 0.051
IleLeu: 6.776 ± 0.069
IleMet: 1.542 ± 0.035
IleAsn: 2.687 ± 0.045
IlePro: 3.1 ± 0.047
IleGln: 2.191 ± 0.038
IleArg: 3.557 ± 0.049
IleSer: 4.447 ± 0.056
IleThr: 3.912 ± 0.057
IleVal: 4.714 ± 0.061
IleTrp: 0.604 ± 0.021
IleTyr: 1.741 ± 0.032
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.285 ± 0.071
LysCys: 0.484 ± 0.018
LysAsp: 2.924 ± 0.049
LysGlu: 3.875 ± 0.057
LysPhe: 1.669 ± 0.031
LysGly: 3.543 ± 0.048
LysHis: 0.902 ± 0.029
LysIle: 4.355 ± 0.057
LysLys: 4.408 ± 0.062
LysLeu: 4.631 ± 0.057
LysMet: 1.644 ± 0.032
LysAsn: 2.738 ± 0.042
LysPro: 2.098 ± 0.04
LysGln: 1.976 ± 0.035
LysArg: 2.806 ± 0.048
LysSer: 3.057 ± 0.054
LysThr: 3.223 ± 0.046
LysVal: 3.543 ± 0.058
LysTrp: 0.515 ± 0.021
LysTyr: 1.543 ± 0.033
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 10.432 ± 0.096
LeuCys: 1.329 ± 0.034
LeuAsp: 5.494 ± 0.064
LeuGlu: 5.883 ± 0.075
LeuPhe: 4.678 ± 0.066
LeuGly: 7.194 ± 0.07
LeuHis: 2.391 ± 0.041
LeuIle: 6.174 ± 0.066
LeuLys: 5.2 ± 0.059
LeuLeu: 11.574 ± 0.131
LeuMet: 2.297 ± 0.047
LeuAsn: 3.504 ± 0.048
LeuPro: 5.093 ± 0.06
LeuGln: 4.522 ± 0.059
LeuArg: 5.263 ± 0.068
LeuSer: 6.831 ± 0.068
LeuThr: 5.769 ± 0.069
LeuVal: 6.98 ± 0.078
LeuTrp: 0.983 ± 0.024
LeuTyr: 2.72 ± 0.047
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.705 ± 0.047
MetCys: 0.24 ± 0.012
MetAsp: 1.405 ± 0.03
MetGlu: 1.573 ± 0.031
MetPhe: 0.891 ± 0.026
MetGly: 1.812 ± 0.038
MetHis: 0.569 ± 0.02
MetIle: 1.595 ± 0.033
MetLys: 1.633 ± 0.034
MetLeu: 2.564 ± 0.04
MetMet: 0.675 ± 0.022
MetAsn: 1.064 ± 0.028
MetPro: 1.175 ± 0.029
MetGln: 1.1 ± 0.03
MetArg: 1.295 ± 0.029
MetSer: 1.506 ± 0.029
MetThr: 1.614 ± 0.039
MetVal: 2.004 ± 0.04
MetTrp: 0.181 ± 0.011
MetTyr: 0.569 ± 0.019
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.581 ± 0.047
AsnCys: 0.561 ± 0.023
AsnAsp: 1.914 ± 0.05
AsnGlu: 1.924 ± 0.038
AsnPhe: 1.584 ± 0.035
AsnGly: 2.653 ± 0.054
AsnHis: 0.812 ± 0.023
AsnIle: 2.932 ± 0.048
AsnLys: 1.874 ± 0.04
AsnLeu: 3.964 ± 0.051
AsnMet: 0.917 ± 0.023
AsnAsn: 1.575 ± 0.04
AsnPro: 2.059 ± 0.038
AsnGln: 1.346 ± 0.029
AsnArg: 2.219 ± 0.036
AsnSer: 2.313 ± 0.04
AsnThr: 1.85 ± 0.042
AsnVal: 2.344 ± 0.045
AsnTrp: 0.469 ± 0.018
AsnTyr: 1.197 ± 0.033
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 3.942 ± 0.058
ProCys: 0.439 ± 0.019
ProAsp: 2.822 ± 0.048
ProGlu: 3.546 ± 0.053
ProPhe: 1.979 ± 0.04
ProGly: 3.523 ± 0.055
ProHis: 0.879 ± 0.024
ProIle: 2.371 ± 0.037
ProLys: 1.9 ± 0.032
ProLeu: 4.362 ± 0.057
ProMet: 0.971 ± 0.025
ProAsn: 1.26 ± 0.03
ProPro: 1.99 ± 0.04
ProGln: 1.563 ± 0.035
ProArg: 1.756 ± 0.037
ProSer: 2.352 ± 0.037
ProThr: 2.072 ± 0.04
ProVal: 3.772 ± 0.052
ProTrp: 0.516 ± 0.018
ProTyr: 1.258 ± 0.027
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.457 ± 0.05
GlnCys: 0.421 ± 0.016
GlnAsp: 1.801 ± 0.03
GlnGlu: 2.503 ± 0.044
GlnPhe: 1.391 ± 0.025
GlnGly: 2.531 ± 0.041
GlnHis: 0.819 ± 0.021
GlnIle: 2.388 ± 0.036
GlnLys: 2.481 ± 0.044
GlnLeu: 3.991 ± 0.046
GlnMet: 1.052 ± 0.024
GlnAsn: 1.438 ± 0.029
GlnPro: 1.63 ± 0.033
GlnGln: 2.008 ± 0.044
GlnArg: 2.01 ± 0.038
GlnSer: 2.143 ± 0.033
GlnThr: 2.026 ± 0.032
GlnVal: 2.562 ± 0.046
GlnTrp: 0.452 ± 0.016
GlnTyr: 1.056 ± 0.029
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.739 ± 0.056
ArgCys: 0.644 ± 0.022
ArgAsp: 2.767 ± 0.041
ArgGlu: 3.565 ± 0.054
ArgPhe: 2.453 ± 0.047
ArgGly: 3.093 ± 0.05
ArgHis: 1.257 ± 0.029
ArgIle: 3.889 ± 0.055
ArgLys: 3.228 ± 0.049
ArgLeu: 5.843 ± 0.07
ArgMet: 1.463 ± 0.033
ArgAsn: 2.013 ± 0.032
ArgPro: 2.104 ± 0.032
ArgGln: 2.558 ± 0.041
ArgArg: 3.053 ± 0.053
ArgSer: 2.985 ± 0.044
ArgThr: 2.577 ± 0.038
ArgVal: 3.367 ± 0.047
ArgTrp: 0.579 ± 0.019
ArgTyr: 1.801 ± 0.037
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.827 ± 0.056
SerCys: 0.884 ± 0.029
SerAsp: 2.938 ± 0.046
SerGlu: 3.304 ± 0.05
SerPhe: 2.936 ± 0.05
SerGly: 5.268 ± 0.07
SerHis: 1.284 ± 0.029
SerIle: 3.984 ± 0.057
SerLys: 2.61 ± 0.042
SerLeu: 6.746 ± 0.055
SerMet: 1.677 ± 0.031
SerAsn: 1.838 ± 0.04
SerPro: 2.759 ± 0.039
SerGln: 2.089 ± 0.035
SerArg: 3.347 ± 0.045
SerSer: 3.997 ± 0.064
SerThr: 3.103 ± 0.055
SerVal: 3.915 ± 0.05
SerTrp: 0.807 ± 0.025
SerTyr: 1.835 ± 0.034
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.842 ± 0.061
ThrCys: 0.642 ± 0.021
ThrAsp: 2.866 ± 0.054
ThrGlu: 3.089 ± 0.042
ThrPhe: 2.241 ± 0.037
ThrGly: 5.016 ± 0.073
ThrHis: 1.023 ± 0.023
ThrIle: 3.996 ± 0.051
ThrLys: 2.246 ± 0.043
ThrLeu: 5.847 ± 0.068
ThrMet: 1.383 ± 0.033
ThrAsn: 1.774 ± 0.038
ThrPro: 2.709 ± 0.048
ThrGln: 1.486 ± 0.033
ThrArg: 2.64 ± 0.046
ThrSer: 3.16 ± 0.062
ThrThr: 3.176 ± 0.055
ThrVal: 4.237 ± 0.06
ThrTrp: 0.563 ± 0.021
ThrTyr: 1.452 ± 0.029
ThrXaa: 0.0 ± 0.0

Val
ValAla: 6.259 ± 0.061
ValCys: 0.919 ± 0.027
ValAsp: 3.983 ± 0.053
ValGlu: 4.142 ± 0.057
ValPhe: 3.063 ± 0.047
ValGly: 4.719 ± 0.059
ValHis: 1.413 ± 0.032
ValIle: 4.691 ± 0.051
ValLys: 3.35 ± 0.047
ValLeu: 7.008 ± 0.076
ValMet: 1.759 ± 0.034
ValAsn: 2.53 ± 0.051
ValPro: 2.908 ± 0.043
ValGln: 2.392 ± 0.042
ValArg: 3.503 ± 0.053
ValSer: 4.279 ± 0.055
ValThr: 3.884 ± 0.056
ValVal: 5.232 ± 0.067
ValTrp: 0.649 ± 0.021
ValTyr: 1.796 ± 0.035
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.792 ± 0.021
TrpCys: 0.133 ± 0.009
TrpAsp: 0.519 ± 0.019
TrpGlu: 0.561 ± 0.02
TrpPhe: 0.516 ± 0.02
TrpGly: 0.701 ± 0.021
TrpHis: 0.284 ± 0.013
TrpIle: 0.655 ± 0.02
TrpLys: 0.564 ± 0.018
TrpLeu: 1.321 ± 0.035
TrpMet: 0.317 ± 0.017
TrpAsn: 0.456 ± 0.019
TrpPro: 0.474 ± 0.018
TrpGln: 0.707 ± 0.022
TrpArg: 0.635 ± 0.019
TrpSer: 0.675 ± 0.018
TrpThr: 0.513 ± 0.019
TrpVal: 0.709 ± 0.021
TrpTrp: 0.162 ± 0.01
TrpTyr: 0.368 ± 0.016
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.173 ± 0.038
TyrCys: 0.464 ± 0.018
TyrAsp: 1.636 ± 0.03
TyrGlu: 1.448 ± 0.032
TyrPhe: 1.383 ± 0.032
TyrGly: 2.265 ± 0.038
TyrHis: 0.714 ± 0.023
TyrIle: 1.633 ± 0.035
TyrLys: 1.255 ± 0.03
TyrLeu: 3.335 ± 0.044
TyrMet: 0.609 ± 0.02
TyrAsn: 1.127 ± 0.026
TyrPro: 1.364 ± 0.028
TyrGln: 1.298 ± 0.029
TyrArg: 1.925 ± 0.034
TyrSer: 1.888 ± 0.036
TyrThr: 1.591 ± 0.039
TyrVal: 1.641 ± 0.033
TyrTrp: 0.388 ± 0.016
TyrTyr: 1.045 ± 0.027
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 4729 proteins (1606768 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
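Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the 100 estimates is reported. This is only a minimal Python illustration. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the error are assumptions; the exact estimator behind the ± values in the table is not stated on this page.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs within each protein.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of the frequency over n_boot protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    # Assumption: the sample standard deviation is used as the error.
    return statistics.stdev(estimates)


# Made-up toy sequences, not taken from the proteome.
toy = ["MALLA", "MLLK", "MKKAL", "MAAL"]
point = pair_per_mille(toy, "LL")
err = bootstrap_error(toy, "LL")
print(f"LL: {point:.3f} ± {err:.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides within a protein stay together in every resample.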

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in UniProt
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski