Amino acid dipepetide frequency for Desulfosporosinus sp. OT

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.191AlaAla: 6.191 ± 0.082
0.892AlaCys: 0.892 ± 0.029
3.521AlaAsp: 3.521 ± 0.048
4.909AlaGlu: 4.909 ± 0.067
2.974AlaPhe: 2.974 ± 0.045
5.752AlaGly: 5.752 ± 0.073
1.365AlaHis: 1.365 ± 0.031
5.522AlaIle: 5.522 ± 0.063
4.845AlaLys: 4.845 ± 0.06
8.356AlaLeu: 8.356 ± 0.081
2.119AlaMet: 2.119 ± 0.041
2.842AlaAsn: 2.842 ± 0.043
2.255AlaPro: 2.255 ± 0.038
3.018AlaGln: 3.018 ± 0.045
3.424AlaArg: 3.424 ± 0.053
4.179AlaSer: 4.179 ± 0.058
3.784AlaThr: 3.784 ± 0.06
5.668AlaVal: 5.668 ± 0.073
0.756AlaTrp: 0.756 ± 0.02
2.3AlaTyr: 2.3 ± 0.037
0.0AlaXaa: 0.0 ± 0.0
Cys
0.729CysAla: 0.729 ± 0.021
0.216CysCys: 0.216 ± 0.012
0.544CysAsp: 0.544 ± 0.018
0.629CysGlu: 0.629 ± 0.02
0.483CysPhe: 0.483 ± 0.02
1.18CysGly: 1.18 ± 0.032
0.342CysHis: 0.342 ± 0.021
0.745CysIle: 0.745 ± 0.022
0.542CysLys: 0.542 ± 0.02
1.128CysLeu: 1.128 ± 0.029
0.257CysMet: 0.257 ± 0.013
0.457CysAsn: 0.457 ± 0.02
0.661CysPro: 0.661 ± 0.028
0.441CysGln: 0.441 ± 0.016
0.56CysArg: 0.56 ± 0.02
0.78CysSer: 0.78 ± 0.024
0.613CysThr: 0.613 ± 0.022
0.69CysVal: 0.69 ± 0.023
0.121CysTrp: 0.121 ± 0.009
0.392CysTyr: 0.392 ± 0.017
0.0CysXaa: 0.0 ± 0.0
Asp
3.259AspAla: 3.259 ± 0.052
0.572AspCys: 0.572 ± 0.021
2.238AspAsp: 2.238 ± 0.05
3.464AspGlu: 3.464 ± 0.048
2.347AspPhe: 2.347 ± 0.039
3.304AspGly: 3.304 ± 0.057
0.935AspHis: 0.935 ± 0.023
3.936AspIle: 3.936 ± 0.053
3.081AspLys: 3.081 ± 0.044
5.448AspLeu: 5.448 ± 0.062
1.291AspMet: 1.291 ± 0.026
1.925AspAsn: 1.925 ± 0.041
2.123AspPro: 2.123 ± 0.04
1.96AspGln: 1.96 ± 0.036
2.268AspArg: 2.268 ± 0.041
2.792AspSer: 2.792 ± 0.042
2.398AspThr: 2.398 ± 0.044
3.449AspVal: 3.449 ± 0.053
0.59AspTrp: 0.59 ± 0.022
2.046AspTyr: 2.046 ± 0.044
0.0AspXaa: 0.0 ± 0.0
Glu
5.034GluAla: 5.034 ± 0.065
0.642GluCys: 0.642 ± 0.022
3.167GluAsp: 3.167 ± 0.042
5.242GluGlu: 5.242 ± 0.074
2.555GluPhe: 2.555 ± 0.042
4.134GluGly: 4.134 ± 0.061
1.277GluHis: 1.277 ± 0.032
5.544GluIle: 5.544 ± 0.053
4.877GluLys: 4.877 ± 0.064
6.883GluLeu: 6.883 ± 0.079
1.942GluMet: 1.942 ± 0.037
3.114GluAsn: 3.114 ± 0.052
1.85GluPro: 1.85 ± 0.038
2.858GluGln: 2.858 ± 0.049
3.613GluArg: 3.613 ± 0.059
3.353GluSer: 3.353 ± 0.046
3.347GluThr: 3.347 ± 0.044
4.787GluVal: 4.787 ± 0.062
0.702GluTrp: 0.702 ± 0.021
1.991GluTyr: 1.991 ± 0.038
0.0GluXaa: 0.0 ± 0.0
Phe
3.019PheAla: 3.019 ± 0.042
0.544PheCys: 0.544 ± 0.02
2.121PheAsp: 2.121 ± 0.035
2.494PheGlu: 2.494 ± 0.036
1.862PhePhe: 1.862 ± 0.039
3.111PheGly: 3.111 ± 0.045
0.767PheHis: 0.767 ± 0.021
3.106PheIle: 3.106 ± 0.051
2.233PheLys: 2.233 ± 0.039
4.351PheLeu: 4.351 ± 0.064
1.092PheMet: 1.092 ± 0.028
1.78PheAsn: 1.78 ± 0.033
1.656PhePro: 1.656 ± 0.039
1.381PheGln: 1.381 ± 0.033
1.727PheArg: 1.727 ± 0.039
3.078PheSer: 3.078 ± 0.048
2.209PheThr: 2.209 ± 0.037
2.818PheVal: 2.818 ± 0.047
0.526PheTrp: 0.526 ± 0.018
1.496PheTyr: 1.496 ± 0.031
0.0PheXaa: 0.0 ± 0.0
Gly
5.083GlyAla: 5.083 ± 0.07
1.094GlyCys: 1.094 ± 0.031
3.177GlyAsp: 3.177 ± 0.049
4.3GlyGlu: 4.3 ± 0.059
3.276GlyPhe: 3.276 ± 0.047
5.087GlyGly: 5.087 ± 0.078
1.345GlyHis: 1.345 ± 0.028
6.239GlyIle: 6.239 ± 0.069
4.941GlyLys: 4.941 ± 0.059
7.468GlyLeu: 7.468 ± 0.067
2.202GlyMet: 2.202 ± 0.04
2.799GlyAsn: 2.799 ± 0.048
1.835GlyPro: 1.835 ± 0.037
2.741GlyGln: 2.741 ± 0.048
3.099GlyArg: 3.099 ± 0.053
4.202GlySer: 4.202 ± 0.058
4.241GlyThr: 4.241 ± 0.058
5.435GlyVal: 5.435 ± 0.074
0.881GlyTrp: 0.881 ± 0.026
2.776GlyTyr: 2.776 ± 0.044
0.0GlyXaa: 0.0 ± 0.0
His
1.191HisAla: 1.191 ± 0.029
0.274HisCys: 0.274 ± 0.013
0.912HisAsp: 0.912 ± 0.024
1.143HisGlu: 1.143 ± 0.024
0.88HisPhe: 0.88 ± 0.022
1.37HisGly: 1.37 ± 0.026
0.488HisHis: 0.488 ± 0.019
1.253HisIle: 1.253 ± 0.027
1.006HisLys: 1.006 ± 0.023
1.968HisLeu: 1.968 ± 0.038
0.461HisMet: 0.461 ± 0.018
0.789HisAsn: 0.789 ± 0.022
1.015HisPro: 1.015 ± 0.028
0.736HisGln: 0.736 ± 0.021
0.889HisArg: 0.889 ± 0.026
1.141HisSer: 1.141 ± 0.024
0.97HisThr: 0.97 ± 0.026
1.152HisVal: 1.152 ± 0.024
0.25HisTrp: 0.25 ± 0.013
0.709HisTyr: 0.709 ± 0.024
0.0HisXaa: 0.0 ± 0.0
Ile
6.029IleAla: 6.029 ± 0.076
0.871IleCys: 0.871 ± 0.026
3.867IleAsp: 3.867 ± 0.051
4.883IleGlu: 4.883 ± 0.066
3.012IlePhe: 3.012 ± 0.053
5.447IleGly: 5.447 ± 0.075
1.345IleHis: 1.345 ± 0.028
5.616IleIle: 5.616 ± 0.073
4.412IleLys: 4.412 ± 0.058
7.779IleLeu: 7.779 ± 0.076
1.892IleMet: 1.892 ± 0.042
3.344IleAsn: 3.344 ± 0.05
3.531IlePro: 3.531 ± 0.051
2.698IleGln: 2.698 ± 0.042
3.453IleArg: 3.453 ± 0.043
5.198IleSer: 5.198 ± 0.065
4.262IleThr: 4.262 ± 0.057
5.158IleVal: 5.158 ± 0.06
0.672IleTrp: 0.672 ± 0.021
2.216IleTyr: 2.216 ± 0.044
0.0IleXaa: 0.0 ± 0.0
Lys
4.787LysAla: 4.787 ± 0.062
0.545LysCys: 0.545 ± 0.019
3.416LysAsp: 3.416 ± 0.055
4.753LysGlu: 4.753 ± 0.063
1.965LysPhe: 1.965 ± 0.033
4.161LysGly: 4.161 ± 0.053
1.043LysHis: 1.043 ± 0.026
4.814LysIle: 4.814 ± 0.067
4.138LysLys: 4.138 ± 0.06
5.865LysLeu: 5.865 ± 0.067
1.794LysMet: 1.794 ± 0.033
2.918LysAsn: 2.918 ± 0.043
2.173LysPro: 2.173 ± 0.038
2.326LysGln: 2.326 ± 0.043
3.048LysArg: 3.048 ± 0.049
3.523LysSer: 3.523 ± 0.049
3.544LysThr: 3.544 ± 0.049
4.724LysVal: 4.724 ± 0.051
0.594LysTrp: 0.594 ± 0.02
2.035LysTyr: 2.035 ± 0.039
0.0LysXaa: 0.0 ± 0.0
Leu
8.643LeuAla: 8.643 ± 0.074
1.074LeuCys: 1.074 ± 0.028
5.254LeuAsp: 5.254 ± 0.061
6.793LeuGlu: 6.793 ± 0.091
4.194LeuPhe: 4.194 ± 0.067
7.75LeuGly: 7.75 ± 0.082
1.808LeuHis: 1.808 ± 0.031
7.37LeuIle: 7.37 ± 0.07
6.757LeuLys: 6.757 ± 0.062
10.265LeuLeu: 10.265 ± 0.097
2.617LeuMet: 2.617 ± 0.04
4.544LeuAsn: 4.544 ± 0.053
4.273LeuPro: 4.273 ± 0.058
3.703LeuGln: 3.703 ± 0.054
4.893LeuArg: 4.893 ± 0.06
7.175LeuSer: 7.175 ± 0.085
6.041LeuThr: 6.041 ± 0.063
6.96LeuVal: 6.96 ± 0.07
1.022LeuTrp: 1.022 ± 0.027
2.84LeuTyr: 2.84 ± 0.047
0.0LeuXaa: 0.0 ± 0.0
Met
2.258MetAla: 2.258 ± 0.039
0.227MetCys: 0.227 ± 0.012
1.435MetAsp: 1.435 ± 0.038
1.821MetGlu: 1.821 ± 0.033
0.955MetPhe: 0.955 ± 0.029
2.036MetGly: 2.036 ± 0.038
0.461MetHis: 0.461 ± 0.019
1.941MetIle: 1.941 ± 0.039
1.812MetLys: 1.812 ± 0.032
2.581MetLeu: 2.581 ± 0.043
0.732MetMet: 0.732 ± 0.026
1.331MetAsn: 1.331 ± 0.03
1.086MetPro: 1.086 ± 0.027
0.924MetGln: 0.924 ± 0.028
1.189MetArg: 1.189 ± 0.028
1.719MetSer: 1.719 ± 0.031
1.517MetThr: 1.517 ± 0.029
1.902MetVal: 1.902 ± 0.042
0.177MetTrp: 0.177 ± 0.01
0.634MetTyr: 0.634 ± 0.019
0.0MetXaa: 0.0 ± 0.0
Asn
2.818AsnAla: 2.818 ± 0.043
0.519AsnCys: 0.519 ± 0.02
1.997AsnAsp: 1.997 ± 0.041
2.592AsnGlu: 2.592 ± 0.045
1.649AsnPhe: 1.649 ± 0.033
2.984AsnGly: 2.984 ± 0.05
0.834AsnHis: 0.834 ± 0.022
3.343AsnIle: 3.343 ± 0.048
2.602AsnLys: 2.602 ± 0.045
4.481AsnLeu: 4.481 ± 0.054
1.037AsnMet: 1.037 ± 0.025
1.966AsnAsn: 1.966 ± 0.043
2.341AsnPro: 2.341 ± 0.042
1.707AsnGln: 1.707 ± 0.033
1.94AsnArg: 1.94 ± 0.037
2.704AsnSer: 2.704 ± 0.042
2.156AsnThr: 2.156 ± 0.043
2.961AsnVal: 2.961 ± 0.041
0.495AsnTrp: 0.495 ± 0.016
1.611AsnTyr: 1.611 ± 0.034
0.0AsnXaa: 0.0 ± 0.0
Pro
2.554ProAla: 2.554 ± 0.041
0.383ProCys: 0.383 ± 0.016
2.143ProAsp: 2.143 ± 0.039
3.142ProGlu: 3.142 ± 0.052
1.743ProPhe: 1.743 ± 0.033
2.712ProGly: 2.712 ± 0.045
0.731ProHis: 0.731 ± 0.022
2.777ProIle: 2.777 ± 0.045
2.164ProLys: 2.164 ± 0.042
3.922ProLeu: 3.922 ± 0.052
0.915ProMet: 0.915 ± 0.028
1.673ProAsn: 1.673 ± 0.037
1.259ProPro: 1.259 ± 0.036
1.482ProGln: 1.482 ± 0.032
1.424ProArg: 1.424 ± 0.034
2.459ProSer: 2.459 ± 0.042
2.174ProThr: 2.174 ± 0.046
3.002ProVal: 3.002 ± 0.037
0.44ProTrp: 0.44 ± 0.018
1.297ProTyr: 1.297 ± 0.031
0.0ProXaa: 0.0 ± 0.0
Gln
3.247GlnAla: 3.247 ± 0.056
0.353GlnCys: 0.353 ± 0.015
1.823GlnAsp: 1.823 ± 0.034
2.913GlnGlu: 2.913 ± 0.047
1.311GlnPhe: 1.311 ± 0.03
2.798GlnGly: 2.798 ± 0.043
0.645GlnHis: 0.645 ± 0.019
2.792GlnIle: 2.792 ± 0.045
2.534GlnLys: 2.534 ± 0.044
3.719GlnLeu: 3.719 ± 0.052
1.03GlnMet: 1.03 ± 0.029
1.604GlnAsn: 1.604 ± 0.03
1.246GlnPro: 1.246 ± 0.033
1.551GlnGln: 1.551 ± 0.042
1.938GlnArg: 1.938 ± 0.036
2.116GlnSer: 2.116 ± 0.043
2.126GlnThr: 2.126 ± 0.041
2.778GlnVal: 2.778 ± 0.042
0.413GlnTrp: 0.413 ± 0.016
1.113GlnTyr: 1.113 ± 0.027
0.0GlnXaa: 0.0 ± 0.0
Arg
3.024ArgAla: 3.024 ± 0.05
0.529ArgCys: 0.529 ± 0.02
2.241ArgAsp: 2.241 ± 0.033
3.515ArgGlu: 3.515 ± 0.047
1.97ArgPhe: 1.97 ± 0.039
2.785ArgGly: 2.785 ± 0.042
0.914ArgHis: 0.914 ± 0.026
3.552ArgIle: 3.552 ± 0.052
3.089ArgLys: 3.089 ± 0.047
4.995ArgLeu: 4.995 ± 0.061
1.365ArgMet: 1.365 ± 0.027
1.972ArgAsn: 1.972 ± 0.034
1.55ArgPro: 1.55 ± 0.033
1.919ArgGln: 1.919 ± 0.033
2.434ArgArg: 2.434 ± 0.045
2.505ArgSer: 2.505 ± 0.039
2.323ArgThr: 2.323 ± 0.038
3.331ArgVal: 3.331 ± 0.044
0.528ArgTrp: 0.528 ± 0.018
1.622ArgTyr: 1.622 ± 0.03
0.0ArgXaa: 0.0 ± 0.0
Ser
4.236SerAla: 4.236 ± 0.065
0.716SerCys: 0.716 ± 0.022
2.966SerAsp: 2.966 ± 0.046
3.925SerGlu: 3.925 ± 0.053
2.862SerPhe: 2.862 ± 0.038
4.921SerGly: 4.921 ± 0.064
1.146SerHis: 1.146 ± 0.031
4.69SerIle: 4.69 ± 0.064
3.574SerLys: 3.574 ± 0.052
6.71SerLeu: 6.71 ± 0.082
1.694SerMet: 1.694 ± 0.038
2.525SerAsn: 2.525 ± 0.053
2.487SerPro: 2.487 ± 0.046
2.299SerGln: 2.299 ± 0.037
2.751SerArg: 2.751 ± 0.045
4.261SerSer: 4.261 ± 0.067
3.325SerThr: 3.325 ± 0.05
4.317SerVal: 4.317 ± 0.053
0.706SerTrp: 0.706 ± 0.024
2.034SerTyr: 2.034 ± 0.039
0.0SerXaa: 0.0 ± 0.0
Thr
4.133ThrAla: 4.133 ± 0.064
0.587ThrCys: 0.587 ± 0.02
2.716ThrAsp: 2.716 ± 0.047
3.251ThrGlu: 3.251 ± 0.05
2.253ThrPhe: 2.253 ± 0.039
4.44ThrGly: 4.44 ± 0.06
1.029ThrHis: 1.029 ± 0.027
4.114ThrIle: 4.114 ± 0.059
3.009ThrLys: 3.009 ± 0.055
5.74ThrLeu: 5.74 ± 0.061
1.352ThrMet: 1.352 ± 0.03
2.229ThrAsn: 2.229 ± 0.039
2.56ThrPro: 2.56 ± 0.044
1.906ThrGln: 1.906 ± 0.039
2.118ThrArg: 2.118 ± 0.041
3.398ThrSer: 3.398 ± 0.048
3.204ThrThr: 3.204 ± 0.06
4.177ThrVal: 4.177 ± 0.057
0.569ThrTrp: 0.569 ± 0.02
1.675ThrTyr: 1.675 ± 0.04
0.0ThrXaa: 0.0 ± 0.0
Val
5.459ValAla: 5.459 ± 0.063
0.871ValCys: 0.871 ± 0.025
3.673ValAsp: 3.673 ± 0.048
4.565ValGlu: 4.565 ± 0.058
3.041ValPhe: 3.041 ± 0.05
4.992ValGly: 4.992 ± 0.056
1.198ValHis: 1.198 ± 0.03
5.469ValIle: 5.469 ± 0.068
4.143ValLys: 4.143 ± 0.054
7.551ValLeu: 7.551 ± 0.076
1.969ValMet: 1.969 ± 0.038
3.037ValAsn: 3.037 ± 0.044
2.714ValPro: 2.714 ± 0.044
2.536ValGln: 2.536 ± 0.038
3.153ValArg: 3.153 ± 0.047
4.813ValSer: 4.813 ± 0.055
4.05ValThr: 4.05 ± 0.053
5.38ValVal: 5.38 ± 0.064
0.741ValTrp: 0.741 ± 0.025
2.155ValTyr: 2.155 ± 0.035
0.0ValXaa: 0.0 ± 0.0
Trp
0.732TrpAla: 0.732 ± 0.025
0.116TrpCys: 0.116 ± 0.009
0.542TrpAsp: 0.542 ± 0.021
0.671TrpGlu: 0.671 ± 0.022
0.422TrpPhe: 0.422 ± 0.015
0.846TrpGly: 0.846 ± 0.027
0.235TrpHis: 0.235 ± 0.014
0.746TrpIle: 0.746 ± 0.023
0.64TrpLys: 0.64 ± 0.021
1.193TrpLeu: 1.193 ± 0.029
0.266TrpMet: 0.266 ± 0.013
0.504TrpAsn: 0.504 ± 0.018
0.342TrpPro: 0.342 ± 0.014
0.535TrpGln: 0.535 ± 0.017
0.543TrpArg: 0.543 ± 0.019
0.666TrpSer: 0.666 ± 0.022
0.508TrpThr: 0.508 ± 0.017
0.755TrpVal: 0.755 ± 0.023
0.152TrpTrp: 0.152 ± 0.01
0.308TrpTyr: 0.308 ± 0.016
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.246TyrAla: 2.246 ± 0.035
0.484TyrCys: 0.484 ± 0.018
1.672TyrAsp: 1.672 ± 0.035
1.906TyrGlu: 1.906 ± 0.041
1.62TyrPhe: 1.62 ± 0.033
2.412TyrGly: 2.412 ± 0.042
0.688TyrHis: 0.688 ± 0.023
2.118TyrIle: 2.118 ± 0.042
1.768TyrLys: 1.768 ± 0.038
3.572TyrLeu: 3.572 ± 0.05
0.708TyrMet: 0.708 ± 0.019
1.383TyrAsn: 1.383 ± 0.028
1.438TyrPro: 1.438 ± 0.029
1.327TyrGln: 1.327 ± 0.032
1.703TyrArg: 1.703 ± 0.034
2.08TyrSer: 2.08 ± 0.038
1.669TyrThr: 1.669 ± 0.036
2.066TyrVal: 2.066 ± 0.038
0.386TyrTrp: 0.386 ± 0.016
1.299TyrTyr: 1.299 ± 0.036
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6205 proteins (1563146 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski