Amino acid dipeptide frequency for Candidatus Promineofilum breve

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random in proteins, each of the 400 would therefore be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility the dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
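As a rough illustration of the calculation described above, the Python sketch below counts adjacent residue pairs in a set of sequences, converts the counts to per mille, and compares a value with the uniform expectation of 1000/400 = 2.5‰. The function names `dipeptide_per_mille` and `classify`, the toy sequences, and the choice to count overlapping pairs within each protein are assumptions made for this example, not details taken from Proteome-pI. Xaa is left out for brevity.

```python
from collections import Counter
from itertools import product

# One-letter codes of the 20 standard amino acids (Xaa omitted in this sketch)
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Expected per-mille frequency if all 400 dipeptides were equally likely
UNIFORM_PER_MILLE = 1000.0 / len(AMINO_ACIDS) ** 2  # 2.5


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 400 standard pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found (all sequences shorter than 2)")
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(AMINO_ACIDS, repeat=2)
    }


def classify(per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if per_mille > expected:
        return "over"
    if per_mille < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    toy = ["AAL", "ALA"]  # hypothetical sequences, not from the proteome
    freqs = dipeptide_per_mille(toy)
    print(freqs["AL"], classify(freqs["AL"]))
    # Values from the table: AlaAla is above 2.5, AlaCys is below it
    print(classify(19.778), classify(0.849))
```

A per-mille value is turned into a fraction by multiplying by 10⁻³, so 19.778‰ corresponds to 0.019778, or about 1.98%.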

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 19.778 ± 0.223
AlaCys: 0.849 ± 0.026
AlaAsp: 7.273 ± 0.082
AlaGlu: 7.476 ± 0.094
AlaPhe: 3.873 ± 0.057
AlaGly: 11.409 ± 0.117
AlaHis: 2.328 ± 0.04
AlaIle: 5.863 ± 0.068
AlaLys: 1.942 ± 0.04
AlaLeu: 13.581 ± 0.132
AlaMet: 2.366 ± 0.042
AlaAsn: 3.109 ± 0.049
AlaPro: 5.888 ± 0.095
AlaGln: 3.963 ± 0.061
AlaArg: 7.955 ± 0.089
AlaSer: 3.874 ± 0.051
AlaThr: 7.324 ± 0.102
AlaVal: 9.311 ± 0.101
AlaTrp: 1.769 ± 0.044
AlaTyr: 3.171 ± 0.052
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.765 ± 0.026
CysCys: 0.083 ± 0.008
CysAsp: 0.434 ± 0.02
CysGlu: 0.289 ± 0.013
CysPhe: 0.224 ± 0.013
CysGly: 0.739 ± 0.024
CysHis: 0.247 ± 0.017
CysIle: 0.263 ± 0.014
CysLys: 0.113 ± 0.01
CysLeu: 0.675 ± 0.02
CysMet: 0.107 ± 0.008
CysAsn: 0.195 ± 0.012
CysPro: 0.485 ± 0.021
CysGln: 0.189 ± 0.011
CysArg: 0.568 ± 0.02
CysSer: 0.362 ± 0.018
CysThr: 0.345 ± 0.019
CysVal: 0.458 ± 0.019
CysTrp: 0.116 ± 0.011
CysTyr: 0.199 ± 0.012
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.192 ± 0.074
AspCys: 0.438 ± 0.022
AspAsp: 3.746 ± 0.07
AspGlu: 4.161 ± 0.06
AspPhe: 2.227 ± 0.044
AspGly: 5.787 ± 0.093
AspHis: 1.18 ± 0.032
AspIle: 2.632 ± 0.052
AspLys: 1.328 ± 0.032
AspLeu: 5.835 ± 0.078
AspMet: 1.18 ± 0.03
AspAsn: 1.758 ± 0.055
AspPro: 3.679 ± 0.061
AspGln: 1.643 ± 0.034
AspArg: 3.646 ± 0.049
AspSer: 2.528 ± 0.046
AspThr: 2.873 ± 0.048
AspVal: 4.483 ± 0.056
AspTrp: 1.075 ± 0.027
AspTyr: 1.887 ± 0.04
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.015 ± 0.093
GluCys: 0.311 ± 0.015
GluAsp: 2.366 ± 0.042
GluGlu: 3.594 ± 0.066
GluPhe: 1.871 ± 0.041
GluGly: 3.942 ± 0.059
GluHis: 1.052 ± 0.032
GluIle: 3.053 ± 0.047
GluLys: 1.489 ± 0.038
GluLeu: 6.048 ± 0.079
GluMet: 1.776 ± 0.031
GluAsn: 1.522 ± 0.036
GluPro: 3.042 ± 0.044
GluGln: 2.535 ± 0.042
GluArg: 4.635 ± 0.069
GluSer: 2.637 ± 0.046
GluThr: 3.36 ± 0.056
GluVal: 3.918 ± 0.06
GluTrp: 1.015 ± 0.026
GluTyr: 1.338 ± 0.03
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.831 ± 0.05
PheCys: 0.276 ± 0.015
PheAsp: 2.534 ± 0.043
PheGlu: 1.736 ± 0.033
PhePhe: 1.48 ± 0.034
PheGly: 3.185 ± 0.049
PheHis: 0.76 ± 0.023
PheIle: 1.874 ± 0.041
PheLys: 0.59 ± 0.023
PheLeu: 3.62 ± 0.057
PheMet: 0.765 ± 0.024
PheAsn: 1.312 ± 0.035
PhePro: 1.601 ± 0.036
PheGln: 1.106 ± 0.03
PheArg: 2.089 ± 0.04
PheSer: 2.012 ± 0.038
PheThr: 2.136 ± 0.046
PheVal: 2.731 ± 0.043
PheTrp: 0.577 ± 0.018
PheTyr: 1.106 ± 0.027
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.708 ± 0.102
GlyCys: 0.701 ± 0.024
GlyAsp: 5.288 ± 0.09
GlyGlu: 5.188 ± 0.074
GlyPhe: 2.955 ± 0.043
GlyGly: 7.935 ± 0.115
GlyHis: 1.836 ± 0.037
GlyIle: 4.082 ± 0.054
GlyLys: 2.022 ± 0.04
GlyLeu: 9.379 ± 0.102
GlyMet: 1.906 ± 0.035
GlyAsn: 2.505 ± 0.054
GlyPro: 3.726 ± 0.055
GlyGln: 3.649 ± 0.057
GlyArg: 7.189 ± 0.093
GlySer: 4.147 ± 0.064
GlyThr: 4.082 ± 0.079
GlyVal: 6.201 ± 0.076
GlyTrp: 1.656 ± 0.04
GlyTyr: 2.607 ± 0.041
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.969 ± 0.037
HisCys: 0.213 ± 0.012
HisAsp: 1.195 ± 0.029
HisGlu: 1.104 ± 0.03
HisPhe: 0.825 ± 0.023
HisGly: 1.657 ± 0.034
HisHis: 0.619 ± 0.021
HisIle: 0.99 ± 0.026
HisLys: 0.42 ± 0.018
HisLeu: 2.194 ± 0.041
HisMet: 0.413 ± 0.015
HisAsn: 0.643 ± 0.017
HisPro: 1.306 ± 0.032
HisGln: 0.593 ± 0.021
HisArg: 1.332 ± 0.031
HisSer: 0.952 ± 0.026
HisThr: 1.088 ± 0.029
HisVal: 1.464 ± 0.033
HisTrp: 0.347 ± 0.017
HisTyr: 0.767 ± 0.021
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.891 ± 0.07
IleCys: 0.372 ± 0.019
IleAsp: 3.705 ± 0.052
IleGlu: 2.997 ± 0.049
IlePhe: 1.692 ± 0.033
IleGly: 4.254 ± 0.051
IleHis: 1.097 ± 0.028
IleIle: 2.75 ± 0.041
IleLys: 1.049 ± 0.03
IleLeu: 4.807 ± 0.071
IleMet: 0.868 ± 0.021
IleAsn: 1.539 ± 0.033
IlePro: 2.598 ± 0.045
IleGln: 1.406 ± 0.033
IleArg: 2.994 ± 0.054
IleSer: 2.48 ± 0.042
IleThr: 3.02 ± 0.055
IleVal: 4.075 ± 0.061
IleTrp: 0.64 ± 0.022
IleTyr: 1.446 ± 0.034
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.162 ± 0.049
LysCys: 0.106 ± 0.008
LysAsp: 0.992 ± 0.027
LysGlu: 1.185 ± 0.032
LysPhe: 0.647 ± 0.021
LysGly: 1.57 ± 0.043
LysHis: 0.402 ± 0.017
LysIle: 1.028 ± 0.03
LysLys: 0.787 ± 0.028
LysLeu: 2.104 ± 0.042
LysMet: 0.549 ± 0.02
LysAsn: 0.66 ± 0.02
LysPro: 1.229 ± 0.033
LysGln: 0.785 ± 0.023
LysArg: 1.59 ± 0.036
LysSer: 1.148 ± 0.026
LysThr: 1.253 ± 0.033
LysVal: 1.507 ± 0.033
LysTrp: 0.329 ± 0.014
LysTyr: 0.587 ± 0.021
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 14.423 ± 0.142
LeuCys: 0.756 ± 0.029
LeuAsp: 6.023 ± 0.075
LeuGlu: 5.176 ± 0.072
LeuPhe: 3.871 ± 0.055
LeuGly: 8.667 ± 0.095
LeuHis: 2.005 ± 0.037
LeuIle: 5.586 ± 0.075
LeuLys: 2.163 ± 0.051
LeuLeu: 12.631 ± 0.149
LeuMet: 2.18 ± 0.044
LeuAsn: 2.959 ± 0.048
LeuPro: 6.772 ± 0.076
LeuGln: 2.957 ± 0.045
LeuArg: 7.438 ± 0.076
LeuSer: 6.007 ± 0.081
LeuThr: 6.507 ± 0.082
LeuVal: 7.626 ± 0.078
LeuTrp: 1.577 ± 0.039
LeuTyr: 2.828 ± 0.047
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.737 ± 0.04
MetCys: 0.099 ± 0.008
MetAsp: 1.027 ± 0.027
MetGlu: 1.027 ± 0.026
MetPhe: 0.535 ± 0.02
MetGly: 1.706 ± 0.04
MetHis: 0.322 ± 0.014
MetIle: 1.082 ± 0.03
MetLys: 0.681 ± 0.023
MetLeu: 2.025 ± 0.037
MetMet: 0.538 ± 0.019
MetAsn: 0.838 ± 0.024
MetPro: 1.223 ± 0.03
MetGln: 0.664 ± 0.021
MetArg: 1.485 ± 0.03
MetSer: 1.347 ± 0.025
MetThr: 1.635 ± 0.026
MetVal: 1.452 ± 0.034
MetTrp: 0.244 ± 0.013
MetTyr: 0.349 ± 0.018
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.789 ± 0.053
AsnCys: 0.237 ± 0.013
AsnAsp: 1.683 ± 0.048
AsnGlu: 1.58 ± 0.03
AsnPhe: 1.014 ± 0.028
AsnGly: 2.84 ± 0.059
AsnHis: 0.577 ± 0.022
AsnIle: 1.365 ± 0.037
AsnLys: 0.665 ± 0.022
AsnLeu: 2.983 ± 0.053
AsnMet: 0.647 ± 0.019
AsnAsn: 1.127 ± 0.041
AsnPro: 2.138 ± 0.041
AsnGln: 0.983 ± 0.023
AsnArg: 1.946 ± 0.036
AsnSer: 1.409 ± 0.039
AsnThr: 1.506 ± 0.045
AsnVal: 2.263 ± 0.042
AsnTrp: 0.574 ± 0.023
AsnTyr: 0.91 ± 0.024
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.993 ± 0.098
ProCys: 0.268 ± 0.014
ProAsp: 4.102 ± 0.058
ProGlu: 3.437 ± 0.054
ProPhe: 2.051 ± 0.035
ProGly: 5.19 ± 0.066
ProHis: 1.061 ± 0.029
ProIle: 2.571 ± 0.047
ProLys: 0.891 ± 0.027
ProLeu: 5.759 ± 0.07
ProMet: 0.977 ± 0.025
ProAsn: 1.647 ± 0.041
ProPro: 4.283 ± 0.089
ProGln: 1.829 ± 0.037
ProArg: 3.151 ± 0.056
ProSer: 2.669 ± 0.042
ProThr: 4.041 ± 0.081
ProVal: 4.216 ± 0.06
ProTrp: 0.803 ± 0.024
ProTyr: 1.498 ± 0.037
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.261 ± 0.068
GlnCys: 0.187 ± 0.011
GlnAsp: 1.377 ± 0.027
GlnGlu: 1.677 ± 0.037
GlnPhe: 1.194 ± 0.029
GlnGly: 2.287 ± 0.037
GlnHis: 0.586 ± 0.019
GlnIle: 1.778 ± 0.036
GlnLys: 0.76 ± 0.025
GlnLeu: 3.733 ± 0.059
GlnMet: 0.847 ± 0.024
GlnAsn: 0.853 ± 0.022
GlnPro: 2.489 ± 0.046
GlnGln: 1.443 ± 0.039
GlnArg: 2.603 ± 0.042
GlnSer: 1.692 ± 0.032
GlnThr: 2.172 ± 0.041
GlnVal: 2.442 ± 0.039
GlnTrp: 0.624 ± 0.021
GlnTyr: 0.889 ± 0.027
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.831 ± 0.099
ArgCys: 0.414 ± 0.018
ArgAsp: 3.814 ± 0.052
ArgGlu: 3.979 ± 0.059
ArgPhe: 2.609 ± 0.04
ArgGly: 4.962 ± 0.073
ArgHis: 1.683 ± 0.037
ArgIle: 3.108 ± 0.044
ArgLys: 1.365 ± 0.03
ArgLeu: 8.71 ± 0.11
ArgMet: 1.398 ± 0.031
ArgAsn: 1.724 ± 0.036
ArgPro: 4.578 ± 0.064
ArgGln: 3.112 ± 0.049
ArgArg: 6.513 ± 0.09
ArgSer: 2.644 ± 0.047
ArgThr: 2.82 ± 0.045
ArgVal: 5.146 ± 0.066
ArgTrp: 1.226 ± 0.032
ArgTyr: 2.094 ± 0.04
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.193 ± 0.065
SerCys: 0.313 ± 0.018
SerAsp: 2.724 ± 0.051
SerGlu: 2.296 ± 0.042
SerPhe: 1.868 ± 0.035
SerGly: 4.94 ± 0.077
SerHis: 0.979 ± 0.026
SerIle: 2.268 ± 0.045
SerLys: 0.957 ± 0.026
SerLeu: 5.098 ± 0.064
SerMet: 0.95 ± 0.024
SerAsn: 1.357 ± 0.038
SerPro: 3.234 ± 0.053
SerGln: 1.495 ± 0.032
SerArg: 3.106 ± 0.048
SerSer: 2.379 ± 0.05
SerThr: 2.444 ± 0.049
SerVal: 3.162 ± 0.05
SerTrp: 0.748 ± 0.022
SerTyr: 1.329 ± 0.029
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.356 ± 0.095
ThrCys: 0.366 ± 0.016
ThrAsp: 2.967 ± 0.062
ThrGlu: 2.65 ± 0.044
ThrPhe: 2.224 ± 0.054
ThrGly: 5.008 ± 0.072
ThrHis: 1.027 ± 0.025
ThrIle: 3.416 ± 0.055
ThrLys: 1.03 ± 0.031
ThrLeu: 6.392 ± 0.073
ThrMet: 1.115 ± 0.032
ThrAsn: 1.698 ± 0.037
ThrPro: 4.144 ± 0.081
ThrGln: 1.62 ± 0.033
ThrArg: 3.116 ± 0.046
ThrSer: 2.607 ± 0.053
ThrThr: 3.718 ± 0.073
ThrVal: 4.571 ± 0.073
ThrTrp: 0.835 ± 0.026
ThrTyr: 1.535 ± 0.036
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.749 ± 0.103
ValCys: 0.528 ± 0.02
ValAsp: 4.219 ± 0.057
ValGlu: 4.475 ± 0.065
ValPhe: 2.47 ± 0.04
ValGly: 6.195 ± 0.073
ValHis: 1.323 ± 0.029
ValIle: 4.264 ± 0.052
ValLys: 1.541 ± 0.04
ValLeu: 7.198 ± 0.092
ValMet: 1.675 ± 0.03
ValAsn: 2.365 ± 0.05
ValPro: 3.56 ± 0.051
ValGln: 2.048 ± 0.036
ValArg: 4.609 ± 0.063
ValSer: 3.701 ± 0.049
ValThr: 4.753 ± 0.077
ValVal: 6.554 ± 0.085
ValTrp: 1.226 ± 0.028
ValTyr: 2.185 ± 0.04
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.677 ± 0.035
TrpCys: 0.097 ± 0.008
TrpAsp: 0.856 ± 0.027
TrpGlu: 0.791 ± 0.024
TrpPhe: 0.556 ± 0.022
TrpGly: 1.151 ± 0.031
TrpHis: 0.319 ± 0.014
TrpIle: 0.602 ± 0.021
TrpLys: 0.265 ± 0.013
TrpLeu: 2.31 ± 0.047
TrpMet: 0.28 ± 0.014
TrpAsn: 0.513 ± 0.023
TrpPro: 0.961 ± 0.029
TrpGln: 0.877 ± 0.029
TrpArg: 1.511 ± 0.037
TrpSer: 0.888 ± 0.03
TrpThr: 0.775 ± 0.024
TrpVal: 1.063 ± 0.027
TrpTrp: 0.34 ± 0.019
TrpTyr: 0.434 ± 0.017
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.782 ± 0.047
TyrCys: 0.244 ± 0.014
TyrAsp: 1.952 ± 0.036
TyrGlu: 1.698 ± 0.036
TyrPhe: 1.138 ± 0.032
TyrGly: 2.338 ± 0.042
TyrHis: 0.688 ± 0.024
TyrIle: 1.104 ± 0.028
TyrLys: 0.562 ± 0.021
TyrLeu: 3.07 ± 0.051
TyrMet: 0.477 ± 0.017
TyrAsn: 0.912 ± 0.027
TyrPro: 1.506 ± 0.032
TyrGln: 0.997 ± 0.029
TyrArg: 2.254 ± 0.04
TyrSer: 1.355 ± 0.032
TyrThr: 1.546 ± 0.037
TyrVal: 1.995 ± 0.042
TyrTrp: 0.543 ± 0.019
TyrTyr: 0.997 ± 0.029
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4566 proteins (1531040 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
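One way such a protein-level bootstrap could be carried out is sketched below in Python: whole proteins are resampled with replacement 100 times, the per-mille frequency of a dipeptide is recomputed for each resample, and the spread of those estimates is reported. The function names `pair_counts` and `bootstrap_error`, the fixed seed, and the use of the standard deviation as the reported error are assumptions made for this illustration, not a description of the Proteome-pI code.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per-mille
    frequency over bootstrap resamples drawn at the protein level."""
    rng = random.Random(seed)
    # Count once per protein so each resample only has to add counts up
    per_protein = [pair_counts(seq) for seq in proteins]
    totals = [sum(counts.values()) for counts in per_protein]
    indices = range(len(proteins))

    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the sample size
        sample = rng.choices(indices, k=len(proteins))
        total = sum(totals[i] for i in sample)
        hits = sum(per_protein[i][dipeptide] for i in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MAALLA", "MALAVG", "MGGAAL", "MLLAAR"]  # hypothetical sequences
    mean, err = bootstrap_error(toy, "AA")
    print(f"AlaAla: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual dipeptides keeps pairs from the same sequence together, so the estimate reflects variation between proteins.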

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski