Amino acid dipepetide frequency for Tamlana nanhaiensis

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.136AlaAla: 4.136 ± 0.087
0.609AlaCys: 0.609 ± 0.023
3.227AlaAsp: 3.227 ± 0.064
4.253AlaGlu: 4.253 ± 0.083
3.663AlaPhe: 3.663 ± 0.06
3.983AlaGly: 3.983 ± 0.073
1.105AlaHis: 1.105 ± 0.034
5.408AlaIle: 5.408 ± 0.076
4.982AlaLys: 4.982 ± 0.08
6.7AlaLeu: 6.7 ± 0.095
1.47AlaMet: 1.47 ± 0.037
4.084AlaAsn: 4.084 ± 0.07
1.907AlaPro: 1.907 ± 0.051
2.504AlaGln: 2.504 ± 0.047
1.865AlaArg: 1.865 ± 0.044
4.604AlaSer: 4.604 ± 0.079
3.868AlaThr: 3.868 ± 0.087
4.307AlaVal: 4.307 ± 0.07
0.654AlaTrp: 0.654 ± 0.027
2.712AlaTyr: 2.712 ± 0.054
0.0AlaXaa: 0.0 ± 0.0
Cys
0.467CysAla: 0.467 ± 0.022
0.102CysCys: 0.102 ± 0.01
0.47CysAsp: 0.47 ± 0.026
0.472CysGlu: 0.472 ± 0.022
0.407CysPhe: 0.407 ± 0.019
0.574CysGly: 0.574 ± 0.028
0.178CysHis: 0.178 ± 0.014
0.588CysIle: 0.588 ± 0.026
0.549CysLys: 0.549 ± 0.024
0.681CysLeu: 0.681 ± 0.023
0.141CysMet: 0.141 ± 0.013
0.467CysAsn: 0.467 ± 0.021
0.286CysPro: 0.286 ± 0.02
0.183CysGln: 0.183 ± 0.015
0.165CysArg: 0.165 ± 0.013
0.542CysSer: 0.542 ± 0.025
0.406CysThr: 0.406 ± 0.027
0.45CysVal: 0.45 ± 0.019
0.076CysTrp: 0.076 ± 0.008
0.336CysTyr: 0.336 ± 0.02
0.0CysXaa: 0.0 ± 0.0
Asp
4.256AspAla: 4.256 ± 0.072
0.405AspCys: 0.405 ± 0.02
3.486AspAsp: 3.486 ± 0.07
3.679AspGlu: 3.679 ± 0.068
3.678AspPhe: 3.678 ± 0.057
3.754AspGly: 3.754 ± 0.082
0.794AspHis: 0.794 ± 0.027
4.457AspIle: 4.457 ± 0.061
3.839AspLys: 3.839 ± 0.071
5.13AspLeu: 5.13 ± 0.076
1.075AspMet: 1.075 ± 0.031
3.35AspAsn: 3.35 ± 0.063
1.447AspPro: 1.447 ± 0.042
1.236AspGln: 1.236 ± 0.034
1.6AspArg: 1.6 ± 0.035
3.103AspSer: 3.103 ± 0.064
2.844AspThr: 2.844 ± 0.058
4.068AspVal: 4.068 ± 0.063
0.764AspTrp: 0.764 ± 0.026
2.924AspTyr: 2.924 ± 0.06
0.0AspXaa: 0.0 ± 0.0
Glu
5.3GluAla: 5.3 ± 0.088
0.283GluCys: 0.283 ± 0.019
3.716GluAsp: 3.716 ± 0.061
4.011GluGlu: 4.011 ± 0.08
2.873GluPhe: 2.873 ± 0.054
3.491GluGly: 3.491 ± 0.067
1.156GluHis: 1.156 ± 0.029
4.973GluIle: 4.973 ± 0.076
5.126GluLys: 5.126 ± 0.088
5.579GluLeu: 5.579 ± 0.082
1.269GluMet: 1.269 ± 0.036
4.821GluAsn: 4.821 ± 0.078
1.577GluPro: 1.577 ± 0.042
2.199GluGln: 2.199 ± 0.048
2.227GluArg: 2.227 ± 0.053
3.324GluSer: 3.324 ± 0.055
4.449GluThr: 4.449 ± 0.068
4.265GluVal: 4.265 ± 0.068
0.562GluTrp: 0.562 ± 0.024
2.232GluTyr: 2.232 ± 0.045
0.0GluXaa: 0.0 ± 0.0
Phe
3.022PheAla: 3.022 ± 0.065
0.469PheCys: 0.469 ± 0.023
3.175PheAsp: 3.175 ± 0.055
3.355PheGlu: 3.355 ± 0.064
2.584PhePhe: 2.584 ± 0.067
3.6PheGly: 3.6 ± 0.067
0.75PheHis: 0.75 ± 0.029
3.866PheIle: 3.866 ± 0.069
4.368PheLys: 4.368 ± 0.075
4.682PheLeu: 4.682 ± 0.081
1.091PheMet: 1.091 ± 0.035
4.001PheAsn: 4.001 ± 0.066
1.638PhePro: 1.638 ± 0.039
1.397PheGln: 1.397 ± 0.04
1.461PheArg: 1.461 ± 0.035
4.056PheSer: 4.056 ± 0.065
3.3PheThr: 3.3 ± 0.055
3.069PheVal: 3.069 ± 0.057
0.609PheTrp: 0.609 ± 0.027
2.261PheTyr: 2.261 ± 0.049
0.0PheXaa: 0.0 ± 0.0
Gly
4.172GlyAla: 4.172 ± 0.077
0.603GlyCys: 0.603 ± 0.04
3.529GlyAsp: 3.529 ± 0.066
3.679GlyGlu: 3.679 ± 0.061
3.631GlyPhe: 3.631 ± 0.059
4.455GlyGly: 4.455 ± 0.098
1.147GlyHis: 1.147 ± 0.034
4.985GlyIle: 4.985 ± 0.075
4.651GlyLys: 4.651 ± 0.07
5.708GlyLeu: 5.708 ± 0.082
1.495GlyMet: 1.495 ± 0.034
3.74GlyAsn: 3.74 ± 0.068
1.342GlyPro: 1.342 ± 0.038
1.716GlyGln: 1.716 ± 0.043
1.919GlyArg: 1.919 ± 0.049
3.894GlySer: 3.894 ± 0.073
3.872GlyThr: 3.872 ± 0.085
4.491GlyVal: 4.491 ± 0.068
0.802GlyTrp: 0.802 ± 0.029
2.811GlyTyr: 2.811 ± 0.058
0.0GlyXaa: 0.0 ± 0.0
His
1.028HisAla: 1.028 ± 0.03
0.19HisCys: 0.19 ± 0.013
0.921HisAsp: 0.921 ± 0.033
0.95HisGlu: 0.95 ± 0.03
1.175HisPhe: 1.175 ± 0.034
1.089HisGly: 1.089 ± 0.037
0.451HisHis: 0.451 ± 0.023
1.406HisIle: 1.406 ± 0.039
1.248HisLys: 1.248 ± 0.036
1.747HisLeu: 1.747 ± 0.049
0.327HisMet: 0.327 ± 0.017
1.076HisAsn: 1.076 ± 0.027
0.845HisPro: 0.845 ± 0.03
0.651HisGln: 0.651 ± 0.026
0.589HisArg: 0.589 ± 0.023
0.939HisSer: 0.939 ± 0.031
0.929HisThr: 0.929 ± 0.029
1.018HisVal: 1.018 ± 0.03
0.245HisTrp: 0.245 ± 0.015
0.844HisTyr: 0.844 ± 0.028
0.0HisXaa: 0.0 ± 0.0
Ile
5.386IleAla: 5.386 ± 0.084
0.602IleCys: 0.602 ± 0.025
4.866IleAsp: 4.866 ± 0.075
5.527IleGlu: 5.527 ± 0.091
3.508IlePhe: 3.508 ± 0.078
4.848IleGly: 4.848 ± 0.084
1.247IleHis: 1.247 ± 0.033
5.847IleIle: 5.847 ± 0.098
5.915IleLys: 5.915 ± 0.087
6.584IleLeu: 6.584 ± 0.104
1.359IleMet: 1.359 ± 0.04
5.242IleAsn: 5.242 ± 0.083
3.021IlePro: 3.021 ± 0.056
2.267IleGln: 2.267 ± 0.054
2.158IleArg: 2.158 ± 0.043
5.51IleSer: 5.51 ± 0.071
5.054IleThr: 5.054 ± 0.08
4.662IleVal: 4.662 ± 0.069
0.751IleTrp: 0.751 ± 0.028
2.856IleTyr: 2.856 ± 0.053
0.0IleXaa: 0.0 ± 0.0
Lys
5.402LysAla: 5.402 ± 0.083
0.335LysCys: 0.335 ± 0.017
4.158LysAsp: 4.158 ± 0.069
4.889LysGlu: 4.889 ± 0.081
3.014LysPhe: 3.014 ± 0.061
4.246LysGly: 4.246 ± 0.076
1.669LysHis: 1.669 ± 0.047
5.876LysIle: 5.876 ± 0.087
6.282LysLys: 6.282 ± 0.1
6.849LysLeu: 6.849 ± 0.103
1.756LysMet: 1.756 ± 0.044
5.687LysAsn: 5.687 ± 0.081
2.816LysPro: 2.816 ± 0.052
3.24LysGln: 3.24 ± 0.062
2.846LysArg: 2.846 ± 0.056
4.709LysSer: 4.709 ± 0.069
5.463LysThr: 5.463 ± 0.082
4.58LysVal: 4.58 ± 0.076
0.762LysTrp: 0.762 ± 0.029
3.003LysTyr: 3.003 ± 0.056
0.0LysXaa: 0.0 ± 0.0
Leu
5.748LeuAla: 5.748 ± 0.077
0.699LeuCys: 0.699 ± 0.028
4.765LeuAsp: 4.765 ± 0.073
5.749LeuGlu: 5.749 ± 0.083
4.722LeuPhe: 4.722 ± 0.082
5.677LeuGly: 5.677 ± 0.097
1.542LeuHis: 1.542 ± 0.045
6.792LeuIle: 6.792 ± 0.104
8.186LeuLys: 8.186 ± 0.113
8.046LeuLeu: 8.046 ± 0.112
1.928LeuMet: 1.928 ± 0.043
6.752LeuAsn: 6.752 ± 0.097
3.455LeuPro: 3.455 ± 0.055
3.37LeuGln: 3.37 ± 0.061
2.935LeuArg: 2.935 ± 0.056
6.478LeuSer: 6.478 ± 0.079
5.258LeuThr: 5.258 ± 0.076
5.516LeuVal: 5.516 ± 0.084
0.859LeuTrp: 0.859 ± 0.03
3.132LeuTyr: 3.132 ± 0.057
0.0LeuXaa: 0.0 ± 0.0
Met
1.616MetAla: 1.616 ± 0.038
0.132MetCys: 0.132 ± 0.011
0.932MetAsp: 0.932 ± 0.03
1.118MetGlu: 1.118 ± 0.032
0.92MetPhe: 0.92 ± 0.034
1.243MetGly: 1.243 ± 0.037
0.397MetHis: 0.397 ± 0.019
1.309MetIle: 1.309 ± 0.037
1.836MetLys: 1.836 ± 0.038
1.97MetLeu: 1.97 ± 0.041
0.499MetMet: 0.499 ± 0.024
1.117MetAsn: 1.117 ± 0.032
0.902MetPro: 0.902 ± 0.03
0.887MetGln: 0.887 ± 0.029
0.809MetArg: 0.809 ± 0.025
1.323MetSer: 1.323 ± 0.033
0.998MetThr: 0.998 ± 0.038
1.328MetVal: 1.328 ± 0.034
0.171MetTrp: 0.171 ± 0.014
0.74MetTyr: 0.74 ± 0.026
0.0MetXaa: 0.0 ± 0.0
Asn
4.408AsnAla: 4.408 ± 0.078
0.504AsnCys: 0.504 ± 0.023
3.839AsnAsp: 3.839 ± 0.061
4.076AsnGlu: 4.076 ± 0.067
3.368AsnPhe: 3.368 ± 0.059
4.297AsnGly: 4.297 ± 0.092
1.203AsnHis: 1.203 ± 0.037
5.397AsnIle: 5.397 ± 0.077
4.821AsnLys: 4.821 ± 0.075
6.084AsnLeu: 6.084 ± 0.088
1.323AsnMet: 1.323 ± 0.031
4.89AsnAsn: 4.89 ± 0.088
2.839AsnPro: 2.839 ± 0.053
2.47AsnGln: 2.47 ± 0.052
2.248AsnArg: 2.248 ± 0.047
4.022AsnSer: 4.022 ± 0.07
4.423AsnThr: 4.423 ± 0.079
3.928AsnVal: 3.928 ± 0.067
0.822AsnTrp: 0.822 ± 0.031
3.321AsnTyr: 3.321 ± 0.073
0.0AsnXaa: 0.0 ± 0.0
Pro
1.775ProAla: 1.775 ± 0.046
0.194ProCys: 0.194 ± 0.012
2.073ProAsp: 2.073 ± 0.041
3.03ProGlu: 3.03 ± 0.058
1.903ProPhe: 1.903 ± 0.045
1.813ProGly: 1.813 ± 0.05
0.624ProHis: 0.624 ± 0.026
2.591ProIle: 2.591 ± 0.05
2.7ProLys: 2.7 ± 0.05
2.801ProLeu: 2.801 ± 0.06
0.652ProMet: 0.652 ± 0.028
2.375ProAsn: 2.375 ± 0.052
0.751ProPro: 0.751 ± 0.029
1.084ProGln: 1.084 ± 0.031
0.882ProArg: 0.882 ± 0.031
2.045ProSer: 2.045 ± 0.042
1.986ProThr: 1.986 ± 0.054
2.23ProVal: 2.23 ± 0.046
0.395ProTrp: 0.395 ± 0.022
1.435ProTyr: 1.435 ± 0.04
0.0ProXaa: 0.0 ± 0.0
Gln
2.107GlnAla: 2.107 ± 0.05
0.185GlnCys: 0.185 ± 0.013
1.742GlnAsp: 1.742 ± 0.041
2.125GlnGlu: 2.125 ± 0.044
1.73GlnPhe: 1.73 ± 0.043
1.822GlnGly: 1.822 ± 0.042
0.627GlnHis: 0.627 ± 0.024
2.56GlnIle: 2.56 ± 0.047
2.582GlnLys: 2.582 ± 0.055
3.49GlnLeu: 3.49 ± 0.062
0.704GlnMet: 0.704 ± 0.029
2.449GlnAsn: 2.449 ± 0.059
1.139GlnPro: 1.139 ± 0.037
1.393GlnGln: 1.393 ± 0.044
1.184GlnArg: 1.184 ± 0.034
1.839GlnSer: 1.839 ± 0.046
1.976GlnThr: 1.976 ± 0.048
1.922GlnVal: 1.922 ± 0.042
0.385GlnTrp: 0.385 ± 0.019
1.345GlnTyr: 1.345 ± 0.039
0.0GlnXaa: 0.0 ± 0.0
Arg
2.013ArgAla: 2.013 ± 0.042
0.194ArgCys: 0.194 ± 0.012
1.715ArgAsp: 1.715 ± 0.043
1.873ArgGlu: 1.873 ± 0.044
1.789ArgPhe: 1.789 ± 0.036
1.909ArgGly: 1.909 ± 0.042
0.588ArgHis: 0.588 ± 0.023
2.565ArgIle: 2.565 ± 0.049
2.368ArgLys: 2.368 ± 0.057
3.023ArgLeu: 3.023 ± 0.06
0.723ArgMet: 0.723 ± 0.027
2.021ArgAsn: 2.021 ± 0.044
0.991ArgPro: 0.991 ± 0.033
0.999ArgGln: 0.999 ± 0.032
1.164ArgArg: 1.164 ± 0.035
1.725ArgSer: 1.725 ± 0.043
1.748ArgThr: 1.748 ± 0.039
2.063ArgVal: 2.063 ± 0.042
0.414ArgTrp: 0.414 ± 0.023
1.463ArgTyr: 1.463 ± 0.032
0.0ArgXaa: 0.0 ± 0.0
Ser
3.882SerAla: 3.882 ± 0.069
0.645SerCys: 0.645 ± 0.03
3.359SerAsp: 3.359 ± 0.066
4.019SerGlu: 4.019 ± 0.071
3.716SerPhe: 3.716 ± 0.063
4.593SerGly: 4.593 ± 0.068
1.105SerHis: 1.105 ± 0.031
5.122SerIle: 5.122 ± 0.071
4.891SerLys: 4.891 ± 0.075
5.714SerLeu: 5.714 ± 0.078
1.234SerMet: 1.234 ± 0.033
4.184SerAsn: 4.184 ± 0.079
1.967SerPro: 1.967 ± 0.046
2.079SerGln: 2.079 ± 0.044
1.943SerArg: 1.943 ± 0.045
4.0SerSer: 4.0 ± 0.07
3.682SerThr: 3.682 ± 0.07
4.288SerVal: 4.288 ± 0.066
0.719SerTrp: 0.719 ± 0.03
2.751SerTyr: 2.751 ± 0.051
0.0SerXaa: 0.0 ± 0.0
Thr
3.871ThrAla: 3.871 ± 0.081
0.389ThrCys: 0.389 ± 0.023
3.359ThrAsp: 3.359 ± 0.064
3.716ThrGlu: 3.716 ± 0.05
3.309ThrPhe: 3.309 ± 0.065
4.019ThrGly: 4.019 ± 0.079
1.073ThrHis: 1.073 ± 0.029
5.099ThrIle: 5.099 ± 0.08
4.252ThrLys: 4.252 ± 0.073
5.828ThrLeu: 5.828 ± 0.083
0.938ThrMet: 0.938 ± 0.028
3.966ThrAsn: 3.966 ± 0.08
2.612ThrPro: 2.612 ± 0.058
1.959ThrGln: 1.959 ± 0.046
1.572ThrArg: 1.572 ± 0.04
4.017ThrSer: 4.017 ± 0.068
3.764ThrThr: 3.764 ± 0.084
4.036ThrVal: 4.036 ± 0.088
0.67ThrTrp: 0.67 ± 0.027
2.62ThrTyr: 2.62 ± 0.069
0.0ThrXaa: 0.0 ± 0.0
Val
4.248ValAla: 4.248 ± 0.065
0.563ValCys: 0.563 ± 0.022
3.664ValAsp: 3.664 ± 0.072
4.0ValGlu: 4.0 ± 0.068
3.603ValPhe: 3.603 ± 0.059
3.83ValGly: 3.83 ± 0.073
0.919ValHis: 0.919 ± 0.032
4.923ValIle: 4.923 ± 0.074
4.822ValLys: 4.822 ± 0.072
6.191ValLeu: 6.191 ± 0.081
1.266ValMet: 1.266 ± 0.034
4.077ValAsn: 4.077 ± 0.066
2.034ValPro: 2.034 ± 0.042
1.584ValGln: 1.584 ± 0.039
1.822ValArg: 1.822 ± 0.048
4.531ValSer: 4.531 ± 0.064
3.962ValThr: 3.962 ± 0.088
4.377ValVal: 4.377 ± 0.073
0.639ValTrp: 0.639 ± 0.025
2.502ValTyr: 2.502 ± 0.046
0.0ValXaa: 0.0 ± 0.0
Trp
0.662TrpAla: 0.662 ± 0.027
0.114TrpCys: 0.114 ± 0.01
0.66TrpAsp: 0.66 ± 0.028
0.73TrpGlu: 0.73 ± 0.03
0.624TrpPhe: 0.624 ± 0.025
0.714TrpGly: 0.714 ± 0.028
0.244TrpHis: 0.244 ± 0.015
0.677TrpIle: 0.677 ± 0.024
0.787TrpLys: 0.787 ± 0.028
1.013TrpLeu: 1.013 ± 0.034
0.296TrpMet: 0.296 ± 0.017
0.725TrpAsn: 0.725 ± 0.03
0.267TrpPro: 0.267 ± 0.017
0.487TrpGln: 0.487 ± 0.022
0.452TrpArg: 0.452 ± 0.023
0.652TrpSer: 0.652 ± 0.023
0.589TrpThr: 0.589 ± 0.03
0.655TrpVal: 0.655 ± 0.027
0.176TrpTrp: 0.176 ± 0.015
0.467TrpTyr: 0.467 ± 0.021
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.541TyrAla: 2.541 ± 0.051
0.322TyrCys: 0.322 ± 0.019
2.231TyrAsp: 2.231 ± 0.04
2.183TyrGlu: 2.183 ± 0.045
2.537TyrPhe: 2.537 ± 0.052
2.683TyrGly: 2.683 ± 0.054
0.851TyrHis: 0.851 ± 0.028
2.769TyrIle: 2.769 ± 0.047
3.378TyrLys: 3.378 ± 0.059
3.79TyrLeu: 3.79 ± 0.065
0.704TyrMet: 0.704 ± 0.026
3.259TyrAsn: 3.259 ± 0.059
1.5TyrPro: 1.5 ± 0.046
1.603TyrGln: 1.603 ± 0.042
1.508TyrArg: 1.508 ± 0.033
2.58TyrSer: 2.58 ± 0.056
2.498TyrThr: 2.498 ± 0.064
2.303TyrVal: 2.303 ± 0.044
0.517TyrTrp: 0.517 ± 0.023
1.845TyrTyr: 1.845 ± 0.041
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3029 proteins (1091831 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski