Amino acid dipeptide frequency for Aspergillus steynii IBT 23096

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with every amino acid equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
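The counting behind such a table can be illustrated with a minimal sketch. It assumes that dipeptides are counted as overlapping adjacent pairs within each protein and never across protein boundaries; the page does not state the exact procedure. The function name and the toy sequences (in one-letter code) are hypothetical.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        # Pairs never span two proteins: a sequence of length n yields n - 1 pairs.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not taken from the real proteome.
toy_proteome = ["MKLLA", "ALLK"]
freqs = dipeptide_per_mille(toy_proteome)

# Uniform expectation for 400 equally likely dipeptides: 2.5 per mille (0.25 %).
uniform_expectation = 1000.0 / 400
```

Under this assumption the frequencies of all observed dipeptides sum to 1000‰, and each protein of length n contributes n - 1 pairs.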

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
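As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and compares it with the uniform expectation of 2.5‰. The simple greater-than or less-than comparison is an assumption; the page does not say which threshold the red and blue marking uses. The function name `classify` is hypothetical, and the three values are taken from the table on this page.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def classify(value_per_mille, expected=UNIFORM_PER_MILLE):
    """Return (fraction, ratio to expectation, label) for one table value."""
    fraction = value_per_mille * 1e-3      # per mille -> fraction
    ratio = value_per_mille / expected     # > 1 means more frequent than uniform
    if value_per_mille > expected:
        label = "over"
    elif value_per_mille < expected:
        label = "under"
    else:
        label = "as expected"
    return fraction, ratio, label


# Values copied from the table: LeuLeu, TrpCys, IleIle.
for name, value in {"LeuLeu": 8.79, "TrpCys": 0.202, "IleIle": 2.49}.items():
    fraction, ratio, label = classify(value)
    print(f"{name}: fraction={fraction:.6f}, ratio={ratio:.2f}, {label}")
```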

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.439 ± 0.059
AlaCys: 1.143 ± 0.015
AlaAsp: 4.282 ± 0.024
AlaGlu: 4.867 ± 0.039
AlaPhe: 3.243 ± 0.028
AlaGly: 5.781 ± 0.037
AlaHis: 1.818 ± 0.019
AlaIle: 4.254 ± 0.028
AlaLys: 3.581 ± 0.028
AlaLeu: 7.816 ± 0.04
AlaMet: 1.957 ± 0.017
AlaAsn: 2.837 ± 0.022
AlaPro: 4.675 ± 0.043
AlaGln: 3.355 ± 0.025
AlaArg: 4.914 ± 0.032
AlaSer: 7.244 ± 0.046
AlaThr: 4.973 ± 0.033
AlaVal: 5.467 ± 0.038
AlaTrp: 1.216 ± 0.016
AlaTyr: 2.196 ± 0.02
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.022 ± 0.014
CysCys: 0.271 ± 0.008
CysAsp: 0.727 ± 0.012
CysGlu: 0.63 ± 0.011
CysPhe: 0.608 ± 0.013
CysGly: 0.965 ± 0.015
CysHis: 0.376 ± 0.008
CysIle: 0.745 ± 0.012
CysLys: 0.476 ± 0.01
CysLeu: 1.419 ± 0.017
CysMet: 0.3 ± 0.007
CysAsn: 0.43 ± 0.009
CysPro: 0.712 ± 0.015
CysGln: 0.506 ± 0.01
CysArg: 0.868 ± 0.014
CysSer: 1.023 ± 0.015
CysThr: 0.71 ± 0.013
CysVal: 0.888 ± 0.014
CysTrp: 0.219 ± 0.006
CysTyr: 0.39 ± 0.008
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.698 ± 0.029
AspCys: 0.663 ± 0.013
AspAsp: 3.996 ± 0.041
AspGlu: 4.248 ± 0.036
AspPhe: 2.229 ± 0.021
AspGly: 4.051 ± 0.03
AspHis: 1.334 ± 0.017
AspIle: 3.025 ± 0.025
AspLys: 2.148 ± 0.026
AspLeu: 5.275 ± 0.034
AspMet: 1.254 ± 0.016
AspAsn: 1.813 ± 0.021
AspPro: 3.523 ± 0.025
AspGln: 2.036 ± 0.018
AspArg: 3.27 ± 0.027
AspSer: 4.219 ± 0.029
AspThr: 2.883 ± 0.022
AspVal: 3.63 ± 0.028
AspTrp: 0.92 ± 0.012
AspTyr: 1.616 ± 0.018
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.983 ± 0.044
GluCys: 0.666 ± 0.011
GluAsp: 3.975 ± 0.028
GluGlu: 4.972 ± 0.051
GluPhe: 1.991 ± 0.018
GluGly: 3.685 ± 0.025
GluHis: 1.371 ± 0.017
GluIle: 3.042 ± 0.022
GluLys: 3.478 ± 0.033
GluLeu: 5.038 ± 0.039
GluMet: 1.454 ± 0.016
GluAsn: 2.305 ± 0.02
GluPro: 2.835 ± 0.039
GluGln: 2.411 ± 0.023
GluArg: 3.783 ± 0.028
GluSer: 4.47 ± 0.033
GluThr: 3.564 ± 0.027
GluVal: 3.433 ± 0.027
GluTrp: 0.895 ± 0.012
GluTyr: 1.694 ± 0.019
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.138 ± 0.021
PheCys: 0.629 ± 0.01
PheAsp: 2.335 ± 0.02
PheGlu: 2.126 ± 0.021
PhePhe: 1.835 ± 0.023
PheGly: 2.907 ± 0.027
PheHis: 1.022 ± 0.013
PheIle: 1.889 ± 0.023
PheLys: 1.408 ± 0.016
PheLeu: 3.874 ± 0.028
PheMet: 0.821 ± 0.013
PheAsn: 1.465 ± 0.014
PhePro: 2.127 ± 0.023
PheGln: 1.463 ± 0.015
PheArg: 2.064 ± 0.022
PheSer: 3.133 ± 0.025
PheThr: 2.17 ± 0.019
PheVal: 2.511 ± 0.022
PheTrp: 0.688 ± 0.011
PheTyr: 1.212 ± 0.015
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.31 ± 0.032
GlyCys: 0.954 ± 0.013
GlyAsp: 3.679 ± 0.025
GlyGlu: 3.633 ± 0.027
GlyPhe: 2.973 ± 0.028
GlyGly: 5.549 ± 0.059
GlyHis: 1.728 ± 0.019
GlyIle: 3.622 ± 0.026
GlyLys: 3.241 ± 0.032
GlyLeu: 6.365 ± 0.043
GlyMet: 1.604 ± 0.017
GlyAsn: 2.458 ± 0.022
GlyPro: 3.475 ± 0.027
GlyGln: 2.597 ± 0.023
GlyArg: 4.162 ± 0.026
GlySer: 5.833 ± 0.038
GlyThr: 3.88 ± 0.031
GlyVal: 4.544 ± 0.033
GlyTrp: 1.221 ± 0.018
GlyTyr: 2.203 ± 0.022
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.958 ± 0.021
HisCys: 0.374 ± 0.008
HisAsp: 1.355 ± 0.018
HisGlu: 1.367 ± 0.016
HisPhe: 0.973 ± 0.012
HisGly: 1.789 ± 0.019
HisHis: 0.896 ± 0.018
HisIle: 1.215 ± 0.017
HisLys: 0.816 ± 0.013
HisLeu: 2.423 ± 0.021
HisMet: 0.498 ± 0.009
HisAsn: 0.835 ± 0.013
HisPro: 1.829 ± 0.02
HisGln: 1.012 ± 0.013
HisArg: 1.642 ± 0.02
HisSer: 1.909 ± 0.019
HisThr: 1.285 ± 0.013
HisVal: 1.506 ± 0.016
HisTrp: 0.381 ± 0.008
HisTyr: 0.718 ± 0.011
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.151 ± 0.031
IleCys: 0.83 ± 0.013
IleAsp: 2.796 ± 0.023
IleGlu: 2.74 ± 0.024
IlePhe: 2.094 ± 0.022
IleGly: 3.29 ± 0.026
IleHis: 1.3 ± 0.015
IleIle: 2.49 ± 0.024
IleLys: 1.943 ± 0.019
IleLeu: 4.79 ± 0.031
IleMet: 1.036 ± 0.014
IleAsn: 1.709 ± 0.018
IlePro: 3.202 ± 0.026
IleGln: 1.982 ± 0.018
IleArg: 2.802 ± 0.023
IleSer: 3.848 ± 0.024
IleThr: 2.74 ± 0.025
IleVal: 3.199 ± 0.024
IleTrp: 0.736 ± 0.01
IleTyr: 1.491 ± 0.019
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.773 ± 0.032
LysCys: 0.517 ± 0.011
LysAsp: 2.565 ± 0.023
LysGlu: 3.022 ± 0.03
LysPhe: 1.344 ± 0.014
LysGly: 2.796 ± 0.027
LysHis: 1.055 ± 0.014
LysIle: 2.075 ± 0.022
LysLys: 2.846 ± 0.04
LysLeu: 3.713 ± 0.024
LysMet: 0.912 ± 0.014
LysAsn: 1.608 ± 0.018
LysPro: 2.588 ± 0.025
LysGln: 1.725 ± 0.018
LysArg: 3.074 ± 0.028
LysSer: 3.325 ± 0.029
LysThr: 2.618 ± 0.024
LysVal: 2.588 ± 0.023
LysTrp: 0.635 ± 0.009
LysTyr: 1.294 ± 0.015
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.892 ± 0.039
LeuCys: 1.291 ± 0.015
LeuAsp: 5.333 ± 0.037
LeuGlu: 5.455 ± 0.035
LeuPhe: 3.583 ± 0.029
LeuGly: 6.247 ± 0.039
LeuHis: 2.388 ± 0.023
LeuIle: 4.104 ± 0.029
LeuLys: 3.894 ± 0.029
LeuLeu: 8.79 ± 0.049
LeuMet: 1.857 ± 0.017
LeuAsn: 3.208 ± 0.022
LeuPro: 5.611 ± 0.038
LeuGln: 3.952 ± 0.03
LeuArg: 5.954 ± 0.036
LeuSer: 7.779 ± 0.047
LeuThr: 4.852 ± 0.029
LeuVal: 5.79 ± 0.032
LeuTrp: 1.276 ± 0.016
LeuTyr: 2.499 ± 0.024
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.151 ± 0.02
MetCys: 0.26 ± 0.007
MetAsp: 1.269 ± 0.017
MetGlu: 1.285 ± 0.016
MetPhe: 0.775 ± 0.013
MetGly: 1.514 ± 0.017
MetHis: 0.506 ± 0.009
MetIle: 1.051 ± 0.012
MetLys: 0.981 ± 0.013
MetLeu: 1.874 ± 0.015
MetMet: 0.567 ± 0.01
MetAsn: 0.818 ± 0.013
MetPro: 1.267 ± 0.016
MetGln: 0.861 ± 0.012
MetArg: 1.261 ± 0.014
MetSer: 1.86 ± 0.019
MetThr: 1.324 ± 0.016
MetVal: 1.375 ± 0.018
MetTrp: 0.259 ± 0.006
MetTyr: 0.54 ± 0.009
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.088 ± 0.024
AsnCys: 0.455 ± 0.008
AsnAsp: 1.892 ± 0.019
AsnGlu: 2.022 ± 0.021
AsnPhe: 1.332 ± 0.015
AsnGly: 2.821 ± 0.023
AsnHis: 0.896 ± 0.013
AsnIle: 1.955 ± 0.019
AsnLys: 1.375 ± 0.018
AsnLeu: 3.23 ± 0.024
AsnMet: 0.838 ± 0.012
AsnAsn: 1.366 ± 0.016
AsnPro: 2.559 ± 0.02
AsnGln: 1.332 ± 0.017
AsnArg: 1.963 ± 0.019
AsnSer: 2.613 ± 0.022
AsnThr: 2.08 ± 0.02
AsnVal: 2.312 ± 0.02
AsnTrp: 0.567 ± 0.009
AsnTyr: 1.067 ± 0.015
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.2 ± 0.042
ProCys: 0.625 ± 0.01
ProAsp: 3.379 ± 0.026
ProGlu: 4.003 ± 0.04
ProPhe: 2.229 ± 0.019
ProGly: 4.114 ± 0.034
ProHis: 1.362 ± 0.018
ProIle: 2.503 ± 0.021
ProLys: 2.441 ± 0.022
ProLeu: 4.865 ± 0.031
ProMet: 1.094 ± 0.015
ProAsn: 2.14 ± 0.023
ProPro: 4.81 ± 0.063
ProGln: 2.457 ± 0.03
ProArg: 3.59 ± 0.032
ProSer: 6.261 ± 0.042
ProThr: 3.818 ± 0.032
ProVal: 3.743 ± 0.024
ProTrp: 0.814 ± 0.013
ProTyr: 1.541 ± 0.017
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.368 ± 0.03
GlnCys: 0.475 ± 0.01
GlnAsp: 2.101 ± 0.019
GlnGlu: 2.427 ± 0.025
GlnPhe: 1.339 ± 0.016
GlnGly: 2.52 ± 0.02
GlnHis: 1.049 ± 0.013
GlnIle: 1.942 ± 0.017
GlnLys: 1.926 ± 0.019
GlnLeu: 3.496 ± 0.026
GlnMet: 0.894 ± 0.012
GlnAsn: 1.577 ± 0.017
GlnPro: 2.554 ± 0.029
GlnGln: 2.144 ± 0.04
GlnArg: 2.638 ± 0.022
GlnSer: 3.272 ± 0.025
GlnThr: 2.361 ± 0.023
GlnVal: 2.245 ± 0.022
GlnTrp: 0.608 ± 0.009
GlnTyr: 1.182 ± 0.014
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.635 ± 0.031
ArgCys: 0.813 ± 0.013
ArgAsp: 3.463 ± 0.032
ArgGlu: 3.802 ± 0.029
ArgPhe: 2.291 ± 0.02
ArgGly: 3.785 ± 0.03
ArgHis: 1.611 ± 0.017
ArgIle: 2.947 ± 0.02
ArgLys: 3.208 ± 0.03
ArgLeu: 5.745 ± 0.033
ArgMet: 1.342 ± 0.016
ArgAsn: 2.255 ± 0.019
ArgPro: 3.57 ± 0.028
ArgGln: 2.648 ± 0.022
ArgArg: 5.107 ± 0.041
ArgSer: 4.906 ± 0.039
ArgThr: 3.263 ± 0.022
ArgVal: 3.595 ± 0.025
ArgTrp: 1.002 ± 0.013
ArgTyr: 1.724 ± 0.017
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.705 ± 0.038
SerCys: 0.979 ± 0.015
SerAsp: 4.369 ± 0.031
SerGlu: 4.2 ± 0.028
SerPhe: 3.285 ± 0.023
SerGly: 5.653 ± 0.031
SerHis: 2.081 ± 0.023
SerIle: 4.037 ± 0.031
SerLys: 3.461 ± 0.026
SerLeu: 7.695 ± 0.043
SerMet: 1.75 ± 0.018
SerAsn: 2.935 ± 0.022
SerPro: 5.626 ± 0.046
SerGln: 3.399 ± 0.023
SerArg: 5.136 ± 0.039
SerSer: 8.817 ± 0.066
SerThr: 5.415 ± 0.032
SerVal: 4.831 ± 0.026
SerTrp: 1.211 ± 0.013
SerTyr: 2.11 ± 0.02
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.013 ± 0.032
ThrCys: 0.786 ± 0.013
ThrAsp: 2.897 ± 0.025
ThrGlu: 3.084 ± 0.024
ThrPhe: 2.253 ± 0.019
ThrGly: 4.242 ± 0.028
ThrHis: 1.323 ± 0.014
ThrIle: 2.954 ± 0.022
ThrLys: 2.323 ± 0.023
ThrLeu: 5.252 ± 0.036
ThrMet: 1.187 ± 0.013
ThrAsn: 1.989 ± 0.017
ThrPro: 4.152 ± 0.032
ThrGln: 2.056 ± 0.018
ThrArg: 3.138 ± 0.024
ThrSer: 5.069 ± 0.032
ThrThr: 3.936 ± 0.034
ThrVal: 3.839 ± 0.026
ThrTrp: 0.864 ± 0.012
ThrTyr: 1.603 ± 0.019
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.206 ± 0.036
ValCys: 0.911 ± 0.013
ValAsp: 3.857 ± 0.029
ValGlu: 3.753 ± 0.031
ValPhe: 2.629 ± 0.023
ValGly: 4.194 ± 0.03
ValHis: 1.489 ± 0.017
ValIle: 3.051 ± 0.025
ValLys: 2.695 ± 0.025
ValLeu: 5.863 ± 0.035
ValMet: 1.38 ± 0.017
ValAsn: 2.254 ± 0.021
ValPro: 3.655 ± 0.026
ValGln: 2.475 ± 0.02
ValArg: 3.58 ± 0.024
ValSer: 4.925 ± 0.031
ValThr: 3.471 ± 0.029
ValVal: 4.443 ± 0.035
ValTrp: 0.911 ± 0.012
ValTyr: 1.839 ± 0.016
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.156 ± 0.015
TrpCys: 0.202 ± 0.006
TrpAsp: 0.923 ± 0.012
TrpGlu: 0.868 ± 0.013
TrpPhe: 0.57 ± 0.01
TrpGly: 0.965 ± 0.013
TrpHis: 0.377 ± 0.009
TrpIle: 0.827 ± 0.013
TrpLys: 0.809 ± 0.012
TrpLeu: 1.417 ± 0.017
TrpMet: 0.405 ± 0.008
TrpAsn: 0.664 ± 0.01
TrpPro: 0.634 ± 0.011
TrpGln: 0.587 ± 0.011
TrpArg: 1.024 ± 0.012
TrpSer: 1.112 ± 0.015
TrpThr: 0.978 ± 0.014
TrpVal: 0.923 ± 0.012
TrpTrp: 0.296 ± 0.008
TrpTyr: 0.462 ± 0.009
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.176 ± 0.021
TyrCys: 0.441 ± 0.009
TyrAsp: 1.634 ± 0.019
TyrGlu: 1.544 ± 0.016
TyrPhe: 1.241 ± 0.016
TyrGly: 2.165 ± 0.021
TyrHis: 0.799 ± 0.013
TyrIle: 1.443 ± 0.015
TyrLys: 1.021 ± 0.015
TyrLeu: 2.815 ± 0.023
TyrMet: 0.643 ± 0.01
TyrAsn: 1.097 ± 0.014
TyrPro: 1.585 ± 0.017
TyrGln: 1.121 ± 0.015
TyrArg: 1.738 ± 0.018
TyrSer: 2.076 ± 0.019
TyrThr: 1.643 ± 0.018
TyrVal: 1.719 ± 0.018
TyrTrp: 0.478 ± 0.01
TyrTyr: 0.965 ± 0.014
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 12965 proteins (6073799 amino acids)
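To work with the table programmatically, entries of the form `AlaAla: 8.439 ± 0.059` can be parsed into a dictionary. The following is a minimal sketch; the regular expression and the function name `parse_entries` are illustrative assumptions and are unrelated to the layout of the downloadable CSV file, which is not described on this page.

```python
import re

# Two three-letter residue codes, a colon, the value, "±", and the error.
ENTRY = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([\d.]+)\s*±\s*([\d.]+)")


def parse_entries(text):
    """Map (first residue, second residue) -> (value in per mille, error)."""
    table = {}
    for match in ENTRY.finditer(text):
        first, second, value, error = match.groups()
        table[(first, second)] = (float(value), float(error))
    return table


# Three entries copied from the table above.
sample = "AlaAla: 8.439 ± 0.059\nAlaCys: 1.143 ± 0.015\nSerSer: 8.817 ± 0.066"
table = parse_entries(sample)
print(table[("Ser", "Ser")])
```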

Note: The error has been estimated by bootstrapping (×100) at the protein level.
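A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is reported. The page states only that bootstrapping was done 100 times at the protein level; using the standard deviation as the error, counting overlapping pairs, and all function names and toy sequences here are assumptions.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide over overlapping adjacent pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(resample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy sequences (one-letter code), not from the real proteome.
toy_proteome = ["MKLLA", "ALLK", "MSSLLP", "MAAK"]
error = bootstrap_error(toy_proteome, "LL")
```

Resampling proteins rather than individual residues keeps each sequence intact, so correlations between neighbouring residues within a protein are preserved in every resample.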

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski