Amino acid dipeptide frequency for Purpureocillium lilacinum (Paecilomyces lilacinus)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
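As a rough illustration of how such a table could be computed, the sketch below counts overlapping residue pairs in a set of protein sequences and reports them in per mille, then compares each value with the uniform expectation of 1000/400 = 2.5‰. The function names (`dipeptide_permille`, `classify`), the use of one-letter codes with `X` standing for Xaa, and the choice to count overlapping pairs within each protein and normalise by the total number of pairs are assumptions made for this example; the page itself does not describe its exact counting procedure.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else is treated as X (Xaa).
ALPHABET = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation from the text: 1 of 400 dipeptides = 0.25% = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: pairs are counted within each sequence only (never across
    sequence boundaries) and normalised by the total number of pairs.
    """
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        for first, second in zip(seq, seq[1:]):
            # Map non-standard or unknown residues to X.
            first = first if first in ALPHABET else "X"
            second = second if second in ALPHABET else "X"
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a per-mille frequency relative to the expected value."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"
```

With this convention, the AlaAla value reported in the table (12.424‰, i.e. a fraction of 0.012424) would be classified as "over", and CysCys (0.312‰) as "under".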

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. Each cell is the frequency in ‰ ± its estimated error.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 12.424 ± 0.075 | 1.281 ± 0.017 | 5.126 ± 0.037 | 5.529 ± 0.046 | 3.275 ± 0.025 | 7.16 ± 0.047 | 2.031 ± 0.018 | 4.08 ± 0.032 | 4.257 ± 0.033 | 8.484 ± 0.041 | 2.289 ± 0.022 | 2.98 ± 0.022 | 5.699 ± 0.046 | 3.715 ± 0.032 | 6.25 ± 0.034 | 8.043 ± 0.042 | 5.9 ± 0.038 | 6.348 ± 0.043 | 1.399 ± 0.019 | 2.19 ± 0.021 | 0.0 ± 0.0
Cys | 1.152 ± 0.015 | 0.312 ± 0.011 | 0.721 ± 0.013 | 0.592 ± 0.011 | 0.545 ± 0.01 | 1.077 ± 0.018 | 0.36 ± 0.009 | 0.656 ± 0.013 | 0.537 ± 0.012 | 1.37 ± 0.018 | 0.296 ± 0.007 | 0.424 ± 0.009 | 0.78 ± 0.015 | 0.471 ± 0.01 | 0.943 ± 0.017 | 0.964 ± 0.016 | 0.712 ± 0.013 | 0.895 ± 0.014 | 0.243 ± 0.008 | 0.354 ± 0.008 | 0.0 ± 0.0
Asp | 5.844 ± 0.042 | 0.677 ± 0.011 | 4.899 ± 0.051 | 4.723 ± 0.042 | 2.125 ± 0.022 | 4.822 ± 0.035 | 1.219 ± 0.018 | 2.659 ± 0.023 | 2.595 ± 0.026 | 4.846 ± 0.035 | 1.392 ± 0.015 | 1.767 ± 0.017 | 3.248 ± 0.025 | 1.782 ± 0.019 | 3.244 ± 0.029 | 3.943 ± 0.034 | 2.815 ± 0.023 | 4.045 ± 0.03 | 0.907 ± 0.015 | 1.463 ± 0.018 | 0.0 ± 0.0
Glu | 5.862 ± 0.049 | 0.652 ± 0.013 | 3.964 ± 0.041 | 4.7 ± 0.054 | 1.785 ± 0.021 | 3.718 ± 0.029 | 1.409 ± 0.016 | 2.326 ± 0.023 | 3.008 ± 0.028 | 5.025 ± 0.038 | 1.393 ± 0.015 | 1.744 ± 0.021 | 2.94 ± 0.036 | 2.398 ± 0.025 | 4.01 ± 0.038 | 3.928 ± 0.032 | 3.284 ± 0.03 | 3.389 ± 0.031 | 0.864 ± 0.014 | 1.491 ± 0.017 | 0.0 ± 0.0
Phe | 3.244 ± 0.026 | 0.566 ± 0.011 | 2.258 ± 0.023 | 1.954 ± 0.018 | 1.55 ± 0.021 | 2.833 ± 0.028 | 0.851 ± 0.013 | 1.561 ± 0.019 | 1.373 ± 0.018 | 3.242 ± 0.028 | 0.778 ± 0.012 | 1.3 ± 0.019 | 1.806 ± 0.02 | 1.238 ± 0.016 | 1.968 ± 0.02 | 2.634 ± 0.024 | 1.969 ± 0.021 | 2.498 ± 0.026 | 0.646 ± 0.012 | 1.017 ± 0.015 | 0.0 ± 0.0
Gly | 6.59 ± 0.045 | 1.017 ± 0.017 | 4.182 ± 0.03 | 3.76 ± 0.032 | 2.778 ± 0.028 | 6.952 ± 0.073 | 1.91 ± 0.023 | 3.31 ± 0.032 | 3.493 ± 0.03 | 6.264 ± 0.04 | 1.732 ± 0.019 | 2.506 ± 0.029 | 3.734 ± 0.028 | 2.743 ± 0.028 | 5.004 ± 0.034 | 6.122 ± 0.036 | 4.171 ± 0.032 | 4.723 ± 0.037 | 1.282 ± 0.016 | 2.051 ± 0.023 | 0.001 ± 0.0
His | 2.142 ± 0.02 | 0.377 ± 0.01 | 1.5 ± 0.019 | 1.315 ± 0.017 | 0.871 ± 0.013 | 2.047 ± 0.022 | 1.007 ± 0.019 | 1.054 ± 0.013 | 0.868 ± 0.013 | 2.173 ± 0.024 | 0.519 ± 0.01 | 0.743 ± 0.013 | 1.669 ± 0.022 | 1.02 ± 0.017 | 1.695 ± 0.021 | 1.672 ± 0.022 | 1.162 ± 0.014 | 1.585 ± 0.018 | 0.355 ± 0.008 | 0.634 ± 0.011 | 0.0 ± 0.0
Ile | 3.927 ± 0.034 | 0.671 ± 0.011 | 2.544 ± 0.024 | 2.341 ± 0.023 | 1.708 ± 0.021 | 2.934 ± 0.026 | 1.024 ± 0.013 | 2.029 ± 0.025 | 1.873 ± 0.023 | 3.864 ± 0.035 | 0.942 ± 0.013 | 1.459 ± 0.017 | 2.565 ± 0.022 | 1.538 ± 0.02 | 2.57 ± 0.022 | 3.016 ± 0.024 | 2.391 ± 0.024 | 3.028 ± 0.026 | 0.66 ± 0.012 | 1.158 ± 0.015 | 0.0 ± 0.0
Lys | 4.417 ± 0.036 | 0.499 ± 0.012 | 2.627 ± 0.029 | 2.829 ± 0.032 | 1.3 ± 0.017 | 2.96 ± 0.027 | 1.073 ± 0.017 | 1.821 ± 0.022 | 2.954 ± 0.046 | 3.761 ± 0.029 | 0.976 ± 0.015 | 1.412 ± 0.019 | 2.641 ± 0.025 | 1.727 ± 0.021 | 3.281 ± 0.03 | 3.023 ± 0.026 | 2.66 ± 0.024 | 2.641 ± 0.026 | 0.622 ± 0.012 | 1.224 ± 0.015 | 0.0 ± 0.0
Leu | 8.752 ± 0.049 | 1.284 ± 0.017 | 5.232 ± 0.034 | 5.078 ± 0.041 | 3.145 ± 0.032 | 6.219 ± 0.042 | 2.182 ± 0.022 | 3.406 ± 0.029 | 3.657 ± 0.038 | 8.263 ± 0.06 | 1.767 ± 0.018 | 2.746 ± 0.022 | 5.451 ± 0.037 | 3.554 ± 0.025 | 6.195 ± 0.036 | 6.768 ± 0.047 | 4.562 ± 0.031 | 5.731 ± 0.041 | 1.216 ± 0.015 | 2.187 ± 0.023 | 0.0 ± 0.0
Met | 2.563 ± 0.021 | 0.265 ± 0.007 | 1.305 ± 0.016 | 1.228 ± 0.018 | 0.738 ± 0.01 | 1.559 ± 0.019 | 0.518 ± 0.01 | 0.854 ± 0.014 | 0.924 ± 0.015 | 1.893 ± 0.019 | 0.609 ± 0.012 | 0.702 ± 0.012 | 1.419 ± 0.017 | 0.864 ± 0.015 | 1.4 ± 0.016 | 1.852 ± 0.02 | 1.336 ± 0.016 | 1.372 ± 0.018 | 0.284 ± 0.007 | 0.519 ± 0.009 | 0.0 ± 0.0
Asn | 3.039 ± 0.023 | 0.43 ± 0.01 | 1.792 ± 0.02 | 1.699 ± 0.02 | 1.18 ± 0.015 | 2.881 ± 0.03 | 0.731 ± 0.014 | 1.602 ± 0.018 | 1.373 ± 0.02 | 2.778 ± 0.024 | 0.785 ± 0.013 | 1.216 ± 0.02 | 2.109 ± 0.021 | 1.079 ± 0.017 | 1.837 ± 0.018 | 2.23 ± 0.02 | 1.829 ± 0.02 | 2.184 ± 0.021 | 0.528 ± 0.012 | 0.881 ± 0.012 | 0.0 ± 0.0
Pro | 6.291 ± 0.048 | 0.656 ± 0.014 | 3.39 ± 0.03 | 3.625 ± 0.033 | 1.995 ± 0.022 | 4.499 ± 0.036 | 1.369 ± 0.018 | 2.14 ± 0.022 | 2.502 ± 0.026 | 4.728 ± 0.027 | 1.134 ± 0.016 | 1.89 ± 0.024 | 5.618 ± 0.078 | 2.511 ± 0.033 | 3.954 ± 0.035 | 5.83 ± 0.048 | 3.808 ± 0.031 | 3.796 ± 0.033 | 0.852 ± 0.014 | 1.41 ± 0.016 | 0.0 ± 0.0
Gln | 3.715 ± 0.032 | 0.463 ± 0.009 | 2.062 ± 0.023 | 2.137 ± 0.022 | 1.197 ± 0.016 | 2.621 ± 0.027 | 1.158 ± 0.018 | 1.552 ± 0.017 | 1.672 ± 0.022 | 3.403 ± 0.031 | 0.903 ± 0.015 | 1.233 ± 0.018 | 2.675 ± 0.036 | 2.818 ± 0.064 | 2.824 ± 0.029 | 2.828 ± 0.026 | 2.183 ± 0.019 | 2.211 ± 0.023 | 0.578 ± 0.011 | 1.047 ± 0.016 | 0.0 ± 0.0
Arg | 5.863 ± 0.037 | 0.929 ± 0.012 | 3.914 ± 0.028 | 3.888 ± 0.034 | 2.191 ± 0.023 | 4.485 ± 0.034 | 1.857 ± 0.022 | 2.713 ± 0.024 | 3.2 ± 0.03 | 5.981 ± 0.037 | 1.421 ± 0.017 | 2.114 ± 0.02 | 4.115 ± 0.033 | 2.904 ± 0.027 | 6.393 ± 0.05 | 4.909 ± 0.037 | 3.447 ± 0.03 | 3.836 ± 0.028 | 1.093 ± 0.016 | 1.615 ± 0.018 | 0.001 ± 0.0
Ser | 7.163 ± 0.046 | 0.984 ± 0.017 | 4.105 ± 0.031 | 3.636 ± 0.031 | 2.787 ± 0.025 | 5.807 ± 0.039 | 1.889 ± 0.019 | 3.338 ± 0.029 | 3.285 ± 0.026 | 6.692 ± 0.044 | 1.704 ± 0.02 | 2.519 ± 0.023 | 5.393 ± 0.052 | 3.02 ± 0.034 | 5.266 ± 0.041 | 8.211 ± 0.068 | 5.009 ± 0.04 | 4.537 ± 0.033 | 1.157 ± 0.017 | 1.818 ± 0.02 | 0.0 ± 0.0
Thr | 5.72 ± 0.036 | 0.79 ± 0.013 | 2.811 ± 0.025 | 2.848 ± 0.029 | 2.025 ± 0.019 | 4.328 ± 0.033 | 1.263 ± 0.015 | 2.659 ± 0.026 | 2.392 ± 0.025 | 4.97 ± 0.033 | 1.183 ± 0.015 | 1.807 ± 0.019 | 4.173 ± 0.038 | 1.943 ± 0.02 | 3.331 ± 0.026 | 4.863 ± 0.035 | 4.192 ± 0.048 | 3.775 ± 0.03 | 0.886 ± 0.013 | 1.486 ± 0.015 | 0.0 ± 0.0
Val | 6.303 ± 0.04 | 0.907 ± 0.014 | 4.046 ± 0.03 | 3.81 ± 0.038 | 2.458 ± 0.026 | 4.384 ± 0.031 | 1.459 ± 0.015 | 2.641 ± 0.03 | 2.739 ± 0.025 | 5.823 ± 0.038 | 1.394 ± 0.015 | 2.085 ± 0.02 | 3.916 ± 0.031 | 2.363 ± 0.024 | 3.986 ± 0.03 | 4.65 ± 0.03 | 3.595 ± 0.029 | 4.782 ± 0.04 | 0.912 ± 0.013 | 1.691 ± 0.018 | 0.0 ± 0.0
Trp | 1.322 ± 0.017 | 0.223 ± 0.006 | 0.935 ± 0.014 | 0.81 ± 0.013 | 0.547 ± 0.01 | 0.994 ± 0.016 | 0.409 ± 0.009 | 0.678 ± 0.013 | 0.738 ± 0.013 | 1.439 ± 0.02 | 0.38 ± 0.009 | 0.587 ± 0.01 | 0.71 ± 0.015 | 0.612 ± 0.011 | 1.124 ± 0.017 | 1.065 ± 0.016 | 0.986 ± 0.014 | 0.934 ± 0.014 | 0.318 ± 0.008 | 0.438 ± 0.009 | 0.0 ± 0.0
Tyr | 2.127 ± 0.019 | 0.421 ± 0.009 | 1.603 ± 0.017 | 1.39 ± 0.016 | 1.085 ± 0.015 | 2.048 ± 0.021 | 0.689 ± 0.011 | 1.16 ± 0.013 | 1.007 ± 0.014 | 2.4 ± 0.024 | 0.61 ± 0.01 | 0.949 ± 0.013 | 1.335 ± 0.018 | 0.976 ± 0.014 | 1.595 ± 0.017 | 1.767 ± 0.018 | 1.434 ± 0.017 | 1.635 ± 0.02 | 0.444 ± 0.01 | 0.845 ± 0.015 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.001 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.001 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.121 ± 0.059

Statistics based on 11750 proteins (5417339 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
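A minimal sketch of what protein-level bootstrapping could look like is given below: whole proteins are resampled with replacement, the per-mille frequency of one dipeptide is recomputed for each replicate, and the spread over 100 replicates is taken as the error. The function names (`pair_counts`, `bootstrap_error`), the fixed random seed, and the use of the standard deviation across replicates as the reported error are assumptions for illustration; the note above does not state which spread measure was used.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs within a single protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each replicate resamples whole proteins with replacement, so the
    resampling unit is the protein, not the individual residue pair.
    """
    rng = random.Random(seed)
    # Pre-compute counts once per protein; replicates only re-combine them.
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(counts[pair] for counts in sample)
        total = sum(sum(counts.values()) for counts in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)
```

Calling `bootstrap_error(sequences, "AA")` on a list of one-letter protein sequences would return a value and an error comparable in form to an entry such as "AlaAla: 12.424 ± 0.075", provided the same counting convention is used.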

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski