Amino acid dipepetide frequency for Furcraea necrotic streak virus (isolate/Colombia/Furcraea/Cauca/2004)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.316AlaAla: 3.316 ± 1.043
0.0AlaCys: 0.0 ± 0.0
2.653AlaAsp: 2.653 ± 1.203
3.979AlaGlu: 3.979 ± 1.473
0.663AlaPhe: 0.663 ± 0.402
5.968AlaGly: 5.968 ± 0.974
0.0AlaHis: 0.0 ± 0.0
5.305AlaIle: 5.305 ± 2.198
1.326AlaLys: 1.326 ± 0.803
9.284AlaLeu: 9.284 ± 2.097
1.326AlaMet: 1.326 ± 0.55
5.968AlaAsn: 5.968 ± 0.912
4.642AlaPro: 4.642 ± 0.957
3.316AlaGln: 3.316 ± 0.9
6.631AlaArg: 6.631 ± 2.067
1.326AlaSer: 1.326 ± 0.55
3.979AlaThr: 3.979 ± 2.371
6.631AlaVal: 6.631 ± 0.981
0.663AlaTrp: 0.663 ± 0.654
2.653AlaTyr: 2.653 ± 0.941
0.0AlaXaa: 0.0 ± 0.0
Cys
1.326CysAla: 1.326 ± 0.55
0.663CysCys: 0.663 ± 1.306
1.989CysAsp: 1.989 ± 1.429
1.326CysGlu: 1.326 ± 0.55
1.326CysPhe: 1.326 ± 0.803
0.663CysGly: 0.663 ± 0.402
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
2.653CysLys: 2.653 ± 0.457
3.316CysLeu: 3.316 ± 1.277
0.0CysMet: 0.0 ± 0.0
1.326CysAsn: 1.326 ± 0.601
0.0CysPro: 0.0 ± 0.0
1.326CysGln: 1.326 ± 0.803
1.326CysArg: 1.326 ± 1.216
1.326CysSer: 1.326 ± 1.307
1.989CysThr: 1.989 ± 0.629
0.663CysVal: 0.663 ± 0.402
0.663CysTrp: 0.663 ± 0.654
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
7.958AspAla: 7.958 ± 2.122
1.989AspCys: 1.989 ± 0.68
0.663AspAsp: 0.663 ± 0.402
4.642AspGlu: 4.642 ± 1.582
1.326AspPhe: 1.326 ± 0.55
3.316AspGly: 3.316 ± 1.012
0.663AspHis: 0.663 ± 0.402
2.653AspIle: 2.653 ± 0.457
1.326AspLys: 1.326 ± 1.307
1.989AspLeu: 1.989 ± 1.205
1.989AspMet: 1.989 ± 0.68
2.653AspAsn: 2.653 ± 1.752
5.305AspPro: 5.305 ± 1.641
1.989AspGln: 1.989 ± 1.256
2.653AspArg: 2.653 ± 1.196
1.326AspSer: 1.326 ± 1.307
3.316AspThr: 3.316 ± 0.588
2.653AspVal: 2.653 ± 0.941
0.0AspTrp: 0.0 ± 0.0
1.989AspTyr: 1.989 ± 0.629
0.663AspXaa: 0.663 ± 0.402
Glu
4.642GluAla: 4.642 ± 1.024
0.0GluCys: 0.0 ± 0.0
5.968GluAsp: 5.968 ± 2.154
2.653GluGlu: 2.653 ± 0.941
1.989GluPhe: 1.989 ± 0.68
3.316GluGly: 3.316 ± 1.043
2.653GluHis: 2.653 ± 0.941
3.316GluIle: 3.316 ± 1.043
3.316GluLys: 3.316 ± 1.277
1.989GluLeu: 1.989 ± 0.707
0.0GluMet: 0.0 ± 0.0
0.663GluAsn: 0.663 ± 0.402
2.653GluPro: 2.653 ± 1.325
1.989GluGln: 1.989 ± 1.06
6.631GluArg: 6.631 ± 2.039
4.642GluSer: 4.642 ± 1.684
1.989GluThr: 1.989 ± 0.874
3.979GluVal: 3.979 ± 1.36
3.316GluTrp: 3.316 ± 0.588
0.663GluTyr: 0.663 ± 1.053
0.0GluXaa: 0.0 ± 0.0
Phe
5.968PheAla: 5.968 ± 1.609
1.989PheCys: 1.989 ± 0.68
2.653PheAsp: 2.653 ± 1.099
1.326PheGlu: 1.326 ± 0.803
0.663PhePhe: 0.663 ± 1.306
4.642PheGly: 4.642 ± 1.024
0.663PheHis: 0.663 ± 0.402
3.316PheIle: 3.316 ± 1.043
1.326PheLys: 1.326 ± 0.803
2.653PheLeu: 2.653 ± 0.941
2.653PheMet: 2.653 ± 0.997
2.653PheAsn: 2.653 ± 1.096
2.653PhePro: 2.653 ± 1.081
0.0PheGln: 0.0 ± 0.0
2.653PheArg: 2.653 ± 1.01
3.979PheSer: 3.979 ± 1.649
2.653PheThr: 2.653 ± 1.099
1.326PheVal: 1.326 ± 0.803
0.663PheTrp: 0.663 ± 0.402
2.653PheTyr: 2.653 ± 1.27
0.0PheXaa: 0.0 ± 0.0
Gly
0.0GlyAla: 0.0 ± 0.0
2.653GlyCys: 2.653 ± 0.941
3.316GlyAsp: 3.316 ± 1.365
3.979GlyGlu: 3.979 ± 0.662
3.979GlyPhe: 3.979 ± 1.643
3.979GlyGly: 3.979 ± 1.505
0.0GlyHis: 0.0 ± 0.0
5.968GlyIle: 5.968 ± 1.427
1.989GlyLys: 1.989 ± 1.429
7.958GlyLeu: 7.958 ± 1.434
1.989GlyMet: 1.989 ± 1.06
3.979GlyAsn: 3.979 ± 1.502
1.326GlyPro: 1.326 ± 1.307
2.653GlyGln: 2.653 ± 2.196
7.958GlyArg: 7.958 ± 2.047
2.653GlySer: 2.653 ± 1.099
1.326GlyThr: 1.326 ± 1.367
8.621GlyVal: 8.621 ± 2.821
1.326GlyTrp: 1.326 ± 0.601
1.326GlyTyr: 1.326 ± 0.55
0.0GlyXaa: 0.0 ± 0.0
His
1.326HisAla: 1.326 ± 0.803
1.326HisCys: 1.326 ± 1.307
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
1.326HisPhe: 1.326 ± 0.803
2.653HisGly: 2.653 ± 1.01
1.326HisHis: 1.326 ± 1.367
0.663HisIle: 0.663 ± 1.306
1.989HisLys: 1.989 ± 1.205
0.0HisLeu: 0.0 ± 0.0
0.0HisMet: 0.0 ± 0.0
2.653HisAsn: 2.653 ± 1.325
0.663HisPro: 0.663 ± 0.402
1.326HisGln: 1.326 ± 2.106
0.663HisArg: 0.663 ± 0.402
0.663HisSer: 0.663 ± 0.402
2.653HisThr: 2.653 ± 1.203
2.653HisVal: 2.653 ± 1.325
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.979IleAla: 3.979 ± 1.473
1.326IleCys: 1.326 ± 0.803
1.326IleAsp: 1.326 ± 0.55
3.979IleGlu: 3.979 ± 1.643
1.989IlePhe: 1.989 ± 0.707
4.642IleGly: 4.642 ± 1.664
0.0IleHis: 0.0 ± 0.0
1.326IleIle: 1.326 ± 0.803
1.989IleLys: 1.989 ± 0.68
1.989IleLeu: 1.989 ± 2.592
1.326IleMet: 1.326 ± 0.803
3.979IleAsn: 3.979 ± 0.813
4.642IlePro: 4.642 ± 0.957
2.653IleGln: 2.653 ± 1.752
3.979IleArg: 3.979 ± 1.325
3.316IleSer: 3.316 ± 1.944
5.968IleThr: 5.968 ± 2.813
1.326IleVal: 1.326 ± 0.601
0.663IleTrp: 0.663 ± 0.402
2.653IleTyr: 2.653 ± 1.133
0.0IleXaa: 0.0 ± 0.0
Lys
2.653LysAla: 2.653 ± 1.096
1.326LysCys: 1.326 ± 1.216
3.316LysAsp: 3.316 ± 0.923
2.653LysGlu: 2.653 ± 1.096
0.663LysPhe: 0.663 ± 0.654
2.653LysGly: 2.653 ± 0.941
0.663LysHis: 0.663 ± 0.402
1.989LysIle: 1.989 ± 1.256
2.653LysLys: 2.653 ± 0.457
5.968LysLeu: 5.968 ± 1.707
1.326LysMet: 1.326 ± 1.355
1.326LysAsn: 1.326 ± 0.601
3.316LysPro: 3.316 ± 1.277
1.989LysGln: 1.989 ± 0.707
3.979LysArg: 3.979 ± 0.813
1.326LysSer: 1.326 ± 0.977
4.642LysThr: 4.642 ± 1.26
4.642LysVal: 4.642 ± 1.712
1.326LysTrp: 1.326 ± 0.977
1.989LysTyr: 1.989 ± 0.68
0.663LysXaa: 0.663 ± 0.402
Leu
7.958LeuAla: 7.958 ± 2.504
0.0LeuCys: 0.0 ± 0.0
3.979LeuAsp: 3.979 ± 0.897
5.305LeuGlu: 5.305 ± 1.88
2.653LeuPhe: 2.653 ± 1.684
3.316LeuGly: 3.316 ± 1.665
1.326LeuHis: 1.326 ± 0.55
4.642LeuIle: 4.642 ± 1.353
3.316LeuLys: 3.316 ± 0.588
9.947LeuLeu: 9.947 ± 2.237
2.653LeuMet: 2.653 ± 1.524
0.663LeuAsn: 0.663 ± 0.402
3.979LeuPro: 3.979 ± 1.36
2.653LeuGln: 2.653 ± 1.099
4.642LeuArg: 4.642 ± 2.023
5.305LeuSer: 5.305 ± 2.329
4.642LeuThr: 4.642 ± 1.729
6.631LeuVal: 6.631 ± 3.606
1.989LeuTrp: 1.989 ± 0.68
1.326LeuTyr: 1.326 ± 0.55
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
1.989MetCys: 1.989 ± 0.707
0.663MetAsp: 0.663 ± 0.402
2.653MetGlu: 2.653 ± 1.058
0.663MetPhe: 0.663 ± 0.654
1.326MetGly: 1.326 ± 0.601
0.663MetHis: 0.663 ± 0.402
0.663MetIle: 0.663 ± 0.402
1.326MetLys: 1.326 ± 1.367
0.0MetLeu: 0.0 ± 0.0
0.663MetMet: 0.663 ± 0.654
1.989MetAsn: 1.989 ± 0.68
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
0.663MetArg: 0.663 ± 0.402
1.989MetSer: 1.989 ± 1.139
1.326MetThr: 1.326 ± 2.106
3.979MetVal: 3.979 ± 0.662
0.663MetTrp: 0.663 ± 0.654
1.326MetTyr: 1.326 ± 0.803
0.0MetXaa: 0.0 ± 0.0
Asn
2.653AsnAla: 2.653 ± 0.457
1.326AsnCys: 1.326 ± 0.55
2.653AsnAsp: 2.653 ± 1.099
1.326AsnGlu: 1.326 ± 2.611
1.989AsnPhe: 1.989 ± 2.05
2.653AsnGly: 2.653 ± 1.196
0.663AsnHis: 0.663 ± 0.402
3.316AsnIle: 3.316 ± 1.735
2.653AsnLys: 2.653 ± 1.096
1.326AsnLeu: 1.326 ± 0.55
0.663AsnMet: 0.663 ± 0.402
3.979AsnAsn: 3.979 ± 0.813
1.326AsnPro: 1.326 ± 1.307
0.0AsnGln: 0.0 ± 0.0
5.968AsnArg: 5.968 ± 2.154
3.979AsnSer: 3.979 ± 1.505
2.653AsnThr: 2.653 ± 0.457
4.642AsnVal: 4.642 ± 0.763
0.0AsnTrp: 0.0 ± 0.0
0.663AsnTyr: 0.663 ± 0.402
0.0AsnXaa: 0.0 ± 0.0
Pro
2.653ProAla: 2.653 ± 0.457
0.663ProCys: 0.663 ± 0.654
1.989ProAsp: 1.989 ± 1.205
3.979ProGlu: 3.979 ± 0.917
1.326ProPhe: 1.326 ± 1.216
1.326ProGly: 1.326 ± 0.803
0.663ProHis: 0.663 ± 1.306
3.316ProIle: 3.316 ± 1.277
3.316ProLys: 3.316 ± 1.043
2.653ProLeu: 2.653 ± 1.445
2.653ProMet: 2.653 ± 1.133
0.663ProAsn: 0.663 ± 0.654
5.305ProPro: 5.305 ± 1.963
1.326ProGln: 1.326 ± 0.803
5.968ProArg: 5.968 ± 2.801
3.316ProSer: 3.316 ± 0.588
3.979ProThr: 3.979 ± 0.995
7.958ProVal: 7.958 ± 1.282
0.0ProTrp: 0.0 ± 0.0
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
3.979GlnAla: 3.979 ± 0.662
0.0GlnCys: 0.0 ± 0.0
0.0GlnAsp: 0.0 ± 0.0
0.663GlnGlu: 0.663 ± 0.402
1.326GlnPhe: 1.326 ± 0.55
5.305GlnGly: 5.305 ± 1.94
3.979GlnHis: 3.979 ± 2.383
2.653GlnIle: 2.653 ± 0.804
3.979GlnLys: 3.979 ± 1.318
0.663GlnLeu: 0.663 ± 0.654
0.663GlnMet: 0.663 ± 0.402
0.0GlnAsn: 0.0 ± 0.0
3.979GlnPro: 3.979 ± 1.414
1.989GlnGln: 1.989 ± 0.68
3.316GlnArg: 3.316 ± 0.588
1.989GlnSer: 1.989 ± 1.814
1.326GlnThr: 1.326 ± 2.106
1.326GlnVal: 1.326 ± 1.307
0.0GlnTrp: 0.0 ± 0.0
0.663GlnTyr: 0.663 ± 1.306
0.0GlnXaa: 0.0 ± 0.0
Arg
7.958ArgAla: 7.958 ± 2.038
1.326ArgCys: 1.326 ± 1.216
6.631ArgAsp: 6.631 ± 1.2
4.642ArgGlu: 4.642 ± 1.592
11.273ArgPhe: 11.273 ± 2.216
2.653ArgGly: 2.653 ± 1.607
0.0ArgHis: 0.0 ± 0.0
2.653ArgIle: 2.653 ± 0.457
5.305ArgLys: 5.305 ± 1.024
5.305ArgLeu: 5.305 ± 1.146
0.663ArgMet: 0.663 ± 0.402
3.316ArgAsn: 3.316 ± 1.043
3.316ArgPro: 3.316 ± 1.558
2.653ArgGln: 2.653 ± 0.941
3.979ArgArg: 3.979 ± 1.473
4.642ArgSer: 4.642 ± 1.419
3.316ArgThr: 3.316 ± 0.785
5.968ArgVal: 5.968 ± 1.018
0.663ArgTrp: 0.663 ± 0.402
2.653ArgTyr: 2.653 ± 1.01
0.0ArgXaa: 0.0 ± 0.0
Ser
0.0SerAla: 0.0 ± 0.0
0.663SerCys: 0.663 ± 0.654
1.989SerAsp: 1.989 ± 1.814
1.989SerGlu: 1.989 ± 1.139
2.653SerPhe: 2.653 ± 1.099
5.305SerGly: 5.305 ± 1.608
4.642SerHis: 4.642 ± 1.658
1.989SerIle: 1.989 ± 1.961
3.316SerLys: 3.316 ± 1.527
9.947SerLeu: 9.947 ± 3.013
0.0SerMet: 0.0 ± 0.0
1.326SerAsn: 1.326 ± 1.307
2.653SerPro: 2.653 ± 1.684
1.326SerGln: 1.326 ± 1.098
6.631SerArg: 6.631 ± 0.994
6.631SerSer: 6.631 ± 4.833
5.968SerThr: 5.968 ± 2.594
1.989SerVal: 1.989 ± 0.874
0.0SerTrp: 0.0 ± 0.0
0.663SerTyr: 0.663 ± 0.654
0.0SerXaa: 0.0 ± 0.0
Thr
5.305ThrAla: 5.305 ± 1.963
1.989ThrCys: 1.989 ± 0.707
1.989ThrAsp: 1.989 ± 0.629
3.316ThrGlu: 3.316 ± 1.043
2.653ThrPhe: 2.653 ± 0.457
3.979ThrGly: 3.979 ± 1.414
1.989ThrHis: 1.989 ± 0.68
4.642ThrIle: 4.642 ± 2.235
2.653ThrLys: 2.653 ± 0.457
4.642ThrLeu: 4.642 ± 2.251
0.663ThrMet: 0.663 ± 0.402
3.316ThrAsn: 3.316 ± 2.573
4.642ThrPro: 4.642 ± 1.02
1.326ThrGln: 1.326 ± 1.098
4.642ThrArg: 4.642 ± 2.016
2.653ThrSer: 2.653 ± 0.804
3.979ThrThr: 3.979 ± 1.757
5.305ThrVal: 5.305 ± 1.998
0.0ThrTrp: 0.0 ± 0.0
2.653ThrTyr: 2.653 ± 2.196
0.0ThrXaa: 0.0 ± 0.0
Val
5.968ValAla: 5.968 ± 1.858
2.653ValCys: 2.653 ± 1.203
5.968ValAsp: 5.968 ± 1.27
7.294ValGlu: 7.294 ± 2.525
5.968ValPhe: 5.968 ± 1.466
5.305ValGly: 5.305 ± 1.362
0.663ValHis: 0.663 ± 0.402
2.653ValIle: 2.653 ± 0.804
4.642ValLys: 4.642 ± 1.631
4.642ValLeu: 4.642 ± 2.246
1.989ValMet: 1.989 ± 0.939
0.663ValAsn: 0.663 ± 0.654
2.653ValPro: 2.653 ± 0.457
3.316ValGln: 3.316 ± 1.762
2.653ValArg: 2.653 ± 0.804
7.958ValSer: 7.958 ± 1.885
5.968ValThr: 5.968 ± 2.372
6.631ValVal: 6.631 ± 1.816
1.326ValTrp: 1.326 ± 0.601
2.653ValTyr: 2.653 ± 1.774
0.0ValXaa: 0.0 ± 0.0
Trp
1.326TrpAla: 1.326 ± 0.601
0.0TrpCys: 0.0 ± 0.0
1.326TrpAsp: 1.326 ± 0.55
0.663TrpGlu: 0.663 ± 0.402
1.989TrpPhe: 1.989 ± 0.68
1.989TrpGly: 1.989 ± 0.629
0.0TrpHis: 0.0 ± 0.0
0.663TrpIle: 0.663 ± 1.053
0.0TrpLys: 0.0 ± 0.0
1.326TrpLeu: 1.326 ± 0.55
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
1.326TrpGln: 1.326 ± 0.803
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
1.989TrpVal: 1.989 ± 0.629
0.0TrpTrp: 0.0 ± 0.0
0.663TrpTyr: 0.663 ± 0.402
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.663TyrAla: 0.663 ± 0.654
0.0TyrCys: 0.0 ± 0.0
1.989TyrAsp: 1.989 ± 0.68
0.0TyrGlu: 0.0 ± 0.0
0.663TyrPhe: 0.663 ± 0.654
1.326TyrGly: 1.326 ± 0.55
1.326TyrHis: 1.326 ± 0.803
1.326TyrIle: 1.326 ± 0.803
1.989TyrLys: 1.989 ± 0.874
1.989TyrLeu: 1.989 ± 0.707
0.663TyrMet: 0.663 ± 1.053
2.653TyrAsn: 2.653 ± 1.954
0.0TyrPro: 0.0 ± 0.0
4.642TyrGln: 4.642 ± 1.647
3.979TyrArg: 3.979 ± 0.897
0.663TyrSer: 0.663 ± 0.402
0.663TyrThr: 0.663 ± 0.654
2.653TyrVal: 2.653 ± 1.774
0.0TyrTrp: 0.0 ± 0.0
0.663TyrTyr: 0.663 ± 0.402
0.663TyrXaa: 0.663 ± 0.402
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.663XaaGly: 0.663 ± 0.402
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.663XaaAsn: 0.663 ± 0.402
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.663XaaTyr: 0.663 ± 0.402
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1509 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski