Amino acid dipeptide frequency for Alstroemeria necrotic streak virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
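As a minimal sketch of how such a frequency could be computed, the Python snippet below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the use of one-letter codes (the table uses three-letter codes), and the toy sequences are assumptions made for illustration; the exact counting procedure used by Proteome-pI is not described on this page.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for one-letter protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along each sequence.
        # Pairs are counted within a protein only, never across two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up toy sequences, not taken from the virus proteome.
toy = ["MKKLS", "MLSK"]
freqs = dipeptide_frequencies(toy)
print(f"LS: {freqs['LS']:.3f} per mille")
```

With real data, the input would be the protein sequences of the proteome, and the resulting keys would map onto the table entries (for example, `LS` corresponds to LeuSer).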

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
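The arithmetic behind the expected value can be sketched in Python as follows. The snippet computes the uniform expectation for 400 and 441 dipeptides and compares a few values copied from the table against it. The helper `classify` and the choice of the 400-dipeptide expectation as the cut-off are assumptions made for illustration; the page does not state the exact threshold used for the colouring.

```python
# Uniform expectation: every dipeptide equally likely.
expected_400 = 1000.0 / 20**2  # 2.5 per mille, i.e. 0.25 %
expected_441 = 1000.0 / 21**2  # about 2.268 per mille when Xaa is included


def classify(observed_permille, expected_permille=expected_400):
    """Label a dipeptide relative to the uniform expectation (assumed cut-off)."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"


# Three values copied from the table (in per mille).
for name, value in [("LeuSer", 10.307), ("LysLys", 9.911), ("TrpTrp", 0.0)]:
    # Multiplying by 1e-3 turns a per-mille value into a plain fraction.
    print(name, value, value * 1e-3, classify(value))
```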

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.189 ± 1.229
AlaCys: 1.189 ± 0.389
AlaAsp: 3.171 ± 0.554
AlaGlu: 1.189 ± 0.8
AlaPhe: 1.982 ± 0.696
AlaGly: 0.793 ± 0.303
AlaHis: 0.793 ± 0.601
AlaIle: 4.361 ± 1.491
AlaLys: 4.559 ± 1.624
AlaLeu: 5.352 ± 0.292
AlaMet: 0.595 ± 0.892
AlaAsn: 3.171 ± 0.474
AlaPro: 0.793 ± 0.401
AlaGln: 1.388 ± 0.429
AlaArg: 0.991 ± 0.938
AlaSer: 4.559 ± 0.927
AlaThr: 2.577 ± 0.383
AlaVal: 1.388 ± 0.553
AlaTrp: 0.396 ± 0.437
AlaTyr: 1.189 ± 0.524
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.396 ± 0.186
CysCys: 0.396 ± 0.412
CysAsp: 0.991 ± 0.411
CysGlu: 1.189 ± 0.557
CysPhe: 1.586 ± 0.365
CysGly: 0.991 ± 1.093
CysHis: 0.396 ± 0.262
CysIle: 2.18 ± 1.046
CysLys: 1.189 ± 0.809
CysLeu: 2.973 ± 0.416
CysMet: 0.396 ± 0.412
CysAsn: 1.586 ± 0.765
CysPro: 1.388 ± 0.725
CysGln: 0.595 ± 0.693
CysArg: 0.991 ± 0.567
CysSer: 2.18 ± 1.31
CysThr: 0.991 ± 0.684
CysVal: 1.388 ± 0.304
CysTrp: 0.0 ± 0.0
CysTyr: 0.991 ± 0.404
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.577 ± 1.374
AspCys: 0.991 ± 0.818
AspAsp: 3.766 ± 0.762
AspGlu: 3.568 ± 1.245
AspPhe: 3.766 ± 0.582
AspGly: 1.586 ± 1.203
AspHis: 1.388 ± 0.571
AspIle: 5.748 ± 2.442
AspLys: 3.37 ± 1.062
AspLeu: 6.739 ± 0.64
AspMet: 1.982 ± 0.686
AspAsn: 2.775 ± 0.605
AspPro: 1.784 ± 1.22
AspGln: 1.189 ± 0.443
AspArg: 1.982 ± 0.579
AspSer: 7.532 ± 0.219
AspThr: 3.766 ± 0.922
AspVal: 2.775 ± 0.644
AspTrp: 0.198 ± 0.118
AspTyr: 1.586 ± 0.607
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.18 ± 0.389
GluCys: 1.784 ± 0.453
GluAsp: 5.55 ± 1.878
GluGlu: 6.145 ± 0.967
GluPhe: 2.577 ± 0.32
GluGly: 3.171 ± 0.73
GluHis: 1.189 ± 1.057
GluIle: 5.946 ± 1.819
GluLys: 6.541 ± 1.588
GluLeu: 5.748 ± 1.644
GluMet: 2.18 ± 0.83
GluAsn: 4.757 ± 0.757
GluPro: 0.991 ± 0.472
GluGln: 2.18 ± 0.377
GluArg: 2.18 ± 0.583
GluSer: 4.559 ± 1.739
GluThr: 4.361 ± 1.13
GluVal: 2.775 ± 0.73
GluTrp: 0.793 ± 0.208
GluTyr: 2.577 ± 0.383
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.388 ± 0.455
PheCys: 1.586 ± 0.588
PheAsp: 2.577 ± 0.845
PheGlu: 3.37 ± 0.967
PhePhe: 1.189 ± 0.775
PheGly: 2.973 ± 0.668
PheHis: 0.793 ± 0.303
PheIle: 2.775 ± 0.7
PheLys: 4.757 ± 1.064
PheLeu: 5.352 ± 1.41
PheMet: 0.991 ± 0.404
PheAsn: 2.18 ± 0.615
PhePro: 1.189 ± 0.775
PheGln: 1.388 ± 0.408
PheArg: 1.388 ± 0.553
PheSer: 4.757 ± 0.859
PheThr: 1.388 ± 0.55
PheVal: 1.982 ± 0.984
PheTrp: 0.198 ± 0.219
PheTyr: 1.388 ± 0.881
PheXaa: 0.198 ± 0.118
Gly
GlyAla: 1.586 ± 0.4
GlyCys: 1.388 ± 0.988
GlyAsp: 2.18 ± 0.807
GlyGlu: 2.973 ± 0.834
GlyPhe: 2.775 ± 0.779
GlyGly: 1.784 ± 0.499
GlyHis: 0.396 ± 0.344
GlyIle: 2.775 ± 0.73
GlyLys: 3.766 ± 1.583
GlyLeu: 4.757 ± 1.409
GlyMet: 1.189 ± 0.711
GlyAsn: 2.577 ± 1.377
GlyPro: 0.793 ± 0.371
GlyGln: 0.793 ± 0.339
GlyArg: 1.982 ± 0.861
GlySer: 4.163 ± 1.317
GlyThr: 2.973 ± 0.93
GlyVal: 2.18 ± 1.475
GlyTrp: 0.198 ± 0.118
GlyTyr: 1.388 ± 0.601
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.396 ± 0.262
HisCys: 0.198 ± 0.219
HisAsp: 1.586 ± 0.48
HisGlu: 1.586 ± 0.565
HisPhe: 0.793 ± 0.303
HisGly: 0.991 ± 0.391
HisHis: 0.198 ± 0.459
HisIle: 1.189 ± 0.343
HisLys: 0.793 ± 0.371
HisLeu: 2.18 ± 0.615
HisMet: 0.396 ± 0.237
HisAsn: 1.982 ± 0.438
HisPro: 1.189 ± 0.424
HisGln: 0.396 ± 0.609
HisArg: 0.0 ± 0.0
HisSer: 0.991 ± 0.472
HisThr: 0.793 ± 0.208
HisVal: 0.793 ± 0.474
HisTrp: 0.198 ± 0.118
HisTyr: 0.793 ± 0.409
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.955 ± 0.825
IleCys: 1.388 ± 1.15
IleAsp: 2.973 ± 0.663
IleGlu: 5.748 ± 1.862
IlePhe: 3.171 ± 1.157
IleGly: 2.577 ± 0.679
IleHis: 0.595 ± 0.269
IleIle: 5.55 ± 0.591
IleLys: 8.92 ± 1.336
IleLeu: 6.145 ± 0.903
IleMet: 2.18 ± 0.863
IleAsn: 4.757 ± 0.813
IlePro: 4.559 ± 2.101
IleGln: 2.775 ± 0.905
IleArg: 4.757 ± 0.757
IleSer: 5.55 ± 1.415
IleThr: 5.55 ± 1.712
IleVal: 4.361 ± 0.611
IleTrp: 0.595 ± 0.395
IleTyr: 3.964 ± 0.85
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.766 ± 0.858
LysCys: 1.388 ± 0.785
LysAsp: 3.37 ± 0.366
LysGlu: 5.748 ± 1.668
LysPhe: 3.171 ± 0.869
LysGly: 5.352 ± 1.564
LysHis: 1.388 ± 0.429
LysIle: 6.739 ± 1.184
LysLys: 9.911 ± 1.091
LysLeu: 7.532 ± 0.922
LysMet: 2.379 ± 0.616
LysAsn: 4.163 ± 1.134
LysPro: 2.18 ± 0.807
LysGln: 3.171 ± 2.216
LysArg: 2.18 ± 0.594
LysSer: 9.911 ± 1.683
LysThr: 5.55 ± 0.483
LysVal: 4.559 ± 1.241
LysTrp: 0.793 ± 0.303
LysTyr: 3.37 ± 0.654
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.154 ± 0.98
LeuCys: 1.784 ± 0.453
LeuAsp: 5.55 ± 1.132
LeuGlu: 6.938 ± 1.617
LeuPhe: 4.163 ± 0.809
LeuGly: 4.955 ± 1.031
LeuHis: 1.586 ± 0.366
LeuIle: 7.532 ± 2.229
LeuLys: 6.938 ± 1.281
LeuLeu: 8.127 ± 1.16
LeuMet: 3.37 ± 0.765
LeuAsn: 7.334 ± 1.242
LeuPro: 3.37 ± 1.133
LeuGln: 2.379 ± 0.759
LeuArg: 2.973 ± 1.072
LeuSer: 10.307 ± 1.062
LeuThr: 6.541 ± 0.922
LeuVal: 4.757 ± 0.929
LeuTrp: 0.595 ± 0.49
LeuTyr: 3.568 ± 0.852
LeuXaa: 0.198 ± 0.118
Met
MetAla: 0.198 ± 0.219
MetCys: 0.595 ± 0.623
MetAsp: 0.793 ± 0.795
MetGlu: 2.775 ± 0.927
MetPhe: 0.991 ± 0.483
MetGly: 0.793 ± 0.303
MetHis: 0.396 ± 0.237
MetIle: 2.577 ± 1.053
MetLys: 1.784 ± 0.614
MetLeu: 3.766 ± 0.747
MetMet: 1.388 ± 0.529
MetAsn: 1.982 ± 0.969
MetPro: 1.189 ± 0.389
MetGln: 0.595 ± 0.355
MetArg: 0.991 ± 0.592
MetSer: 2.775 ± 1.281
MetThr: 2.18 ± 0.583
MetVal: 1.189 ± 0.564
MetTrp: 0.198 ± 0.118
MetTyr: 1.586 ± 0.483
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.568 ± 1.025
AsnCys: 1.982 ± 0.466
AsnAsp: 4.361 ± 0.588
AsnGlu: 3.568 ± 1.535
AsnPhe: 2.379 ± 0.778
AsnGly: 3.171 ± 0.576
AsnHis: 0.991 ± 0.492
AsnIle: 4.955 ± 0.745
AsnLys: 5.55 ± 0.889
AsnLeu: 6.343 ± 1.072
AsnMet: 1.586 ± 0.365
AsnAsn: 3.964 ± 1.071
AsnPro: 2.18 ± 0.799
AsnGln: 2.379 ± 0.606
AsnArg: 1.189 ± 0.979
AsnSer: 3.171 ± 0.59
AsnThr: 2.775 ± 0.717
AsnVal: 4.757 ± 0.856
AsnTrp: 0.793 ± 0.823
AsnTyr: 2.577 ± 0.845
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.793 ± 0.323
ProCys: 0.396 ± 0.237
ProAsp: 1.982 ± 0.603
ProGlu: 3.171 ± 1.016
ProPhe: 1.586 ± 0.447
ProGly: 1.189 ± 0.443
ProHis: 0.0 ± 0.0
ProIle: 3.171 ± 1.059
ProLys: 2.577 ± 0.704
ProLeu: 3.568 ± 1.341
ProMet: 0.396 ± 0.437
ProAsn: 2.379 ± 1.329
ProPro: 0.396 ± 0.473
ProGln: 1.189 ± 0.767
ProArg: 0.595 ± 0.394
ProSer: 1.784 ± 0.676
ProThr: 1.982 ± 1.182
ProVal: 2.18 ± 0.623
ProTrp: 0.0 ± 0.0
ProTyr: 0.991 ± 0.331
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.577 ± 1.509
GlnCys: 0.595 ± 0.388
GlnAsp: 2.18 ± 0.791
GlnGlu: 1.189 ± 1.006
GlnPhe: 1.586 ± 0.646
GlnGly: 1.189 ± 0.689
GlnHis: 0.396 ± 0.262
GlnIle: 2.577 ± 0.455
GlnLys: 2.18 ± 0.628
GlnLeu: 2.379 ± 1.052
GlnMet: 0.595 ± 0.222
GlnAsn: 2.379 ± 1.03
GlnPro: 0.0 ± 0.0
GlnGln: 0.595 ± 0.388
GlnArg: 1.189 ± 0.502
GlnSer: 2.973 ± 1.237
GlnThr: 2.775 ± 0.832
GlnVal: 1.189 ± 0.703
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.396 ± 0.473
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.991 ± 1.273
ArgCys: 0.396 ± 0.437
ArgAsp: 2.577 ± 1.006
ArgGlu: 3.37 ± 1.139
ArgPhe: 0.595 ± 0.259
ArgGly: 0.991 ± 0.472
ArgHis: 1.784 ± 0.853
ArgIle: 2.973 ± 0.84
ArgLys: 1.586 ± 0.483
ArgLeu: 4.163 ± 0.812
ArgMet: 0.595 ± 0.395
ArgAsn: 2.577 ± 0.845
ArgPro: 0.396 ± 0.186
ArgGln: 1.388 ± 0.508
ArgArg: 0.793 ± 0.539
ArgSer: 2.973 ± 0.882
ArgThr: 1.982 ± 0.941
ArgVal: 2.18 ± 0.384
ArgTrp: 0.595 ± 0.222
ArgTyr: 1.388 ± 0.376
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.18 ± 0.427
SerCys: 2.18 ± 0.888
SerAsp: 5.946 ± 1.124
SerGlu: 5.946 ± 1.295
SerPhe: 5.154 ± 1.062
SerGly: 3.766 ± 1.4
SerHis: 1.586 ± 0.512
SerIle: 7.136 ± 1.961
SerLys: 7.532 ± 1.017
SerLeu: 9.911 ± 1.353
SerMet: 2.973 ± 0.708
SerAsn: 2.973 ± 0.648
SerPro: 2.973 ± 0.902
SerGln: 2.379 ± 1.052
SerArg: 4.163 ± 0.811
SerSer: 6.938 ± 1.245
SerThr: 5.748 ± 1.526
SerVal: 6.541 ± 2.035
SerTrp: 1.189 ± 0.97
SerTyr: 4.361 ± 1.197
SerXaa: 0.198 ± 0.118
Thr
ThrAla: 3.37 ± 0.74
ThrCys: 1.784 ± 0.753
ThrAsp: 3.171 ± 0.514
ThrGlu: 3.37 ± 0.879
ThrPhe: 2.577 ± 1.213
ThrGly: 2.775 ± 0.385
ThrHis: 0.991 ± 0.592
ThrIle: 4.955 ± 0.754
ThrLys: 5.352 ± 1.387
ThrLeu: 4.361 ± 1.324
ThrMet: 2.577 ± 0.66
ThrAsn: 3.171 ± 0.845
ThrPro: 1.189 ± 0.512
ThrGln: 2.18 ± 0.902
ThrArg: 2.379 ± 0.498
ThrSer: 6.343 ± 0.885
ThrThr: 2.775 ± 0.821
ThrVal: 3.964 ± 0.56
ThrTrp: 0.595 ± 0.395
ThrTyr: 2.577 ± 0.999
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.973 ± 1.451
ValCys: 1.586 ± 0.597
ValAsp: 3.171 ± 0.634
ValGlu: 3.37 ± 0.451
ValPhe: 1.388 ± 0.402
ValGly: 2.379 ± 0.338
ValHis: 1.388 ± 0.719
ValIle: 4.757 ± 2.167
ValLys: 4.757 ± 1.217
ValLeu: 3.964 ± 0.732
ValMet: 1.189 ± 0.421
ValAsn: 4.361 ± 0.874
ValPro: 1.784 ± 0.753
ValGln: 0.991 ± 0.391
ValArg: 1.784 ± 0.614
ValSer: 4.757 ± 0.975
ValThr: 2.973 ± 0.621
ValVal: 2.577 ± 0.673
ValTrp: 0.595 ± 0.269
ValTyr: 2.775 ± 0.81
ValXaa: 0.198 ± 0.118
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.396 ± 0.186
TrpAsp: 0.991 ± 0.411
TrpGlu: 0.198 ± 0.118
TrpPhe: 0.396 ± 0.186
TrpGly: 0.396 ± 0.344
TrpHis: 0.0 ± 0.0
TrpIle: 0.793 ± 0.208
TrpLys: 1.189 ± 0.389
TrpLeu: 1.189 ± 0.435
TrpMet: 0.198 ± 0.118
TrpAsn: 0.198 ± 0.118
TrpPro: 0.0 ± 0.0
TrpGln: 0.198 ± 0.118
TrpArg: 0.0 ± 0.0
TrpSer: 0.991 ± 0.776
TrpThr: 0.595 ± 0.394
TrpVal: 0.198 ± 0.459
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.396 ± 0.186
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.388 ± 0.457
TyrCys: 0.793 ± 0.371
TyrAsp: 2.18 ± 0.557
TyrGlu: 2.775 ± 0.178
TyrPhe: 1.982 ± 0.617
TyrGly: 0.793 ± 0.47
TyrHis: 1.189 ± 0.711
TyrIle: 2.577 ± 0.546
TyrLys: 3.171 ± 0.554
TyrLeu: 3.37 ± 0.456
TyrMet: 1.189 ± 0.343
TyrAsn: 2.973 ± 0.732
TyrPro: 1.982 ± 0.617
TyrGln: 0.793 ± 0.539
TyrArg: 1.784 ± 0.665
TyrSer: 4.559 ± 0.266
TyrThr: 1.982 ± 0.476
TyrVal: 1.982 ± 0.807
TyrTrp: 0.396 ± 0.344
TyrTyr: 1.189 ± 0.512
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.198 ± 0.118
XaaLeu: 0.198 ± 0.118
XaaMet: 0.198 ± 0.118
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.198 ± 0.118
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (5046 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
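A protein-level bootstrap of this kind could look roughly like the Python sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the estimates is taken as the error. The function names, the fixed seed, the toy sequences, and the use of the sample standard deviation as the reported error are all assumptions made for illustration; the page does not specify these details.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Made-up toy sequences, not taken from the virus proteome.
toy = ["MKKLSA", "MLSKK", "MAKLL", "MSSKL", "MKLSK"]
err = bootstrap_error(toy, "LS")
print(f"LS: {pair_permille(toy, 'LS'):.3f} ± {err:.3f} per mille")
```

With only a handful of proteins, as in this proteome, each resample can differ considerably from the original set, which is consistent with the relatively large errors in the table.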

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski