Amino acid dipepetide frequency for Vesicular stomatitis Indiana virus (strain San Juan) (VSIV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.207AlaAla: 5.207 ± 2.288
0.744AlaCys: 0.744 ± 0.304
4.215AlaAsp: 4.215 ± 1.552
1.984AlaGlu: 1.984 ± 1.024
1.984AlaPhe: 1.984 ± 0.823
1.984AlaGly: 1.984 ± 0.807
0.744AlaHis: 0.744 ± 0.447
1.984AlaIle: 1.984 ± 0.558
1.984AlaLys: 1.984 ± 0.722
4.463AlaLeu: 4.463 ± 1.173
0.992AlaMet: 0.992 ± 0.596
1.984AlaAsn: 1.984 ± 0.946
3.967AlaPro: 3.967 ± 2.183
1.984AlaGln: 1.984 ± 0.651
2.727AlaArg: 2.727 ± 0.689
3.719AlaSer: 3.719 ± 0.942
2.975AlaThr: 2.975 ± 0.513
5.207AlaVal: 5.207 ± 1.465
0.744AlaTrp: 0.744 ± 0.812
2.232AlaTyr: 2.232 ± 0.5
0.0AlaXaa: 0.0 ± 0.0
Cys
0.992CysAla: 0.992 ± 0.619
0.248CysCys: 0.248 ± 0.149
0.744CysAsp: 0.744 ± 0.532
0.992CysGlu: 0.992 ± 0.726
0.248CysPhe: 0.248 ± 0.149
0.496CysGly: 0.496 ± 0.258
0.248CysHis: 0.248 ± 0.293
0.744CysIle: 0.744 ± 0.304
2.48CysLys: 2.48 ± 0.982
0.992CysLeu: 0.992 ± 0.403
0.496CysMet: 0.496 ± 0.645
0.496CysAsn: 0.496 ± 0.298
1.24CysPro: 1.24 ± 0.783
0.992CysGln: 0.992 ± 0.403
0.992CysArg: 0.992 ± 0.596
0.992CysSer: 0.992 ± 0.652
0.744CysThr: 0.744 ± 0.376
0.744CysVal: 0.744 ± 0.344
0.496CysTrp: 0.496 ± 0.298
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.488AspAla: 1.488 ± 0.475
0.248AspCys: 0.248 ± 0.344
4.463AspAsp: 4.463 ± 0.649
4.959AspGlu: 4.959 ± 1.492
2.727AspPhe: 2.727 ± 0.463
3.719AspGly: 3.719 ± 0.882
2.232AspHis: 2.232 ± 1.382
3.223AspIle: 3.223 ± 0.937
3.719AspLys: 3.719 ± 1.051
6.199AspLeu: 6.199 ± 1.858
1.488AspMet: 1.488 ± 0.685
2.48AspAsn: 2.48 ± 0.918
3.223AspPro: 3.223 ± 0.629
1.984AspGln: 1.984 ± 0.688
0.992AspArg: 0.992 ± 0.596
4.463AspSer: 4.463 ± 0.522
2.727AspThr: 2.727 ± 0.816
4.463AspVal: 4.463 ± 1.251
0.992AspTrp: 0.992 ± 0.489
2.727AspTyr: 2.727 ± 0.587
0.0AspXaa: 0.0 ± 0.0
Glu
3.471GluAla: 3.471 ± 1.922
0.744GluCys: 0.744 ± 0.603
3.471GluAsp: 3.471 ± 0.873
1.984GluGlu: 1.984 ± 0.74
2.232GluPhe: 2.232 ± 0.822
3.967GluGly: 3.967 ± 1.098
1.736GluHis: 1.736 ± 1.184
3.471GluIle: 3.471 ± 1.063
4.711GluLys: 4.711 ± 1.826
4.959GluLeu: 4.959 ± 1.096
2.232GluMet: 2.232 ± 0.474
1.24GluAsn: 1.24 ± 0.745
1.488GluPro: 1.488 ± 0.51
2.232GluGln: 2.232 ± 0.667
2.232GluArg: 2.232 ± 0.643
3.719GluSer: 3.719 ± 0.727
3.223GluThr: 3.223 ± 1.327
2.727GluVal: 2.727 ± 1.084
1.488GluTrp: 1.488 ± 0.682
3.471GluTyr: 3.471 ± 1.25
0.0GluXaa: 0.0 ± 0.0
Phe
1.488PheAla: 1.488 ± 0.608
0.248PheCys: 0.248 ± 0.344
1.24PheAsp: 1.24 ± 0.941
2.48PheGlu: 2.48 ± 0.719
3.471PhePhe: 3.471 ± 1.76
3.471PheGly: 3.471 ± 1.067
0.496PheHis: 0.496 ± 0.688
2.727PheIle: 2.727 ± 0.814
3.967PheLys: 3.967 ± 1.023
5.703PheLeu: 5.703 ± 2.019
0.0PheMet: 0.0 ± 0.0
3.471PheAsn: 3.471 ± 1.261
2.48PhePro: 2.48 ± 1.089
1.736PheGln: 1.736 ± 1.092
4.711PheArg: 4.711 ± 2.42
3.719PheSer: 3.719 ± 0.709
2.232PheThr: 2.232 ± 1.125
1.24PheVal: 1.24 ± 0.745
0.496PheTrp: 0.496 ± 0.31
0.992PheTyr: 0.992 ± 0.726
0.0PheXaa: 0.0 ± 0.0
Gly
1.984GlyAla: 1.984 ± 0.538
0.0GlyCys: 0.0 ± 0.0
3.223GlyAsp: 3.223 ± 0.896
2.48GlyGlu: 2.48 ± 1.417
1.984GlyPhe: 1.984 ± 0.558
2.48GlyGly: 2.48 ± 0.652
0.744GlyHis: 0.744 ± 0.336
3.471GlyIle: 3.471 ± 1.077
5.703GlyLys: 5.703 ± 1.866
10.414GlyLeu: 10.414 ± 1.504
2.727GlyMet: 2.727 ± 0.549
1.736GlyAsn: 1.736 ± 0.582
2.975GlyPro: 2.975 ± 0.909
2.48GlyGln: 2.48 ± 0.611
3.719GlyArg: 3.719 ± 0.776
3.223GlySer: 3.223 ± 0.597
4.215GlyThr: 4.215 ± 1.421
4.463GlyVal: 4.463 ± 0.717
1.984GlyTrp: 1.984 ± 0.573
1.488GlyTyr: 1.488 ± 0.475
0.0GlyXaa: 0.0 ± 0.0
His
1.736HisAla: 1.736 ± 0.799
0.992HisCys: 0.992 ± 0.726
1.488HisAsp: 1.488 ± 0.894
1.488HisGlu: 1.488 ± 0.671
3.223HisPhe: 3.223 ± 2.141
1.488HisGly: 1.488 ± 0.43
0.248HisHis: 0.248 ± 0.293
1.488HisIle: 1.488 ± 0.66
1.736HisLys: 1.736 ± 0.531
1.984HisLeu: 1.984 ± 0.573
0.992HisMet: 0.992 ± 0.854
1.24HisAsn: 1.24 ± 0.893
1.24HisPro: 1.24 ± 0.362
0.248HisGln: 0.248 ± 0.149
2.48HisArg: 2.48 ± 0.779
1.488HisSer: 1.488 ± 0.66
1.24HisThr: 1.24 ± 1.039
1.736HisVal: 1.736 ± 0.551
1.24HisTrp: 1.24 ± 0.799
1.24HisTyr: 1.24 ± 0.526
0.0HisXaa: 0.0 ± 0.0
Ile
2.232IleAla: 2.232 ± 0.763
1.488IleCys: 1.488 ± 0.66
4.463IleAsp: 4.463 ± 1.205
4.215IleGlu: 4.215 ± 0.592
1.984IlePhe: 1.984 ± 0.826
5.951IleGly: 5.951 ± 1.549
1.984IleHis: 1.984 ± 0.68
2.232IleIle: 2.232 ± 0.765
3.471IleLys: 3.471 ± 0.969
6.199IleLeu: 6.199 ± 0.998
1.24IleMet: 1.24 ± 0.56
2.975IleAsn: 2.975 ± 0.906
3.719IlePro: 3.719 ± 1.961
2.975IleGln: 2.975 ± 1.065
5.207IleArg: 5.207 ± 2.092
4.959IleSer: 4.959 ± 0.942
2.975IleThr: 2.975 ± 0.7
3.223IleVal: 3.223 ± 0.77
0.744IleTrp: 0.744 ± 0.812
2.975IleTyr: 2.975 ± 0.819
0.0IleXaa: 0.0 ± 0.0
Lys
5.455LysAla: 5.455 ± 2.962
0.992LysCys: 0.992 ± 0.517
3.223LysAsp: 3.223 ± 1.138
3.471LysGlu: 3.471 ± 0.802
3.471LysPhe: 3.471 ± 1.305
3.967LysGly: 3.967 ± 1.2
1.24LysHis: 1.24 ± 0.679
4.463LysIle: 4.463 ± 0.864
5.703LysLys: 5.703 ± 1.177
6.199LysLeu: 6.199 ± 1.706
3.719LysMet: 3.719 ± 1.375
2.232LysAsn: 2.232 ± 0.723
1.24LysPro: 1.24 ± 0.801
0.744LysGln: 0.744 ± 0.304
3.223LysArg: 3.223 ± 0.861
6.943LysSer: 6.943 ± 1.675
4.215LysThr: 4.215 ± 0.908
2.48LysVal: 2.48 ± 0.857
1.984LysTrp: 1.984 ± 0.68
2.48LysTyr: 2.48 ± 0.797
0.0LysXaa: 0.0 ± 0.0
Leu
6.199LeuAla: 6.199 ± 0.793
1.984LeuCys: 1.984 ± 0.701
5.207LeuAsp: 5.207 ± 0.696
3.719LeuGlu: 3.719 ± 1.126
3.719LeuPhe: 3.719 ± 1.054
5.455LeuGly: 5.455 ± 0.799
2.48LeuHis: 2.48 ± 1.231
7.935LeuIle: 7.935 ± 1.895
8.182LeuLys: 8.182 ± 1.278
6.199LeuLeu: 6.199 ± 2.202
4.215LeuMet: 4.215 ± 0.821
4.711LeuAsn: 4.711 ± 0.682
3.967LeuPro: 3.967 ± 1.031
1.984LeuGln: 1.984 ± 0.702
5.455LeuArg: 5.455 ± 0.479
7.439LeuSer: 7.439 ± 1.404
5.951LeuThr: 5.951 ± 1.451
2.727LeuVal: 2.727 ± 0.778
0.744LeuTrp: 0.744 ± 0.304
5.455LeuTyr: 5.455 ± 1.569
0.0LeuXaa: 0.0 ± 0.0
Met
1.736MetAla: 1.736 ± 0.652
0.248MetCys: 0.248 ± 0.293
2.232MetAsp: 2.232 ± 0.673
1.736MetGlu: 1.736 ± 0.63
1.736MetPhe: 1.736 ± 0.798
1.984MetGly: 1.984 ± 0.666
0.992MetHis: 0.992 ± 0.524
2.48MetIle: 2.48 ± 1.195
0.992MetLys: 0.992 ± 0.35
3.223MetLeu: 3.223 ± 0.597
0.744MetMet: 0.744 ± 0.637
0.992MetAsn: 0.992 ± 0.35
0.992MetPro: 0.992 ± 0.629
0.992MetGln: 0.992 ± 0.307
1.24MetArg: 1.24 ± 0.528
3.967MetSer: 3.967 ± 1.072
2.727MetThr: 2.727 ± 1.385
1.736MetVal: 1.736 ± 0.551
0.496MetTrp: 0.496 ± 0.298
0.744MetTyr: 0.744 ± 0.812
0.0MetXaa: 0.0 ± 0.0
Asn
1.736AsnAla: 1.736 ± 0.763
0.248AsnCys: 0.248 ± 0.293
1.488AsnAsp: 1.488 ± 0.435
1.984AsnGlu: 1.984 ± 0.728
0.744AsnPhe: 0.744 ± 0.447
1.984AsnGly: 1.984 ± 0.843
1.736AsnHis: 1.736 ± 1.043
2.727AsnIle: 2.727 ± 0.653
0.744AsnLys: 0.744 ± 0.344
4.711AsnLeu: 4.711 ± 0.882
0.0AsnMet: 0.0 ± 0.0
0.992AsnAsn: 0.992 ± 0.596
3.471AsnPro: 3.471 ± 1.162
3.471AsnGln: 3.471 ± 1.122
2.232AsnArg: 2.232 ± 0.692
2.975AsnSer: 2.975 ± 0.513
2.232AsnThr: 2.232 ± 0.785
2.232AsnVal: 2.232 ± 0.5
1.488AsnTrp: 1.488 ± 0.608
2.48AsnTyr: 2.48 ± 0.6
0.0AsnXaa: 0.0 ± 0.0
Pro
3.223ProAla: 3.223 ± 0.688
0.496ProCys: 0.496 ± 0.298
3.967ProAsp: 3.967 ± 1.256
4.463ProGlu: 4.463 ± 2.166
3.967ProPhe: 3.967 ± 1.945
0.992ProGly: 0.992 ± 0.549
2.48ProHis: 2.48 ± 0.936
4.463ProIle: 4.463 ± 1.147
1.984ProLys: 1.984 ± 0.68
4.711ProLeu: 4.711 ± 1.14
1.984ProMet: 1.984 ± 1.452
1.984ProAsn: 1.984 ± 0.621
3.719ProPro: 3.719 ± 0.947
0.992ProGln: 0.992 ± 0.307
0.744ProArg: 0.744 ± 0.447
4.711ProSer: 4.711 ± 1.106
2.48ProThr: 2.48 ± 0.956
1.736ProVal: 1.736 ± 0.363
0.496ProTrp: 0.496 ± 0.298
1.24ProTyr: 1.24 ± 0.739
0.0ProXaa: 0.0 ± 0.0
Gln
1.488GlnAla: 1.488 ± 1.109
0.992GlnCys: 0.992 ± 0.307
1.488GlnAsp: 1.488 ± 0.493
1.488GlnGlu: 1.488 ± 0.351
1.736GlnPhe: 1.736 ± 0.582
4.215GlnGly: 4.215 ± 0.744
0.496GlnHis: 0.496 ± 0.298
2.232GlnIle: 2.232 ± 0.782
2.232GlnLys: 2.232 ± 1.256
1.984GlnLeu: 1.984 ± 0.578
1.488GlnMet: 1.488 ± 0.908
1.736GlnAsn: 1.736 ± 0.615
1.984GlnPro: 1.984 ± 1.084
1.488GlnGln: 1.488 ± 0.619
0.744GlnArg: 0.744 ± 0.376
2.48GlnSer: 2.48 ± 0.725
1.736GlnThr: 1.736 ± 0.698
2.727GlnVal: 2.727 ± 0.823
0.248GlnTrp: 0.248 ± 0.418
0.992GlnTyr: 0.992 ± 0.307
0.0GlnXaa: 0.0 ± 0.0
Arg
2.727ArgAla: 2.727 ± 0.666
0.496ArgCys: 0.496 ± 0.298
2.48ArgAsp: 2.48 ± 1.087
3.967ArgGlu: 3.967 ± 0.399
1.984ArgPhe: 1.984 ± 0.68
4.215ArgGly: 4.215 ± 1.258
0.496ArgHis: 0.496 ± 0.298
3.471ArgIle: 3.471 ± 0.996
1.736ArgLys: 1.736 ± 0.941
2.975ArgLeu: 2.975 ± 0.859
2.727ArgMet: 2.727 ± 0.961
2.48ArgAsn: 2.48 ± 1.231
3.471ArgPro: 3.471 ± 2.043
2.48ArgGln: 2.48 ± 0.624
1.984ArgArg: 1.984 ± 0.666
4.711ArgSer: 4.711 ± 0.845
3.223ArgThr: 3.223 ± 0.48
2.232ArgVal: 2.232 ± 0.432
0.992ArgTrp: 0.992 ± 0.403
1.736ArgTyr: 1.736 ± 0.937
0.0ArgXaa: 0.0 ± 0.0
Ser
2.727SerAla: 2.727 ± 0.499
2.232SerCys: 2.232 ± 0.65
5.951SerAsp: 5.951 ± 1.601
3.719SerGlu: 3.719 ± 1.14
2.727SerPhe: 2.727 ± 0.772
4.711SerGly: 4.711 ± 0.512
3.967SerHis: 3.967 ± 1.173
4.959SerIle: 4.959 ± 0.892
5.455SerLys: 5.455 ± 2.11
9.67SerLeu: 9.67 ± 1.123
1.736SerMet: 1.736 ± 0.415
4.215SerAsn: 4.215 ± 1.57
2.48SerPro: 2.48 ± 0.963
3.223SerGln: 3.223 ± 0.891
3.967SerArg: 3.967 ± 1.099
9.174SerSer: 9.174 ± 1.421
4.215SerThr: 4.215 ± 1.301
2.975SerVal: 2.975 ± 0.606
2.232SerTrp: 2.232 ± 1.086
3.471SerTyr: 3.471 ± 0.659
0.0SerXaa: 0.0 ± 0.0
Thr
2.232ThrAla: 2.232 ± 0.723
1.24ThrCys: 1.24 ± 0.526
1.488ThrAsp: 1.488 ± 0.608
1.736ThrGlu: 1.736 ± 0.363
2.727ThrPhe: 2.727 ± 0.73
3.967ThrGly: 3.967 ± 1.22
2.48ThrHis: 2.48 ± 0.676
6.199ThrIle: 6.199 ± 1.287
2.727ThrLys: 2.727 ± 0.736
2.727ThrLeu: 2.727 ± 1.057
2.232ThrMet: 2.232 ± 0.73
1.736ThrAsn: 1.736 ± 0.763
3.967ThrPro: 3.967 ± 1.604
1.24ThrGln: 1.24 ± 0.774
1.984ThrArg: 1.984 ± 0.68
5.455ThrSer: 5.455 ± 1.134
3.967ThrThr: 3.967 ± 1.226
4.463ThrVal: 4.463 ± 1.0
1.736ThrTrp: 1.736 ± 0.709
2.48ThrTyr: 2.48 ± 1.326
0.0ThrXaa: 0.0 ± 0.0
Val
1.736ValAla: 1.736 ± 0.798
1.24ValCys: 1.24 ± 0.52
3.471ValAsp: 3.471 ± 1.125
4.463ValGlu: 4.463 ± 0.85
1.488ValPhe: 1.488 ± 0.493
2.48ValGly: 2.48 ± 1.116
1.736ValHis: 1.736 ± 0.799
3.719ValIle: 3.719 ± 1.342
3.719ValLys: 3.719 ± 0.314
4.711ValLeu: 4.711 ± 1.123
1.24ValMet: 1.24 ± 0.459
1.24ValAsn: 1.24 ± 0.346
3.223ValPro: 3.223 ± 0.48
1.488ValGln: 1.488 ± 0.351
3.223ValArg: 3.223 ± 0.67
5.207ValSer: 5.207 ± 0.889
2.727ValThr: 2.727 ± 0.666
1.24ValVal: 1.24 ± 0.801
0.744ValTrp: 0.744 ± 0.603
1.24ValTyr: 1.24 ± 0.459
0.0ValXaa: 0.0 ± 0.0
Trp
0.496TrpAla: 0.496 ± 0.258
0.0TrpCys: 0.0 ± 0.0
2.48TrpAsp: 2.48 ± 1.417
0.992TrpGlu: 0.992 ± 0.596
1.24TrpPhe: 1.24 ± 0.491
1.984TrpGly: 1.984 ± 0.639
0.744TrpHis: 0.744 ± 0.532
0.992TrpIle: 0.992 ± 0.43
2.232TrpLys: 2.232 ± 0.711
1.736TrpLeu: 1.736 ± 0.593
0.496TrpMet: 0.496 ± 0.258
0.496TrpAsn: 0.496 ± 0.368
0.248TrpPro: 0.248 ± 0.149
0.248TrpGln: 0.248 ± 0.149
0.744TrpArg: 0.744 ± 0.447
1.984TrpSer: 1.984 ± 0.43
0.992TrpThr: 0.992 ± 0.596
1.24TrpVal: 1.24 ± 1.065
0.0TrpTrp: 0.0 ± 0.0
0.248TrpTyr: 0.248 ± 0.293
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.471TyrAla: 3.471 ± 1.284
0.496TyrCys: 0.496 ± 0.585
1.736TyrAsp: 1.736 ± 1.533
2.48TyrGlu: 2.48 ± 1.054
2.975TyrPhe: 2.975 ± 0.326
1.984TyrGly: 1.984 ± 0.639
1.736TyrHis: 1.736 ± 0.652
2.232TyrIle: 2.232 ± 1.03
3.967TyrLys: 3.967 ± 1.428
4.215TyrLeu: 4.215 ± 0.46
0.992TyrMet: 0.992 ± 0.307
1.24TyrAsn: 1.24 ± 0.534
1.736TyrPro: 1.736 ± 0.807
0.992TyrGln: 0.992 ± 0.43
1.984TyrArg: 1.984 ± 0.826
2.48TyrSer: 2.48 ± 0.96
1.984TyrThr: 1.984 ± 0.891
0.744TyrVal: 0.744 ± 0.344
0.248TyrTrp: 0.248 ± 0.418
0.992TyrTyr: 0.992 ± 0.596
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (4034 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski