Amino acid dipeptide frequency for Salanga virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
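The calculation described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the code used to build the table: it assumes that dipeptides are counted as overlapping residue pairs within each protein, that any residue outside the 20 standard one-letter codes is mapped to X (Xaa), and that "expected" means the uniform 1/400 baseline. The names `dipeptide_per_mille` and `label` are hypothetical.

```python
from collections import Counter
from itertools import product

# One-letter codes of the 20 standard amino acids; "X" stands for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"

# Uniform expectation from the text: 1/400 = 0.25% = 2.5 per mille.
EXPECTED = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: pairs overlap (residues i and i+1) and never span
    two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def label(freq, expected=EXPECTED):
    """Classify a per-mille frequency against the uniform expectation."""
    if freq > expected:
        return "over"   # would be marked in red
    if freq < expected:
        return "under"  # would be marked in blue
    return "as expected"


if __name__ == "__main__":
    freqs = dipeptide_per_mille(["MKSSLLA", "MSSAK"])  # toy sequences
    for pair in ("SS", "LL", "WW"):
        print(pair, round(freqs[pair], 3), label(freqs[pair]))
```

Under this convention a proteome of 4 proteins and 4035 residues yields 4035 - 4 = 4031 pairs, so a dipeptide seen once would have a frequency of about 0.248‰.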

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions; for example, 5.206‰ corresponds to 0.005206 (0.5206%).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.206 ± 0.657
AlaCys: 1.239 ± 0.768
AlaAsp: 1.983 ± 0.947
AlaGlu: 4.214 ± 0.73
AlaPhe: 1.735 ± 1.847
AlaGly: 2.231 ± 0.629
AlaHis: 0.992 ± 0.767
AlaIle: 3.223 ± 0.92
AlaLys: 4.214 ± 2.996
AlaLeu: 5.702 ± 1.584
AlaMet: 2.231 ± 0.591
AlaAsn: 0.496 ± 0.145
AlaPro: 1.735 ± 0.358
AlaGln: 2.479 ± 0.643
AlaArg: 3.966 ± 0.909
AlaSer: 3.471 ± 0.908
AlaThr: 2.231 ± 1.103
AlaVal: 6.197 ± 1.188
AlaTrp: 0.248 ± 0.157
AlaTyr: 1.487 ± 1.151
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.496 ± 0.315
CysCys: 0.248 ± 0.22
CysAsp: 1.487 ± 0.779
CysGlu: 1.735 ± 0.453
CysPhe: 1.487 ± 0.435
CysGly: 1.239 ± 0.768
CysHis: 0.496 ± 0.44
CysIle: 1.487 ± 0.823
CysLys: 2.975 ± 0.869
CysLeu: 1.983 ± 0.517
CysMet: 1.487 ± 0.329
CysAsn: 0.992 ± 0.29
CysPro: 0.992 ± 0.551
CysGln: 0.496 ± 0.44
CysArg: 0.744 ± 0.208
CysSer: 2.727 ± 1.143
CysThr: 1.487 ± 0.675
CysVal: 2.727 ± 1.7
CysTrp: 0.248 ± 0.22
CysTyr: 1.239 ± 0.323
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.231 ± 0.449
AspCys: 1.487 ± 0.779
AspAsp: 3.966 ± 0.909
AspGlu: 4.214 ± 1.187
AspPhe: 3.966 ± 1.296
AspGly: 3.471 ± 0.494
AspHis: 1.239 ± 0.486
AspIle: 5.454 ± 1.532
AspLys: 3.471 ± 0.667
AspLeu: 6.197 ± 1.651
AspMet: 3.718 ± 1.146
AspAsn: 3.223 ± 0.536
AspPro: 2.975 ± 1.017
AspGln: 1.239 ± 0.323
AspArg: 2.975 ± 0.736
AspSer: 4.71 ± 1.688
AspThr: 3.718 ± 0.965
AspVal: 2.231 ± 0.678
AspTrp: 0.496 ± 0.145
AspTyr: 1.487 ± 0.329
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.983 ± 0.678
GluCys: 1.983 ± 0.579
GluAsp: 3.966 ± 1.833
GluGlu: 4.958 ± 1.046
GluPhe: 3.966 ± 0.536
GluGly: 3.471 ± 0.605
GluHis: 1.239 ± 1.166
GluIle: 3.966 ± 1.539
GluLys: 3.966 ± 1.159
GluLeu: 4.462 ± 0.564
GluMet: 0.992 ± 0.29
GluAsn: 3.966 ± 0.862
GluPro: 2.975 ± 0.949
GluGln: 2.479 ± 0.787
GluArg: 3.718 ± 0.767
GluSer: 3.471 ± 0.494
GluThr: 3.718 ± 1.458
GluVal: 5.454 ± 1.486
GluTrp: 0.496 ± 0.41
GluTyr: 1.735 ± 1.102
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.223 ± 2.116
PheCys: 0.992 ± 0.551
PheAsp: 2.975 ± 0.247
PheGlu: 2.479 ± 0.645
PhePhe: 2.727 ± 0.626
PheGly: 1.983 ± 0.405
PheHis: 0.496 ± 0.145
PheIle: 2.231 ± 1.013
PheLys: 3.718 ± 0.597
PheLeu: 4.958 ± 0.367
PheMet: 0.992 ± 0.426
PheAsn: 2.975 ± 0.714
PhePro: 2.727 ± 1.958
PheGln: 0.0 ± 0.0
PheArg: 2.727 ± 0.877
PheSer: 1.983 ± 0.405
PheThr: 2.727 ± 0.726
PheVal: 2.727 ± 0.877
PheTrp: 1.239 ± 0.43
PheTyr: 0.248 ± 0.157
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.471 ± 0.617
GlyCys: 2.231 ± 0.75
GlyAsp: 3.471 ± 0.598
GlyGlu: 1.983 ± 0.405
GlyPhe: 3.471 ± 0.866
GlyGly: 2.231 ± 0.823
GlyHis: 1.983 ± 0.686
GlyIle: 4.71 ± 1.859
GlyLys: 3.718 ± 1.778
GlyLeu: 4.71 ± 0.553
GlyMet: 1.735 ± 0.643
GlyAsn: 2.231 ± 1.06
GlyPro: 1.487 ± 0.675
GlyGln: 1.487 ± 0.458
GlyArg: 2.479 ± 0.787
GlySer: 6.941 ± 0.579
GlyThr: 2.231 ± 0.591
GlyVal: 3.718 ± 0.081
GlyTrp: 0.496 ± 0.145
GlyTyr: 1.735 ± 0.609
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.496 ± 0.563
HisAsp: 0.992 ± 0.339
HisGlu: 0.496 ± 0.315
HisPhe: 1.487 ± 0.675
HisGly: 2.231 ± 0.718
HisHis: 0.0 ± 0.0
HisIle: 1.487 ± 0.415
HisLys: 0.992 ± 0.339
HisLeu: 1.983 ± 0.857
HisMet: 0.744 ± 0.472
HisAsn: 1.239 ± 0.323
HisPro: 0.744 ± 0.548
HisGln: 1.487 ± 0.728
HisArg: 2.231 ± 0.616
HisSer: 2.727 ± 1.033
HisThr: 0.992 ± 0.339
HisVal: 1.983 ± 0.686
HisTrp: 0.0 ± 0.0
HisTyr: 1.239 ± 0.486
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.718 ± 2.437
IleCys: 1.983 ± 0.844
IleAsp: 3.966 ± 1.013
IleGlu: 4.214 ± 0.803
IlePhe: 3.223 ± 0.511
IleGly: 2.975 ± 0.773
IleHis: 1.487 ± 0.435
IleIle: 4.214 ± 0.709
IleLys: 2.975 ± 0.831
IleLeu: 5.206 ± 1.448
IleMet: 1.239 ± 0.327
IleAsn: 3.718 ± 1.122
IlePro: 2.727 ± 0.892
IleGln: 2.231 ± 1.093
IleArg: 3.471 ± 1.218
IleSer: 6.693 ± 1.311
IleThr: 2.727 ± 0.519
IleVal: 2.231 ± 0.948
IleTrp: 0.744 ± 0.364
IleTyr: 0.992 ± 0.63
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.471 ± 0.742
LysCys: 1.487 ± 0.675
LysAsp: 4.462 ± 1.205
LysGlu: 4.71 ± 0.742
LysPhe: 1.735 ± 0.333
LysGly: 2.975 ± 0.639
LysHis: 0.744 ± 0.208
LysIle: 1.983 ± 0.857
LysLys: 2.727 ± 0.877
LysLeu: 5.702 ± 1.694
LysMet: 3.471 ± 1.444
LysAsn: 2.231 ± 0.38
LysPro: 2.479 ± 0.972
LysGln: 3.718 ± 1.11
LysArg: 3.223 ± 1.2
LysSer: 5.702 ± 0.82
LysThr: 2.727 ± 0.3
LysVal: 4.71 ± 1.158
LysTrp: 1.983 ± 0.686
LysTyr: 1.983 ± 0.273
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.949 ± 1.376
LeuCys: 1.983 ± 0.522
LeuAsp: 4.214 ± 1.655
LeuGlu: 4.958 ± 1.233
LeuPhe: 3.966 ± 0.724
LeuGly: 3.718 ± 0.767
LeuHis: 2.975 ± 1.03
LeuIle: 6.197 ± 2.136
LeuLys: 6.693 ± 1.938
LeuLeu: 7.933 ± 1.448
LeuMet: 2.975 ± 0.831
LeuAsn: 3.223 ± 0.779
LeuPro: 3.966 ± 2.018
LeuGln: 3.223 ± 1.082
LeuArg: 8.18 ± 0.776
LeuSer: 6.197 ± 0.846
LeuThr: 6.693 ± 0.923
LeuVal: 3.966 ± 0.528
LeuTrp: 0.496 ± 0.145
LeuTyr: 2.727 ± 0.493
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.983 ± 0.522
MetCys: 0.992 ± 0.29
MetAsp: 3.471 ± 1.402
MetGlu: 2.727 ± 1.123
MetPhe: 1.487 ± 0.638
MetGly: 1.735 ± 0.792
MetHis: 0.496 ± 0.54
MetIle: 2.727 ± 0.478
MetLys: 1.983 ± 0.495
MetLeu: 2.727 ± 0.787
MetMet: 2.727 ± 0.482
MetAsn: 1.239 ± 0.486
MetPro: 1.487 ± 0.41
MetGln: 0.992 ± 0.339
MetArg: 1.487 ± 0.41
MetSer: 1.983 ± 0.971
MetThr: 1.735 ± 0.453
MetVal: 1.239 ± 0.787
MetTrp: 0.248 ± 0.22
MetTyr: 0.992 ± 0.339
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.487 ± 0.656
AsnCys: 0.992 ± 0.44
AsnAsp: 1.735 ± 1.206
AsnGlu: 1.735 ± 0.792
AsnPhe: 1.983 ± 0.579
AsnGly: 1.735 ± 0.358
AsnHis: 1.487 ± 0.329
AsnIle: 1.735 ± 0.529
AsnLys: 2.975 ± 0.384
AsnLeu: 5.206 ± 1.824
AsnMet: 0.744 ± 0.364
AsnAsn: 0.992 ± 0.737
AsnPro: 2.727 ± 0.981
AsnGln: 1.487 ± 0.675
AsnArg: 2.479 ± 0.356
AsnSer: 4.214 ± 0.851
AsnThr: 1.735 ± 0.887
AsnVal: 2.727 ± 0.761
AsnTrp: 0.496 ± 0.563
AsnTyr: 1.735 ± 0.453
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.479 ± 0.377
ProCys: 0.0 ± 0.0
ProAsp: 3.471 ± 0.761
ProGlu: 2.231 ± 0.508
ProPhe: 1.983 ± 0.579
ProGly: 4.462 ± 1.285
ProHis: 0.744 ± 0.472
ProIle: 0.992 ± 0.29
ProLys: 1.239 ± 0.322
ProLeu: 3.223 ± 0.613
ProMet: 0.744 ± 0.428
ProAsn: 1.487 ± 0.477
ProPro: 0.992 ± 0.63
ProGln: 0.992 ± 0.308
ProArg: 2.231 ± 0.449
ProSer: 3.966 ± 0.732
ProThr: 1.487 ± 0.252
ProVal: 2.975 ± 0.731
ProTrp: 1.735 ± 0.587
ProTyr: 1.239 ± 0.323
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.487 ± 1.085
GlnCys: 1.239 ± 0.323
GlnAsp: 2.727 ± 0.926
GlnGlu: 2.479 ± 0.304
GlnPhe: 0.496 ± 0.44
GlnGly: 1.983 ± 0.281
GlnHis: 1.239 ± 0.486
GlnIle: 1.735 ± 0.54
GlnLys: 2.727 ± 0.708
GlnLeu: 4.462 ± 2.292
GlnMet: 1.735 ± 0.792
GlnAsn: 0.248 ± 0.22
GlnPro: 1.239 ± 0.43
GlnGln: 1.239 ± 0.323
GlnArg: 1.735 ± 0.358
GlnSer: 1.983 ± 1.26
GlnThr: 1.487 ± 0.49
GlnVal: 2.727 ± 0.782
GlnTrp: 0.248 ± 0.157
GlnTyr: 0.496 ± 0.44
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.214 ± 1.003
ArgCys: 2.231 ± 1.013
ArgAsp: 3.718 ± 1.121
ArgGlu: 4.214 ± 0.667
ArgPhe: 1.487 ± 0.458
ArgGly: 3.966 ± 1.962
ArgHis: 0.744 ± 0.208
ArgIle: 3.966 ± 0.555
ArgLys: 3.471 ± 0.833
ArgLeu: 4.71 ± 0.606
ArgMet: 1.983 ± 0.947
ArgAsn: 2.479 ± 1.025
ArgPro: 2.231 ± 0.623
ArgGln: 1.239 ± 0.486
ArgArg: 2.727 ± 0.519
ArgSer: 2.975 ± 0.43
ArgThr: 3.223 ± 0.639
ArgVal: 4.958 ± 1.374
ArgTrp: 0.496 ± 0.315
ArgTyr: 2.479 ± 0.377
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.958 ± 1.323
SerCys: 3.223 ± 1.914
SerAsp: 5.949 ± 1.208
SerGlu: 4.71 ± 1.038
SerPhe: 3.966 ± 0.946
SerGly: 5.949 ± 1.538
SerHis: 2.975 ± 0.714
SerIle: 4.71 ± 1.304
SerLys: 5.206 ± 1.454
SerLeu: 6.941 ± 1.446
SerMet: 2.727 ± 1.104
SerAsn: 3.718 ± 0.88
SerPro: 1.983 ± 0.678
SerGln: 3.471 ± 0.717
SerArg: 4.462 ± 0.263
SerSer: 10.164 ± 1.266
SerThr: 4.958 ± 0.871
SerVal: 3.966 ± 1.043
SerTrp: 2.231 ± 0.591
SerTyr: 2.231 ± 0.449
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.983 ± 0.579
ThrCys: 1.487 ± 0.415
ThrAsp: 3.223 ± 0.639
ThrGlu: 3.471 ± 0.887
ThrPhe: 1.983 ± 0.971
ThrGly: 3.223 ± 0.472
ThrHis: 0.992 ± 0.29
ThrIle: 2.975 ± 0.505
ThrLys: 2.479 ± 0.36
ThrLeu: 6.445 ± 1.13
ThrMet: 1.487 ± 0.415
ThrAsn: 1.735 ± 0.453
ThrPro: 2.479 ± 0.724
ThrGln: 2.479 ± 0.304
ThrArg: 3.718 ± 0.741
ThrSer: 6.693 ± 1.88
ThrThr: 2.479 ± 0.643
ThrVal: 3.718 ± 0.576
ThrTrp: 0.0 ± 0.0
ThrTyr: 0.496 ± 0.315
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.462 ± 1.165
ValCys: 1.983 ± 0.517
ValAsp: 4.71 ± 1.304
ValGlu: 4.958 ± 1.694
ValPhe: 1.487 ± 0.435
ValGly: 3.966 ± 0.859
ValHis: 1.983 ± 0.617
ValIle: 3.966 ± 0.701
ValLys: 4.462 ± 0.715
ValLeu: 3.966 ± 0.588
ValMet: 1.239 ± 0.787
ValAsn: 2.231 ± 0.678
ValPro: 0.744 ± 0.66
ValGln: 2.231 ± 1.06
ValArg: 3.966 ± 0.858
ValSer: 8.18 ± 0.692
ValThr: 4.214 ± 0.941
ValVal: 4.214 ± 1.0
ValTrp: 1.487 ± 0.711
ValTyr: 2.231 ± 0.508
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.239 ± 0.825
TrpCys: 0.248 ± 0.157
TrpAsp: 0.248 ± 0.22
TrpGlu: 0.744 ± 0.472
TrpPhe: 0.744 ± 0.364
TrpGly: 0.496 ± 0.315
TrpHis: 0.248 ± 0.22
TrpIle: 0.992 ± 0.551
TrpLys: 0.744 ± 0.548
TrpLeu: 0.496 ± 0.315
TrpMet: 0.992 ± 0.339
TrpAsn: 0.496 ± 0.145
TrpPro: 0.744 ± 0.364
TrpGln: 0.0 ± 0.0
TrpArg: 0.248 ± 0.157
TrpSer: 0.992 ± 0.308
TrpThr: 1.735 ± 0.883
TrpVal: 1.983 ± 0.405
TrpTrp: 0.248 ± 0.157
TrpTyr: 0.248 ± 0.157
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.239 ± 0.613
TyrCys: 0.248 ± 0.157
TyrAsp: 1.735 ± 0.529
TyrGlu: 1.735 ± 0.685
TyrPhe: 0.744 ± 0.472
TyrGly: 2.479 ± 0.631
TyrHis: 0.992 ± 0.29
TyrIle: 1.983 ± 0.273
TyrLys: 1.487 ± 0.945
TyrLeu: 3.223 ± 0.659
TyrMet: 0.496 ± 0.54
TyrAsn: 1.487 ± 0.945
TyrPro: 1.239 ± 0.613
TyrGln: 0.744 ± 0.63
TyrArg: 0.744 ± 0.338
TyrSer: 2.727 ± 0.812
TyrThr: 1.239 ± 0.486
TyrVal: 2.231 ± 0.812
TyrTrp: 0.248 ± 0.22
TyrTyr: 0.744 ± 0.208
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (4035 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
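A protein-level bootstrap of this kind could look roughly like the sketch below. It is an assumption-laden illustration rather than the procedure actually used: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each of 100 replicates, and that the reported error is the standard deviation across replicates. The function names `pair_per_mille` and `bootstrap_error` are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the (population) standard deviation of the
    frequency over n_rounds bootstrap replicates.
    """
    rng = random.Random(seed)  # fixed seed for reproducibility
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MKSSLLA", "MSSAK", "MLLSSSR", "MKAAL"]  # toy data
    value = pair_per_mille(toy_proteome, "SS")
    error = bootstrap_error(toy_proteome, "SS")
    print(f"SerSer: {value:.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so with only a few proteins the error reflects how much a single protein can shift the overall frequency.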

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski