Amino acid dipeptide frequency for Nephila clavipes virus 4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
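As an illustration, the counting can be sketched in a few lines of Python. This is a minimal sketch, not the code used to build this page: the function name dipeptide_per_mille, the mapping of any non-standard symbol to Xaa, and the normalization by the total number of dipeptides are assumptions, and the two input sequences are made-up toy data.

```python
from collections import Counter
from itertools import product

# Three-letter codes in the column order used by the table; "X" stands for Xaa.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr", "X": "Xaa",
}

def dipeptide_per_mille(proteins):
    """Return {'AlaCys': frequency in per mille, ...} for all 441 pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: any symbol outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in ONE_TO_THREE else "X" for c in seq.upper())
        # Overlapping pairs; a pair never spans two different proteins.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return {
        ONE_TO_THREE[a] + ONE_TO_THREE[b]:
            (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ONE_TO_THREE, repeat=2)
    }

# Toy input (hypothetical sequences, not the viral proteome).
freqs = dipeptide_per_mille(["MKFLLA", "FLSB"])
```

Here freqs["PheLeu"] is the per mille frequency of Phe followed by Leu in the toy input; the non-standard symbol B in the second sequence is counted as Xaa.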

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
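The arithmetic behind the expected value and the colour coding can be sketched as follows. This is an illustrative sketch only: the function name classify and the choice of 400 (rather than 441) possibilities as the default threshold are assumptions, since the page does not state which threshold is used for colouring.

```python
# Expected per mille frequency if every dipeptide were equally likely.
EXPECTED_400 = 1000.0 / 400   # 2.5 per mille (0.25%), 20 standard residues
EXPECTED_441 = 1000.0 / 441   # about 2.27 per mille when Xaa is included

def classify(per_mille, expected=EXPECTED_400):
    """Label a table value relative to the uniform expectation."""
    if per_mille > expected:
        return "over"      # would be shown in red
    if per_mille < expected:
        return "under"     # would be shown in blue
    return "expected"

# Values copied from the table on this page.
examples = {"PheLeu": 12.16, "LeuLeu": 12.16, "AlaCys": 0.517, "TrpTrp": 0.0}
labels = {pair: classify(value) for pair, value in examples.items()}
```

With these inputs PheLeu and LeuLeu are labelled "over", while AlaCys and TrpTrp are labelled "under".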

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
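For example, the unit conversion can be written as below. The helper names are hypothetical; the input value is the LeuLeu entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value * 0.1

leu_leu = 12.16  # LeuLeu entry from the table, in per mille
fraction = per_mille_to_fraction(leu_leu)
percent = per_mille_to_percent(leu_leu)
```

So a table value of 12.16‰ corresponds to a fraction of about 0.01216, or roughly 1.2% of all dipeptides.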

For more information, see the sequence space article on Wikipedia.

Columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.07AlaAla: 2.07 ± 0.603
0.517AlaCys: 0.517 ± 0.265
2.846AlaAsp: 2.846 ± 0.259
2.587AlaGlu: 2.587 ± 1.249
3.622AlaPhe: 3.622 ± 1.46
2.587AlaGly: 2.587 ± 1.249
0.517AlaHis: 0.517 ± 0.695
3.364AlaIle: 3.364 ± 1.025
2.587AlaLys: 2.587 ± 0.46
3.364AlaLeu: 3.364 ± 1.359
1.035AlaMet: 1.035 ± 0.53
2.07AlaAsn: 2.07 ± 0.408
1.035AlaPro: 1.035 ± 0.704
0.776AlaGln: 0.776 ± 0.646
1.294AlaArg: 1.294 ± 0.254
2.07AlaSer: 2.07 ± 0.408
2.587AlaThr: 2.587 ± 0.362
4.14AlaVal: 4.14 ± 1.111
0.0AlaTrp: 0.0 ± 0.0
2.07AlaTyr: 2.07 ± 0.575
0.0AlaXaa: 0.0 ± 0.0
Cys
2.07CysAla: 2.07 ± 0.575
0.0CysCys: 0.0 ± 0.0
1.035CysAsp: 1.035 ± 0.53
0.776CysGlu: 0.776 ± 0.397
0.517CysPhe: 0.517 ± 0.265
1.552CysGly: 1.552 ± 0.499
0.776CysHis: 0.776 ± 0.781
1.035CysIle: 1.035 ± 1.228
0.776CysLys: 0.776 ± 0.816
3.622CysLeu: 3.622 ± 0.85
1.035CysMet: 1.035 ± 0.621
3.622CysAsn: 3.622 ± 0.699
2.329CysPro: 2.329 ± 1.791
0.0CysGln: 0.0 ± 0.0
0.776CysArg: 0.776 ± 1.343
1.811CysSer: 1.811 ± 0.443
0.776CysThr: 0.776 ± 0.25
2.07CysVal: 2.07 ± 1.349
0.0CysTrp: 0.0 ± 0.0
1.294CysTyr: 1.294 ± 0.662
0.0CysXaa: 0.0 ± 0.0
Asp
2.587AspAla: 2.587 ± 1.086
2.329AspCys: 2.329 ± 1.192
4.657AspAsp: 4.657 ± 1.34
2.07AspGlu: 2.07 ± 0.575
6.468AspPhe: 6.468 ± 1.543
1.294AspGly: 1.294 ± 0.662
0.517AspHis: 0.517 ± 0.337
6.986AspIle: 6.986 ± 1.352
1.811AspLys: 1.811 ± 0.455
7.503AspLeu: 7.503 ± 3.839
1.294AspMet: 1.294 ± 0.988
4.657AspAsn: 4.657 ± 0.227
1.294AspPro: 1.294 ± 0.662
1.035AspGln: 1.035 ± 0.53
3.881AspArg: 3.881 ± 1.028
4.916AspSer: 4.916 ± 0.826
2.846AspThr: 2.846 ± 0.259
3.881AspVal: 3.881 ± 1.028
0.259AspTrp: 0.259 ± 0.132
2.07AspTyr: 2.07 ± 0.575
0.0AspXaa: 0.0 ± 0.0
Glu
1.035GluAla: 1.035 ± 0.53
1.811GluCys: 1.811 ± 0.446
2.329GluAsp: 2.329 ± 1.192
3.105GluGlu: 3.105 ± 0.196
4.398GluPhe: 4.398 ± 0.648
1.811GluGly: 1.811 ± 0.455
0.517GluHis: 0.517 ± 0.265
3.881GluIle: 3.881 ± 1.028
3.105GluLys: 3.105 ± 1.022
5.433GluLeu: 5.433 ± 1.328
0.776GluMet: 0.776 ± 0.25
3.105GluAsn: 3.105 ± 1.183
2.07GluPro: 2.07 ± 0.575
0.517GluGln: 0.517 ± 0.265
1.035GluArg: 1.035 ± 0.53
0.776GluSer: 0.776 ± 0.397
2.329GluThr: 2.329 ± 1.137
2.846GluVal: 2.846 ± 1.108
0.0GluTrp: 0.0 ± 0.0
3.105GluTyr: 3.105 ± 1.589
0.0GluXaa: 0.0 ± 0.0
Phe
2.07PheAla: 2.07 ± 1.349
2.329PheCys: 2.329 ± 0.414
4.657PheAsp: 4.657 ± 0.828
1.811PheGlu: 1.811 ± 0.73
7.245PhePhe: 7.245 ± 2.493
3.364PheGly: 3.364 ± 0.211
2.846PheHis: 2.846 ± 0.701
3.105PheIle: 3.105 ± 2.113
3.622PheLys: 3.622 ± 0.699
12.16PheLeu: 12.16 ± 1.391
1.552PheMet: 1.552 ± 0.644
4.657PheAsn: 4.657 ± 1.098
3.881PhePro: 3.881 ± 1.302
0.776PheGln: 0.776 ± 0.397
2.329PheArg: 2.329 ± 0.479
11.384PheSer: 11.384 ± 3.813
5.951PheThr: 5.951 ± 1.936
6.727PheVal: 6.727 ± 3.029
0.259PheTrp: 0.259 ± 0.132
4.14PheTyr: 4.14 ± 1.185
0.0PheXaa: 0.0 ± 0.0
Gly
1.035GlyAla: 1.035 ± 0.53
1.035GlyCys: 1.035 ± 0.214
3.622GlyAsp: 3.622 ± 0.891
0.776GlyGlu: 0.776 ± 0.25
2.587GlyPhe: 2.587 ± 0.69
3.622GlyGly: 3.622 ± 1.345
0.776GlyHis: 0.776 ± 0.25
2.329GlyIle: 2.329 ± 0.87
2.329GlyLys: 2.329 ± 0.7
4.14GlyLeu: 4.14 ± 0.045
0.0GlyMet: 0.0 ± 0.0
3.364GlyAsn: 3.364 ± 1.215
0.517GlyPro: 0.517 ± 0.695
1.294GlyGln: 1.294 ± 0.601
3.364GlyArg: 3.364 ± 1.025
3.364GlySer: 3.364 ± 0.658
2.587GlyThr: 2.587 ± 0.833
3.364GlyVal: 3.364 ± 1.298
0.259GlyTrp: 0.259 ± 0.132
2.07GlyTyr: 2.07 ± 1.059
0.0GlyXaa: 0.0 ± 0.0
His
1.035HisAla: 1.035 ± 0.621
0.517HisCys: 0.517 ± 0.337
0.776HisAsp: 0.776 ± 0.397
0.776HisGlu: 0.776 ± 0.397
1.035HisPhe: 1.035 ± 0.214
0.259HisGly: 0.259 ± 0.132
0.776HisHis: 0.776 ± 0.781
0.776HisIle: 0.776 ± 0.397
0.776HisLys: 0.776 ± 0.816
2.587HisLeu: 2.587 ± 1.157
0.0HisMet: 0.0 ± 0.0
0.776HisAsn: 0.776 ± 0.397
1.035HisPro: 1.035 ± 1.118
1.294HisGln: 1.294 ± 0.662
1.035HisArg: 1.035 ± 0.53
2.587HisSer: 2.587 ± 0.508
1.552HisThr: 1.552 ± 0.344
1.552HisVal: 1.552 ± 0.344
0.0HisTrp: 0.0 ± 0.0
0.776HisTyr: 0.776 ± 0.25
0.0HisXaa: 0.0 ± 0.0
Ile
3.622IleAla: 3.622 ± 4.056
1.294IleCys: 1.294 ± 0.601
5.692IleAsp: 5.692 ± 1.91
3.622IleGlu: 3.622 ± 0.85
8.538IlePhe: 8.538 ± 1.103
1.811IleGly: 1.811 ± 0.913
1.552IleHis: 1.552 ± 0.499
3.622IleIle: 3.622 ± 3.268
4.14IleLys: 4.14 ± 1.15
4.657IleLeu: 4.657 ± 1.497
0.517IleMet: 0.517 ± 0.265
4.657IleAsn: 4.657 ± 1.031
2.846IlePro: 2.846 ± 0.955
0.259IleGln: 0.259 ± 0.132
3.622IleArg: 3.622 ± 1.066
7.503IleSer: 7.503 ± 1.445
1.811IleThr: 1.811 ± 0.709
4.398IleVal: 4.398 ± 1.082
0.259IleTrp: 0.259 ± 0.132
2.846IleTyr: 2.846 ± 0.701
0.0IleXaa: 0.0 ± 0.0
Lys
1.552LysAla: 1.552 ± 0.655
0.776LysCys: 0.776 ± 0.781
3.881LysAsp: 3.881 ± 1.028
3.364LysGlu: 3.364 ± 1.721
5.951LysPhe: 5.951 ± 1.149
1.552LysGly: 1.552 ± 0.344
0.259LysHis: 0.259 ± 0.132
5.175LysIle: 5.175 ± 1.034
4.657LysLys: 4.657 ± 1.34
5.951LysLeu: 5.951 ± 0.671
2.07LysMet: 2.07 ± 0.782
4.916LysAsn: 4.916 ± 0.826
2.846LysPro: 2.846 ± 0.259
1.294LysGln: 1.294 ± 0.254
2.329LysArg: 2.329 ± 0.451
3.881LysSer: 3.881 ± 0.969
3.364LysThr: 3.364 ± 0.737
2.329LysVal: 2.329 ± 0.414
0.0LysTrp: 0.0 ± 0.0
2.587LysTyr: 2.587 ± 1.324
0.0LysXaa: 0.0 ± 0.0
Leu
3.881LeuAla: 3.881 ± 1.476
2.07LeuCys: 2.07 ± 0.825
6.986LeuAsp: 6.986 ± 0.493
7.762LeuGlu: 7.762 ± 0.881
6.727LeuPhe: 6.727 ± 4.851
4.14LeuGly: 4.14 ± 1.15
1.811LeuHis: 1.811 ± 0.455
8.538LeuIle: 8.538 ± 1.902
7.503LeuLys: 7.503 ± 2.352
12.16LeuLeu: 12.16 ± 1.408
2.329LeuMet: 2.329 ± 0.966
6.468LeuAsn: 6.468 ± 1.294
4.14LeuPro: 4.14 ± 1.15
2.329LeuGln: 2.329 ± 0.479
4.657LeuArg: 4.657 ± 1.34
8.797LeuSer: 8.797 ± 2.679
4.14LeuThr: 4.14 ± 0.856
7.762LeuVal: 7.762 ± 2.665
1.035LeuTrp: 1.035 ± 0.53
4.14LeuTyr: 4.14 ± 2.118
0.0LeuXaa: 0.0 ± 0.0
Met
2.587MetAla: 2.587 ± 1.534
1.035MetCys: 1.035 ± 0.214
0.776MetAsp: 0.776 ± 0.397
0.259MetGlu: 0.259 ± 0.132
1.294MetPhe: 1.294 ± 0.254
1.811MetGly: 1.811 ± 0.709
0.517MetHis: 0.517 ± 0.265
1.035MetIle: 1.035 ± 0.621
0.517MetLys: 0.517 ± 0.265
2.07MetLeu: 2.07 ± 1.242
0.259MetMet: 0.259 ± 0.132
1.035MetAsn: 1.035 ± 0.214
0.259MetPro: 0.259 ± 0.132
0.517MetGln: 0.517 ± 0.265
1.552MetArg: 1.552 ± 0.344
1.035MetSer: 1.035 ± 0.214
0.517MetThr: 0.517 ± 0.265
1.035MetVal: 1.035 ± 0.621
0.259MetTrp: 0.259 ± 0.132
0.776MetTyr: 0.776 ± 0.781
0.0MetXaa: 0.0 ± 0.0
Asn
1.294AsnAla: 1.294 ± 0.662
3.364AsnCys: 3.364 ± 0.658
2.329AsnAsp: 2.329 ± 0.7
2.07AsnGlu: 2.07 ± 0.575
7.245AsnPhe: 7.245 ± 3.17
2.846AsnGly: 2.846 ± 0.59
2.329AsnHis: 2.329 ± 0.7
4.398AsnIle: 4.398 ± 1.569
4.398AsnLys: 4.398 ± 1.781
6.21AsnLeu: 6.21 ± 1.97
1.294AsnMet: 1.294 ± 0.564
4.657AsnAsn: 4.657 ± 0.382
2.07AsnPro: 2.07 ± 0.428
1.035AsnGln: 1.035 ± 0.53
3.622AsnArg: 3.622 ± 0.293
5.692AsnSer: 5.692 ± 0.756
2.329AsnThr: 2.329 ± 0.451
4.398AsnVal: 4.398 ± 1.058
0.259AsnTrp: 0.259 ± 0.132
1.811AsnTyr: 1.811 ± 0.446
0.0AsnXaa: 0.0 ± 0.0
Pro
1.294ProAla: 1.294 ± 0.662
0.776ProCys: 0.776 ± 0.781
2.07ProAsp: 2.07 ± 1.349
1.811ProGlu: 1.811 ± 0.455
2.07ProPhe: 2.07 ± 0.825
1.811ProGly: 1.811 ± 0.927
0.517ProHis: 0.517 ± 0.337
3.364ProIle: 3.364 ± 1.178
2.07ProLys: 2.07 ± 1.64
3.622ProLeu: 3.622 ± 1.419
1.035ProMet: 1.035 ± 0.53
1.294ProAsn: 1.294 ± 0.988
1.811ProPro: 1.811 ± 2.009
0.776ProGln: 0.776 ± 0.397
1.552ProArg: 1.552 ± 0.499
5.692ProSer: 5.692 ± 1.608
1.294ProThr: 1.294 ± 0.254
4.657ProVal: 4.657 ± 1.505
0.259ProTrp: 0.259 ± 0.132
2.07ProTyr: 2.07 ± 0.408
0.0ProXaa: 0.0 ± 0.0
Gln
1.035GlnAla: 1.035 ± 0.214
0.0GlnCys: 0.0 ± 0.0
1.035GlnAsp: 1.035 ± 0.53
0.259GlnGlu: 0.259 ± 0.764
1.811GlnPhe: 1.811 ± 0.709
1.552GlnGly: 1.552 ± 0.655
0.776GlnHis: 0.776 ± 0.397
1.035GlnIle: 1.035 ± 0.675
1.294GlnLys: 1.294 ± 0.662
2.07GlnLeu: 2.07 ± 0.575
0.259GlnMet: 0.259 ± 0.132
1.552GlnAsn: 1.552 ± 0.499
0.259GlnPro: 0.259 ± 0.132
1.035GlnGln: 1.035 ± 0.621
1.294GlnArg: 1.294 ± 0.662
1.035GlnSer: 1.035 ± 0.53
0.776GlnThr: 0.776 ± 0.646
0.259GlnVal: 0.259 ± 0.132
0.0GlnTrp: 0.0 ± 0.0
0.776GlnTyr: 0.776 ± 0.397
0.0GlnXaa: 0.0 ± 0.0
Arg
2.846ArgAla: 2.846 ± 1.108
1.035ArgCys: 1.035 ± 0.214
3.364ArgAsp: 3.364 ± 1.215
2.587ArgGlu: 2.587 ± 1.324
3.881ArgPhe: 3.881 ± 1.028
1.035ArgGly: 1.035 ± 0.621
1.035ArgHis: 1.035 ± 0.53
4.14ArgIle: 4.14 ± 0.523
3.105ArgLys: 3.105 ± 1.589
3.105ArgLeu: 3.105 ± 1.681
0.776ArgMet: 0.776 ± 0.397
3.364ArgAsn: 3.364 ± 1.402
1.811ArgPro: 1.811 ± 1.454
0.259ArgGln: 0.259 ± 0.448
1.552ArgArg: 1.552 ± 0.794
2.846ArgSer: 2.846 ± 2.655
2.329ArgThr: 2.329 ± 0.451
2.587ArgVal: 2.587 ± 0.967
0.259ArgTrp: 0.259 ± 0.132
2.07ArgTyr: 2.07 ± 0.575
0.0ArgXaa: 0.0 ± 0.0
Ser
3.881SerAla: 3.881 ± 0.841
2.846SerCys: 2.846 ± 1.809
4.916SerAsp: 4.916 ± 1.138
4.14SerGlu: 4.14 ± 0.523
7.762SerPhe: 7.762 ± 3.99
5.175SerGly: 5.175 ± 1.818
1.294SerHis: 1.294 ± 0.662
4.398SerIle: 4.398 ± 1.135
6.21SerLys: 6.21 ± 1.725
6.21SerLeu: 6.21 ± 0.867
1.294SerMet: 1.294 ± 0.662
3.364SerAsn: 3.364 ± 2.006
2.587SerPro: 2.587 ± 0.833
2.07SerGln: 2.07 ± 0.782
3.622SerArg: 3.622 ± 1.427
6.468SerSer: 6.468 ± 1.364
6.727SerThr: 6.727 ± 1.473
6.21SerVal: 6.21 ± 1.375
0.0SerTrp: 0.0 ± 0.0
3.881SerTyr: 3.881 ± 1.302
0.0SerXaa: 0.0 ± 0.0
Thr
2.329ThrAla: 2.329 ± 1.937
0.517ThrCys: 0.517 ± 0.337
2.587ThrAsp: 2.587 ± 0.827
1.035ThrGlu: 1.035 ± 0.214
4.916ThrPhe: 4.916 ± 1.456
1.294ThrGly: 1.294 ± 0.254
0.259ThrHis: 0.259 ± 0.132
3.881ThrIle: 3.881 ± 0.762
4.398ThrLys: 4.398 ± 0.097
6.727ThrLeu: 6.727 ± 1.874
1.035ThrMet: 1.035 ± 0.214
2.846ThrAsn: 2.846 ± 0.536
1.294ThrPro: 1.294 ± 0.624
0.517ThrGln: 0.517 ± 0.265
2.07ThrArg: 2.07 ± 0.575
2.846ThrSer: 2.846 ± 0.955
3.364ThrThr: 3.364 ± 3.349
6.468ThrVal: 6.468 ± 2.14
1.035ThrTrp: 1.035 ± 0.214
2.329ThrTyr: 2.329 ± 0.414
0.0ThrXaa: 0.0 ± 0.0
Val
3.364ValAla: 3.364 ± 0.737
2.587ValCys: 2.587 ± 0.833
5.433ValAsp: 5.433 ± 1.162
2.587ValGlu: 2.587 ± 0.46
4.916ValPhe: 4.916 ± 2.503
3.105ValGly: 3.105 ± 1.029
1.294ValHis: 1.294 ± 0.254
3.364ValIle: 3.364 ± 1.025
2.587ValLys: 2.587 ± 1.086
10.608ValLeu: 10.608 ± 2.434
1.811ValMet: 1.811 ± 0.443
5.175ValAsn: 5.175 ± 1.016
5.951ValPro: 5.951 ± 1.813
1.035ValGln: 1.035 ± 0.53
2.846ValArg: 2.846 ± 0.536
4.657ValSer: 4.657 ± 0.94
3.622ValThr: 3.622 ± 0.891
6.727ValVal: 6.727 ± 1.473
0.259ValTrp: 0.259 ± 0.132
3.622ValTyr: 3.622 ± 0.304
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.517TrpCys: 0.517 ± 0.265
0.0TrpAsp: 0.0 ± 0.0
0.259TrpGlu: 0.259 ± 0.132
0.517TrpPhe: 0.517 ± 0.337
0.259TrpGly: 0.259 ± 0.132
0.0TrpHis: 0.0 ± 0.0
0.776TrpIle: 0.776 ± 0.397
0.259TrpLys: 0.259 ± 0.132
1.035TrpLeu: 1.035 ± 0.53
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
0.776TrpVal: 0.776 ± 0.397
0.0TrpTrp: 0.0 ± 0.0
0.259TrpTyr: 0.259 ± 0.132
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.552TyrAla: 1.552 ± 0.794
0.517TyrCys: 0.517 ± 0.265
3.364TyrAsp: 3.364 ± 0.795
2.329TyrGlu: 2.329 ± 0.414
3.364TyrPhe: 3.364 ± 0.937
1.552TyrGly: 1.552 ± 0.344
1.552TyrHis: 1.552 ± 0.511
2.07TyrIle: 2.07 ± 0.575
2.587TyrLys: 2.587 ± 0.508
4.14TyrLeu: 4.14 ± 0.856
0.517TyrMet: 0.517 ± 0.265
2.329TyrAsn: 2.329 ± 1.192
1.811TyrPro: 1.811 ± 0.446
1.294TyrGln: 1.294 ± 0.254
1.552TyrArg: 1.552 ± 0.655
5.692TyrSer: 5.692 ± 0.621
2.846TyrThr: 2.846 ± 0.59
3.364TyrVal: 3.364 ± 0.795
0.259TyrTrp: 0.259 ± 0.132
3.622TyrTyr: 3.622 ± 0.699
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3866 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
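A protein-level bootstrap of this kind can be sketched as follows. This is a hedged illustration under stated assumptions, not the procedure actually used here: it assumes the error is the standard deviation of the frequency across replicates, that each replicate draws as many proteins as the original set with replacement, and it uses made-up toy sequences. The function names pair_frequency and bootstrap_error are hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_frequency(proteins, pair):
    """Per mille frequency of a one-letter pair such as 'FL'."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_frequency(sample, pair))
    return statistics.pstdev(replicates)

# Hypothetical toy proteome of three short sequences.
toy = ["MKFLLAFL", "MSTVVK", "MFLDDI"]
error = bootstrap_error(toy, "FL")
# With a single protein every resample is identical, so the error is zero.
single = bootstrap_error(["MKFLLAFL"], "FL")
```

Because whole proteins are resampled rather than individual residues, a proteome with only a few proteins can give errors that are large relative to the frequencies themselves.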

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski