scalability

Need recommendations on pushing the envelope with SqlBulkCopy on SQL Server

痞子三分冷 提交于 2019-12-04 10:52:50
I am designing an application, one aspect of which is that it is supposed to be able to receive massive amounts of data into SQL database. I designed the database stricture as a single table with bigint identity, something like this one: CREATE TABLE MainTable ( _id bigint IDENTITY(1,1) NOT NULL PRIMARY KEY CLUSTERED, field1, field2, ... ) I will omit how am I intending to perform queries, since it is irrelevant to the question I have. I have written a prototype, which inserts data into this table using SqlBulkCopy. It seemed to work very well in the lab. I was able to insert tens of millions

Counting socket.io users across horizontal servers

丶灬走出姿态 提交于 2019-12-04 09:45:19
问题 I have multiple socket.io servers scaled horizontally using a redisstore. I've got rooms setup effectively and am successfully able to broadcast to rooms across servers, etc. Now I'm trying to build a status page and what I'm failing on figuring out is how to simply count the number of users connected across all servers. io.sockets.clients('room') and io.sockets.sockets will only tell you the number of connected clients on that one server, not all servers connected to the same RedisStore.

Engineering scalability into an application

可紊 提交于 2019-12-04 09:01:43
问题 What does it mean to say - Engineering scalability into applications. Are there design patterns that would make an application more scalable? This question is mainly in the context of web applications or SOA middleware based applications. 回答1: When I think about "large scale applications" I think of three very different things: Applications that will run across a large scale-out cluster (much larger than 1024 cores). Applications that will deal with data sets that are much larger than

Books for Building Scalable Web Applications? (DB Performance/Tuning, Networking, General Performance, etc.) [closed]

喜欢而已 提交于 2019-12-04 08:37:39
After finishing school in Computer Science and entering the "real world" as a software engineer working on web applications, I've become overwhelmed by the amount of information to be learned about scaling web applications properly. Some topics/questions that have recently popped up for me: RDBMS's vs. unstructured data storage. Advantages and disadvantages of sharding for databases, search indexes, etc. Which network file systems scale? Which don't? The cost of HTTP and DB connections. Static content delivery, why not to store images in a database, etc. Why is it better to use one thread pool

socket: Too many open files (24) apache bench lighttpd

眉间皱痕 提交于 2019-12-04 07:40:58
问题 When I start Apache Bench test: ab -n 10000 -c 1300 http://domain.com/test.php I get error: socket: Too many open files (24) When i change to '-c 1000' it works fine. Because I can have more than 1000 concurrent users I would like to fix socket too many open files problem or increase parameter. How to do this and where? I use lighttpd on centos 5. 回答1: ulimit -n 10000 That might not work depending on you system configuration Consult this to configure your system. 回答2: to permernent change max

No PHP for large projects? Why not?

淺唱寂寞╮ 提交于 2019-12-04 07:33:52
问题 I've read a few posts where people have stated (not suggested, not discussed, not offered) that PHP should not be used for large projects. Being a primarily PHP developer, I ask two questions: What defines a "large project"? Why not? What are the pitfalls of using PHP I run a small development team and I know from experience the quality construction, organization, documentation, commenting and encapsulation are our highest priority. We are able to develop great projects using our own

What is the difference between scalability and elasticity?

 ̄綄美尐妖づ 提交于 2019-12-04 07:29:44
问题 I've heard many people using both terms interchangeably. However, in my opinion there is difference between them: SCALABILITY - ability of a system to increase the workload on its current hardware resources ( scale up ); ELASTICITY - ability of a system to increase the workload on its current and additional (dynamically added on demand) hardware resources ( scale out ); Elasticity is strongly related to deployed-on-cloud applications. Have I got the difference between these two non-functional

What is considered a good response time for a dynamic, personalized web application? [closed]

两盒软妹~` 提交于 2019-12-04 07:25:33
问题 As it currently stands, this question is not a good fit for our Q&A format. We expect answers to be supported by facts, references, or expertise, but this question will likely solicit debate, arguments, polling, or extended discussion. If you feel that this question can be improved and possibly reopened, visit the help center for guidance. Closed 7 years ago . For a complex web application that includes dynamic content and personalization, what is a good response time from the server (so

How to scale psycopg2 insert and select with single process in python?

家住魔仙堡 提交于 2019-12-04 06:14:12
问题 It takes average of about 0.300095081329 for my insert to go through to finish commit to postgres. Here is my table pattern id_table latest_update_id (primary index) product_id (index) publish_date product_meta_table latest_update_id (index) product_id (index) meta_related_info1 meta_related_info2 ...etc product_table latest_update_id (index) product_id (index) note_related_info1 note_related_info2 ....etc Here are some of my inserts db_cursor.execute("INSERT INTO id_table (product_id,

Reasons for NOT scaling-up vs. -out?

旧时模样 提交于 2019-12-04 05:55:31
As a programmer I make revolutionary findings every few years. I'm either ahead of the curve, or behind it by about π in the phase. One hard lesson I learned was that scaling OUT is not always better, quite often the biggest performance gains are when we regrouped and scaled up. What reasons to you have for scaling out vs. up? Price, performance, vision, projected usage? If so, how did this work for you? We once scaled out to several hundred nodes that would serialize and cache necessary data out to each node and run maths processes on the records. Many, many billions of records needed to be